Group 6 Solution For Assignment
Group 6 Solution For Assignment
Group Number: 6
RB20042 SACHIN S D
RB20044 Harish M
RB20045 RAJESH P
RB20046 RENGASAMY B
Given:
a) Confidence interval for the population mean no. of early replacement buyers
Confiedence Interval=X ± ¿ ]
0.71
¿ 3.3 ±[ 2.575× ]
√ 800
¿ 3.3 ± 0.065
¿ [3.235 , 3.365]
b) Confidence interval for the population mean no. of Late replacement buyers
Confiedence Interval=X ± ¿ ]
0.66
¿ 4.3 ±[ 2.575 × ]
√500
¿ 4.3 ± 0.076
¿ [4.223 , 4.376]
The confidence interval in both cases contains the population mean. So we are 99% confident
that the bayus classification of “early replacement buyers” & “late replacement buyers” with the given
criteria is valid as the population mean is in the confidence interval.
(ii):
Given:
Criteria In Sec.,
Mean alter time - Manual 15
With AI display panel <8
Confiedence Interval=X ± ¿ ]
1.026
¿ 7.4 ±[ 2.145 × ]
√15
¿ 7.4 ± 0.568
¿ [6.832,7 .968]
b) The interval says the with AI panel, it is 95% confidence that the mean “alert time” is
between 6.832 to 7.968 seconds. Based on the interval the Ralph has strong evidence that the
new panel’s mean alter time is less than 8 seconds.
(iii)
Claim: National motor has equipped the ZX-900 to achieve shorter stopping distance than the
competitor (60ft)
a) Setting hypotheses
H0 : µ ≥ 60 ft Ha : µ < 60 ft
X n σ α
57.8 81 6.02 0.05
X−µ
Z=
σ / √n
57.8−60
Z=
6.02/ √ 81
Z ¿−3.289 , Z0.05 = -1.645
So we can reject the null hypotheses in favour of alternative hypotheses and television network
has the strong evidence to reject the null hypotheses and will be advertising the claim.
(iv):
Claim: Firm’s new button’s mean lifetime exceeds the mean lifetime of the best remote button
currently available in market
a) Setting hypotheses
X n Σ
1241.2 35 110.8
X−µ
Z=
σ / √n
1241.2−1200
Z=
110.8/ √ 35
Z=2.199
X n Σ
1241.2 100 110.8
1241.2−1200
Z=
110.8/ √ 100
Z=3.717
It is evident that the increase in sample size n=100 increases the evidence of H a is true.
So it is very difficult to promote the product to the customer by saying 3.4% improvement in
lifetime.
Also, results of (b) & (c) are important for the promotion of a product where television
networks use those results to confirm the claim.
In both sample sizes above 99% confidant interval we are rejecting the null hypothesis in
favour of alternate hypothesis i.e. increase mean life time which shows the same degree of
practical importance.
X N Σ
1524.6 35 102.8
1524.6−1200
Z=
102.8/ √ 35
Z=18.68
Since our Z value is 18.08 which is greater than 4 , for values above 4 in the Z table the P value is
1.0000 .
Since the alternate hypothesis is assumed for " > " mean lifetime value , the final P value will be 1 -
(1.0000) which is Zero .
Clearly indicating a statistically insignificant result .
But practically , The mean lifetime is more than 25% of the standard best life of button claimed earlier
which is very much visible to the customer .
Question 2
Residuals:
Min 1Q Median 3Q Max
-79.010 -17.411 0.281 12.838 85.577
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) -11.537932 14.929279 -0.773 0.443
square.feet 0.112158 0.009837 11.401 <2e-16 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residuals:
Min 1Q Median 3Q Max
-89.621 -29.950 -6.719 19.660 136.940
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 13.134 25.766 0.510 0.612
rooms 22.561 4.071 5.541 7.67e-07 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Linear Regression Model equation for sale.price (Y) and rooms (X2) is as
follows
Y = 13.134 + 22.561*X2
Residuals:
Min 1Q Median 3Q Max
-74.693 -36.814 -8.048 20.952 151.952
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 31.11 31.80 0.978 0.331932
bedrooms 41.65 10.72 3.885 0.000265 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Linear Regression Model equation for sale.price (Y) and bedrooms (X3) is as
follows
Y = 31.11 + 41.65*X3
As T value is 11.401 and p value is 2.2 e-16 and Using H1:β1≠0, X1 is
significant Price and Square feet have association.
Residuals:
Min 1Q Median 3Q Max
-73.97 -41.36 -14.96 22.94 154.31
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 174.9518 12.1630 14.38 <2e-16 ***
age -0.7138 0.3038 -2.35 0.0222 *
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Linear Regression Model equation for sale.price (Y) and age (X4) is as follows
Y = 174.9518 - 0.7138*X4
As T value is -2.35 and p value is 0.022 and Using H0:β1=0, X1 is significant
Price and age has no association.
t
Variabl valu P
es e value RA2
11.4 2.00E- 0.686
Sqft 01 16 1
5.54 7.67E- 0.334
rooms 1 07 9
bedroo 3.88 0.0002 0.192
ms 5 65 8
- 0.071
age 2.35 0.0222 18
It is conclude that the model of sale price (Y) with Square feet (X1) Y = 13.134
+ 22.561*X2 is the best fir for considering single linear regression model.
Part 2:: Multiple regression model for sales data and it interpretation.
#Multiple regression model
sales.lm.mr= lm(sales.price~square.feet+rooms+bedrooms+age)
summary(sales.lm.mr)
Ans from R studio
Residuals:
Min 1Q Median 3Q Max
-59.14 -14.68 -1.13 15.62 66.08
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 19.48651 16.86523 1.155 0.25291
square.feet 0.10791 0.01228 8.788 4.62e-12 ***
rooms 11.88067 3.51702 3.378 0.00135 **
bedrooms -25.72796 8.13677 -3.162 0.00255 **
age -0.72074 0.15307 -4.709 1.73e-05 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Start: AIC=396.71
sales.price ~ square.feet + rooms + bedrooms + age
Call:
lm(formula = sales.price ~ square.feet + rooms + bedrooms + age)
Coefficients:
(Intercept) square.feet rooms bedrooms age
19.4865 0.1079 11.8807 -25.7280 -0.7207
From above details we conclude that 4 independent variables are significant on determining
the sales prices and it is the best fit model for the regression.
Ans::
Y = 19.4865 + 0.107*X1 + 11.88067*X2 – 25.72796*X3 –0.72074*X4
Y=19.4865 + (0.107*5000) + (11.88067*20) – (25.72796*10) – (0.72074*100)
Question 3:
Part(a)(i) Model to Predict Sales Representative Performance using first
five independent variables:
#Attach Sales File
Saleperform = lm(Sales~Time+MktPoten+Adver+MktShare+Change)
summary(Saleperform)
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) -1.114e+03 4.199e+02 -2.653 0.01571 *
Time 3.612e+00 1.182e+00 3.057 0.00649 **
MktPoten 4.209e-02 6.731e-03 6.253 5.27e-06 ***
Adver 1.289e-01 3.704e-02 3.479 0.00251 **
MktShare 2.570e+02 3.914e+01 6.566 2.76e-06 ***
Change 3.245e+02 1.573e+02 2.063 0.05301 .
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
𝒀=−1114+3.612X1+0.04209X2+0.1289X3+257X4+324.5X5
Part(a)(ii) Model to Predict Sales Representative Performance using all
eight independent variables:
Saleperform2 =
lm(Sales~Time+MktPoten+Adver+MktShare+Change+Accts+WkLoad+Rating)
summary(Saleperform2)
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) -1.508e+03 7.786e+02 -1.936 0.070675 .
Time 2.010e+00 1.931e+00 1.041 0.313408
MktPoten 3.721e-02 8.202e-03 4.536 0.000338 ***
Adver 1.510e-01 4.711e-02 3.205 0.005518 **
MktShare 1.990e+02 6.703e+01 2.969 0.009041 **
Change 2.909e+02 1.868e+02 1.557 0.138983
Accts 5.551e+00 4.776e+00 1.162 0.262130
WkLoad 1.979e+01 3.368e+01 0.588 0.564896
Rating 8.189e+00 1.285e+02 0.064 0.949977
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
𝒀=−1508+2.010X1+0.03721X2+0.151X3+199X4+290.9X5+5.5
51X6+19.79X7+8.189X8
Comparing both models model with 5 independent variants is better as it has Adjusted
R-squared = 0.8926 which is closer to 1 than with 8 predictors
step(Saleperform2)
Start: AIC=312.2
Sales ~ Time + MktPoten + Adver + MktShare + Change + Accts +
WkLoad + Rating
Step: AIC=310.2
Sales ~ Time + MktPoten + Adver + MktShare + Change + Accts +
WkLoad
Step: AIC=308.74
Sales ~ Time + MktPoten + Adver + MktShare + Change + Accts
Step: AIC=308.36
Sales ~ Time + MktPoten + Adver + MktShare + Change
Call:
lm(formula = Sales ~ Time + MktPoten + Adver + MktShare + Change)
Coefficients:
(Intercept) Time MktPoten Adver MktShare Change
-1114 3.612 4.209e-02 1.289e-01 2.570e+02 3.245e+02
Best model:
𝒀=−1114+3.612X1+0.04209X2+0.1289X3+257X4+324.5X5
Part(b)(ii) Discussion on the significant predictors of the best model:
As per t test (at α= 1%) The p values obtained from t19 (refer RStudio Coeffecients for Best
Fitted Model) of MktShare(X4) is .00000276 and of MktPoten(X2) is .00000527 are < .01
and closer to 0 as compared to other variables.
Thus X4 and X2 are the most significant variables, which implies enough evidence of
association between Sales Performance and Market Share and Market Potential controlling
Time, Advertisement and Change