Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: THEend8_
BUSINESS ANALYTICS
Module – BU52018 Level - 5 2 HOURS ANSWER QUESTION 1 IN SECTION A AND THREE QUESTIONS IN SECTION B ALL QUESTIONS CARRY EQUAL MARKS The use of approved calculators (FX83, FX85 or FX 115) is permitted in this examination BU52018 Page 2 of 7 SECTION A ANSWER ALL PARTS OF THE QUESTION IN THIS SECTION 1. A travel agency would like to forecast the number of UK citizen visits abroad so that they can develop a growth plan for their business. The agency’s statistician developed the following model, to be estimated using quarterly data for the past 10 years (40 observations):
ܸܫܵܫܶܵ௧ ൌ ߚଵ ߚଶܴܶܧܰܦ௧ ߚଷܩܦܲ_ܲܥ௧ ߚସܳ2௧ ߚହܳ3௧ ߚ2ܳ4௧ ߝ௧ where: • ܸܫܵܫܶܵ௧ is the number of UK citizen visits abroad during period ݐ, measured in millions, • ܴܶܧܰܦ௧ is an index indicating the period to which the data point corresponds (1, 2, 3, … 40), • ܩܦܲ_ܲܥ௧ is the Gross Domestic Product (GDP) per capita in the UK for period ݐ, • ܳ2௧ is a dummy variable equal to one if the observation corresponds to quarter 2 (April- May-June), • ܳ3௧ is a dummy variable equal to one if the observation corresponds to quarter 3 (July- August-September), and • ܳ4௧ is a dummy variable equal to one if the observation corresponds to quarter 4 (October-November-December).
The results of the model estimation are presented in the following table.
Dependent Variable: VISITS Method: Least Squares Date: --/--/-- Time: --:-- Sample: 1 40 Included observations: 40 Variable Coefficient Std. Error t-Statistic Prob. C 95.19166 45.45449 2.094219 0.0438 TREND 0.503173 0.152674 3.295738 0.0023 GDP_PC -2.077089 1.158584 -1.792783 0.0819 Q2 1.650664 0.576868 2.861422 0.0072 Q3 2.674084 0.578551 4.622037 0.0001 Q4 0.192040 0.579254 0.331531 0.7423 R-squared 0.859137 Mean dependent var 19.58068 Adjusted R-squared 0.838421 S.D. dependent var 3.207040 S.E. of regression 1.289129 Akaike info criterion 3.483291 Sum squared resid 56.50300 Schwarz criterion 3.736623 Log likelihood -63.66583 Hannan-Quinn criter. 3.574888 F-statistic 41.47372 Durbin-Watson stat 0.915586 Prob(F-statistic) 0.000000
(a) Report the estimated model in standard form, including standard errors. (10)
(b) Interpret the estimates of all parameters (including the constant term), paying special attention to the units of measurement of the dependent and independent variables. (25)
(c) Interpret the number “0.0819” that appears under the “Prob.” column and on the row corresponding to the “GDP_PC” variable on the table above. (10) BU52018 Page 3 of 7
(d) Interpret the F-statistic of the model (clearly state the null and alternative hypotheses and whether you reject the null or not and at what level of significance). (15)
(e) Interpret the R2 of the model. (10)
(f) Test at the 5% significance level the hypothesis that the parameter associated with the “Q4” variable is greater than 1.0. (15)
(g) The residuals from the model are plotted against the time index in the figure below. What deviation(s) from the Gauss-Markov assumptions do you identify from this plot? What are the consequences of this deviation for the Ordinary Least Squares estimator, in general? (15)
-3 -2 -1 0 1 2 3 5 10 15 20 25 30 35 40
BU52018 Page 4 of 7
SECTION B
ANSWER THREE QUESTIONS IN THIS SECTION
2. SeaShellsByTheSeaShore Investments is a small retirement fund that usually invests in low- risk, low-tech stocks. However, the fund’s CEO recently learned about a new start-up, high- tech company with great potential for profitability, but also high risk:
the probability of such a start-up surviving the first year, SP , is only 10% the probability of such a start-up failing in the first year, SP , is 90%
If the start-up survives the first year then it will turn highly profitable.
Because SeaShellsByTheSeaShore Investments’ analysts are unfamiliar with the hi-tech market, the CEO decides to hire an external consultant, HappyDay Analysts, to predict whether the start-up company will survive the first year. HappyDay Analysts are not always correct in their predictions: they are known to be overly optimistic about the prospects of start-ups. In their website they provide the following data: “out of 100 start-ups that survived we predicted correctly on 80!” “out of 900 start-ups that failed we predicted correctly on 540!”
Do the following, showing the steps of your derivations:
(a) Calculate the probability of HappyDay Analysts predicting that a start-up will survive given that it survived – SPSP . (10)
(b) Calculate the probability of HappyDay Analysts predicting that a start-up will fail given that it failed – SPSP . (10)
(c) Use the law of total probability to calculate the probability of HappyDay Analysts predicting that a start-up will survive – PSP . (20)
(d) Use Bayes’ theorem to calculate the probability of a start-up surviving given a prediction from HappyDay Analysts that it will survive – PSSP . (20)
(e) Calculate the probability of the start-up failing given a prediction from HappyDay Analysts that it will survive – PSSP . (10)
(f) Calculate the probability of HappyDay Analysts predicting that the start-up will fail – PSP . (10)
(g) Calculate the probability of the start-up surviving given a prediction from HappyDay Analysts that it will fail – PSSP . (20)
BU52018 Page 5 of 7
3. A health insurance company offers a policy according to which it charges an annual fee (premium) to a customer (policy holder) and, in return, it covers all costs resulting from visits of the customer to hospitals or medical doctors during the year. The current level of the premium is £150 and the cost per visit to a hospital/doctor is £40.
The company’s manager is currently reconsidering the value of the premium because he is concerned that the expected profit (revenue from the premium minus expected cost from visits to hospitals/doctors) from this policy may be negative. To make an informed decision, the manager collects data on the number of visits to hospitals/doctors during the previous year from 100 policy holders. The data appear on the following table.
# of visits to hospitals/doctors in the past year # of policy holders with this many visits 0 5 1 19 2 26 3 22 4 13 5 8 6 4 7 2 8 1 Total 100
Please do the following while using appropriate notation and showing your workings.
(a) Calculate the average number of visits to hospitals/doctors per year by a policy holder. (20)
The insurance company’s manager decides to model the number of visits to hospitals/doctors per policy holder as a random variable which follows a Poisson distribution with parameter, μ, equal to the number you identified in (a).
(b) What is the probability of a policy holder visiting a hospital/doctor no more than 2 times in a year if the number of visits follows a Poisson distribution? (15)
(c) Assuming that the number of visits to hospitals/doctors per year does indeed follow a Poisson distribution, what is the probability that a policy holder will visit a hospital/doctor more than 6 times in a year? (15)
(d) Assuming that the number of visits to hospitals/doctors per year does indeed follow a Poisson distribution, what is the probability that a policy holder will visit a hospital/doctor between 2 and 6 times in a year (margins included)? (20)
(e) What is the probability that the insurance company makes a loss from a given policy holder? Please explain. (15)
(f) What is the expected profit per customer? Please define random variables explicitly and show your working. (15)
BU52018 Page 6 of 7
4. A large corporation frequently hires data scientists to expand its operations, as well as to fill posts that become vacant from their own data scientists moving to other companies. For that reason they need to have good knowledge of the average salary of a data scientist in their industry. Until now the hiring manager of this corporation has been operating under the assumption that the average salary of a data scientist is £50,000 and she has been making offers to job candidates at that rate. However, due to recent trends in the industry, she suspects that the mean salary has gone up. To examine this possibility she collected data on 49 newly- hired data scientists and she found that the average salary offered to them was £54,000, with a standard deviation of £9,000 (sample variance of £281,000,000)
(a) Set up appropriate null and alternative hypotheses to test the claim that the mean salary of a data scientist in the industry is above £50,000. (20)
(b) Test the hypothesis that you identified in (a) at the 5% level of significance. Should the hiring manager alter the salary she has been offering to job applicants? (30)
Until now the hiring manager has been operating under the assumption that the standard deviation of data-scientist salaries is £10,000, but given the new data she obtained, she suspects that the variability in salaries has gone down.
(c) Set up appropriate null and alternative hypotheses to test the claim that the standard deviation of data-scientist salaries is greater than £10,000 (variance greater than £2100,000,000). (20)
(d) Test the hypothesis that you identified in (c) at the 5% level of significance. Can it be concluded that the variability in salaries is below the £10,000 threshold used until now? (30)
BU52018 Page 7 of 7
5. A class of 12 university students takes a test on Business Analytics. The invigilator assigned to the class recently read an article which stated that 15% of all people are left handed and he wants to evaluate whether this claim has any credibility. Let X be the number of students in the class who are left handed.
Please answer the following questions using appropriate notation and showing your workings:
(a) What distribution does X follow if 15% of all people are left handed? State the name of the distribution and the value(s) of its parameter(s)? (15)
(b) What is the probability of exactly 3 students being left handed if 15% of all people are left handed? (10)
(c) What is the probability of no more than 3 students being left handed if 15% of all people are left handed? (10)
(d) What is the probability of 4 or more students being left handed if 15% of all people are left handed? (15)
(e) What is the probability of 2 to 7 students being left handed (margins included) if 15% of all people are left handed? (20)
(f) If 15% of all people are left handed, what is the expected number of left-handed students in the class? (15)
(g) If 15% of all people are left handed, what is the expected number of right-handed students in the class? Please define random variables explicitly and show your working. (Here you can assume that people are either left or right handed, not both).