Fall STA 3386
Patrick Project 1

On Canvas (under Files, Datasets), there is a file called Golf.csv.
This data set contains the season earnings, number of events played, and average
score for the top 21 PGA golfers for a particular season. Follow the instructions below.
You will use R for all of your analysis. You will submit your R script to Canvas.
Due Tuesday Oct 26 by 11:59 pm.
1. At the top of your script, use comments
to put your name and the names of any others working with you on this project.
For example, you can use something like
2. Fit a multiple regression model using Earnings as the dependent variable,
and Events and Score as the independent variables.
Use comments in your code to discuss the global F-test, the individual t-tests,
and the coefficient of determination.
3. Check for multicollinearity and check your assumptions.
For each check you run, use comments to discuss which assumptions are violated and how you determined this.
4. Try to fix any violations of your assumptions from step 3. Hint: Think about some type of transformation.
Use comments to discuss what you did to fix each violation.
5. Fit the model again with whatever fix you came up with. Use comments to discuss the global F-test,
individual t-tests, and the coefficient of determination with this model.
6. Check for multicollinearity and check your assumptions for this new model.
For each check you run, use comments to discuss which assumptions are violated and how you determined this.
7. Compare the model from item 2 and the model from item 5.
Use comments in your code to summarize which model you consider best.
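
The general shape of the workflow in the steps above can be sketched in base R as follows. This is only an illustration under stated assumptions: the data frame is simulated (it is not Golf.csv), the column names Earnings, Events, and Score are assumed to match the file, and the log transformation is one possible fix among several, not necessarily the intended one. The conclusions you write in your comments must come from the real data.

```r
# Simulated stand-in for Golf.csv (hypothetical values, NOT the real data).
set.seed(1)
n <- 21
golf <- data.frame(
  Events = sample(15:30, n, replace = TRUE),
  Score  = round(rnorm(n, mean = 70, sd = 0.5), 2)
)
# Earnings are generated on a log scale so the raw response is right-skewed.
golf$Earnings <- exp(14 + 0.03 * golf$Events - 0.8 * (golf$Score - 70) +
                       rnorm(n, sd = 0.3))
# With the real file this would be something like: golf <- read.csv("Golf.csv")

# Fit the multiple regression on the raw response.
fit_raw <- lm(Earnings ~ Events + Score, data = golf)
summary(fit_raw)  # reports the global F-test, individual t-tests, and R-squared

# Multicollinearity: variance inflation factor, VIF = 1 / (1 - R^2_j).
# With exactly two predictors, R^2_j is their squared correlation,
# so both predictors share the same VIF.
vif <- 1 / (1 - cor(golf$Events, golf$Score)^2)
vif

# Assumption checks, wrapped in a helper so both models get the same treatment.
check_assumptions <- function(fit) {
  if (interactive()) {
    plot(fitted(fit), resid(fit),
         xlab = "Fitted values", ylab = "Residuals")  # linearity, equal variance
    abline(h = 0, lty = 2)
    qqnorm(resid(fit)); qqline(resid(fit))            # normality of residuals
  }
  shapiro.test(resid(fit))                            # formal normality test
}
check_raw <- check_assumptions(fit_raw)

# One possible fix: model log(Earnings) instead of Earnings, then refit.
fit_log <- lm(log(Earnings) ~ Events + Score, data = golf)
summary(fit_log)
check_log <- check_assumptions(fit_log)

# Side-by-side summary. R-squared values are on different response scales
# (dollars vs. log-dollars), so read them together with the diagnostics.
comparison <- data.frame(
  model     = c("raw", "log"),
  r_squared = c(summary(fit_raw)$r.squared, summary(fit_log)$r.squared),
  shapiro_p = c(check_raw$p.value, check_log$p.value)
)
comparison
```

The predictors are unchanged by transforming the response, so the VIF is the same for both models; only the residual diagnostics and the fit statistics differ.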