Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: THEend8_
Summative re-assessment for the module is by means of a single report on your statistical
analysis of a data set related to the National Basketball Association (NBA), a professional
basketball league in the USA. Please read this document carefully.
This form of assessment has been chosen because as professional statisticians and data scientists,
you will often be asked to investigate a data set and report on whether it contains anything useful
or interesting. The assessment will also help you to prepare for writing your MSc dissertation
in the summer.
Your report will be assessed according to how well you are able to
analyse the data set, 40%
interpret the results of your analysis, and 30%
present the results of your analysis and your interpretation of the data set. 30%
Your analysis should be performed using the R statistical software package, and your report
prepared using the R Markdown typesetting system and the template provided. Two marks will
be deducted for reports prepared using alternative systems such as Microsoft Word or LaTeX.
1 The data
The data set NBA_sample.csv is a partial record of shots taken by players in the NBA between
October 2014 and March 2015, and consists of 50,000 observations on 20 variables as described
in Table 1. A summary of the changes made to the data set provided for the original coursework
can be found at the end of this document.
Variable Description
GAME_ID Unique id number of the game.
DATE Date of the game.
HOME_TEAM Team playing at home.
AWAY_TEAM Team playing away from home.
PLAYER_NAME Name of the shooting player.
PLAYER_ID Unique id number of the shooting player.
LOCATION Whether the player was on the home (H) or away (A) team.
WIN_LOSE Whether the player’s team won (W) or lost (L) the game.
SHOT_NUMBER The number of the shot taken by the shooting player in that game.
PERIOD The period of the game that the shot was taken.
SEC_REMAIN The number of seconds before the end of the period that the shot was taken.
SHOT_CLOCK The time remaining before the shot must be taken.
DRIBBLES Number of dribbles by the player before the shot was taken.
TOUCH_TIME The time that the ball was in the shooting player’s hand.
SHOT_DIST The distance of the shooting player from the basket.
PTS_TYPE 2 for shots from inside the arc, 3 for shots from outside the arc.
CLOSEST_DEFENDER Name of the nearest defender when the shot was taken.
CLOSEST_DEFENDER_ID Unique id number of the nearest defender.
CLOSE_DEF_DIST Distance of the nearest defender when the shot was taken.
SUCCESS Equal to 1 if the shot was made (scored), otherwise 0.