Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: THEend8_
BSAN2204 - Coding Documentation
Background
The Coding Documentation Project Report (A2) documents the code used in your analysis of the Million
Song Dataset (MSD). It should include the code used to generate the outputs of the first Project Report
(A1), as well as extended analysis of the dataset using the methods learned in the latter half of the course.
The project should be submitted as a PDF document.
You are encouraged to use RMarkdown to compile your report.
Purpose
This purpose of this project is to provide both technical documentation of the R script used in your analysis
and demonstrate your ability to extend your analysis using the methods of business analytics taught in the
latter half of the semester (model validation, missing data analysis, dimensionality reduction).
The conclusion of this project should outline some possible strategic business decisions that a manager might
make based on your finalised analysis.
Suggested structure for the Coding Documentation Project Report (A2)
The Coding Documentation Project Report (A2) should have the following sections:
1. Data preparation and reconstruction
2. Exploring and visualising data
3. Predictive analytics
4. Conclusions
Data preparation and reconstruction
In the Data preparation and reconstruction section you should include the R script and output used to read
in the dataset, selecting a subset and treating missing data.
Describe the logic of the R syntax/functions used, what options (arguments) have been set, and what types
of input data (R objects) are used.
Provide basic interpretations in-text for the outputs.
Exploring and visualising data
In the Exploring and Visualising Data section you should include the R script and output used to generate
basic graphs and univariate/bivariate displays, measures of central tendency for your chosen output variable
as well as basic statistics (counts, proportions) that describe the dataset (using the subset selected following
your Data Preparation and Reconstruction steps).