Statistical Modelling for Business
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: THEend8_
QBUS2810
Statistical Modelling for Business
Group Assignment
This group assignment will contribute 20% towards your final result in the
unit. The deadline is 11:59pm Friday 4th November, 2022. Submission is
via Canvas.
This assignment must be completed in your Canvas group. It is entirely
students responsibility to form and/or join a group in the People section of
the 2810 Canvas site. Groups consist of precisely 3 students only.
Maximum Length: There is no maximum page length for this assignment. If you
have something interesting and worthwhile to include, then please do so without worry-
ing about a page limit. However, irrelevant or overly long-winded material may reduce
your overall mark (as well as the marker’s enjoyment of life). As a guideline, in pre-
vious runs of this class the typical report had between 20-25 pages, excluding Python
code.
Notes on Marking:
• The assignment will initially be marked out of 64.
• Up to an additional five (5) marks will be awarded based on the overall pre-
sentation quality of your report. Thus, you will receive a total mark for this
assignment out of 69. You will lose some of these 5 presentation marks for poor,
inefficient, unclear and/or unprofessional presentation. You will be rewarded for
professional, efficient and clear presentation methods. I expect your final report
to be done in a professional editing package and to be submitted in pdf only.
Html files of jupyter notebooks are not suitable.
• You must use Python for this assignment. You are being assessed on how
well you can use Python to complete the assignment tasks. NB: You can use
Excel for simple data manipulations and clean-up; but Python is better at these
2tasks too! All plots and statistical output in the assignment must have been
produced in Python, though you can of course make nicer tables in a text editor
to include in your assignment. Please include an appendix in your assignment
that contains the Python code your group used to produce ALL outputs in your
assignment. A heavy penalty will apply if the Python code is not supplied (or
the code supplied does not run or work when the marker tries to run it).
Key requirements:
Pre-analysis instructions for data:
Please include the python code from the Jupyter notebook file “grp assnt gendata.ipynb”
in your Jupyter notebook file to input and clean the data. Collect the student ID num-
bers for the members of your group and then add these numbers together. Input the
result into the python code where instructed. Run the subsequent code to generate
two datasets: “train” and “test”. Most analysis you do will only use the “train” data
set. Any forecasting your group does will only use the “test” dataset. The purpose
of these commands is to ensure that each group receives different randomly selected
datasets for “train”ing and “test”ing purposes. Two other python codes are included
in case you need it: forward selection.py and backword selection.py
Business problem: