Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: THEend8_
Python coursework specification
Data Analytics ECS648U/ ECS784U/ ECS784P
1. Important Dates
General information:
i. When submitting coursework online you receive an automated e-mail as proof of
submission. Turnitin receipt does not constitute proof of submission. Some students
will sometimes upload their coursework and not hit the submit button. Make sure you
fully complete the submission process.
ii. A penalty will be applied automatically by the system for late submissions.
a. Your lecturer cannot remove the penalty!
b. Penalties can only be challenged via submission of an Extenuating
Circumstances (EC) form which can be found on your Student Support page.
All the information you need to know is on that page; including how to submit
an EC claim along with the deadline dates and full guidelines.
c. If you submit an EC form, your case will be reviewed by a panel and the panel
will make a decision on the penalty and inform the Module Organiser.
iii. If you miss both the submission deadline and the late submission deadline, you will
automatically receive a score of 0. Extensions can only be granted through approval
of an EC claim.
iv. Submissions via e-mail are not accepted.
v. It is recommended by the School that we set the deadline at 10:00 AM. Do not wait
until the very last moment to submit the coursework.
vi. Your submission should be a single PDF file.
vii. For more details on submission regulations, please refer to your relevant handbook.
2
2. Coursework overview
The coursework is based on the Python lectures and labs and can be completed individually or
in a group of up to three students. You are free to choose or collate your own dataset and apply
two data analytic techniques to a real-world problem of your choice. In brief, you will:
i. Decide whether to form a group or not,
ii. Agree on the application area,
iii. Investigate the are and prepare the Introduction along with a short literature review,
iv. Collect data,
v. Clean and pre-process the data (if necessary),
vi. Apply two data analytic methods to the data,
vii. Present and discuss results,
viii. Draw conclusions,
ix. Finalise the report covering all of the above (see Section 4 for marking criteria).
You should address a data-related problem in your professional field or a field you are
interested in. If you are motivated in the subject matter the project will be more fun for you and
you will likely produce a better report. The same applies to the data analytic methods; i.e., you
are free to apply and test the methods of your choice from those covered in the Python labs or
in the Python lectures.
Please note:
i. This module is available to students with and without computer science background.
The labs provide step-by-step tutorials on machine learning with Python. Anyone
should be able to follow these tutorials to analyse some data irrespective of previous
academic background.
ii. Projects can be done individually or in groups of up to three people. If you form a
group, it is always a good idea include people who have different skills so they can
be assigned to different parts of the project (e.g., a coder, an analyst, a person to do
the literature review, etc).
a. Some students will not be able to join a group due to work commitments or
other reasons. This is not a problem since the coursework can be completed
individually.
b. Please do not send an e-mail asking if it is acceptable to form a group of more
than three people; it is not acceptable.
3
3. Deliverables
The coursework deliverable takes the form of a mini conference paper. The report shall have a
maximum length of 10 pages excluding References and Appendices. Font size should not be
lower than 11 and Page margins should not be lower than 2 (these restrictions do not apply to
the References and Appendices).
Reports should be written with a technical audience in mind. It should be concise and clear,
adopting the same style you would use in writing a scientific report. Some of the components
your report may include:
i. Problem statement and hypothesis.
ii. Description of your dataset and how it was obtained.
iii. Description of any data pre-processing steps you took.
iv. What you have learned from exploring the data, including visualisations.
v. How you chose which features to use in your analysis.
vi. Details of your modelling process, including how you selected your data analytic
methods as well as the model through validation.
vii. Your challenges and successes.
viii. Key findings.
ix. Possible extensions or business applications of your project.