Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: THEend8_
COMP8410 Data Mining
Assignment 2
Maximum marks 100
Weight 25% of the total marks for the course
Length
Maximum of 10 pages excluding cover sheet, bibliography and
appendices.
Layout
A4 margin, at least 11 point type size, use of typeface, margins
and headings consistent with a professional style.
Submission deadline 9:00am, Monday, 9 May
Submission mode Electronic, via Wattle
Estimated time 15 hours
Penalty for lateness 100% after the deadline has passed
Questions to: Wattle Discussion Forum
This assignment specification may be updated to reflect clarifications and modifications after it is
first issued. It is strongly suggested that you start working on the assignment right away. You can
submit as many times as you like. Only the most recent submission at the due date will be assessed.
In this assignment, you are required to submit a single report in the form of a PDF file. You may also
attach supporting information (appendices) as one or more identified sections at the end of the
same PDF file. Appendices will not be marked but may be treated as supporting information to your
report. Please use a cover sheet at the front that identifies you as author of the work using your u-
number and name and identifies this as your submission for COMP8410 Assignment 2. The cover
sheet and appendices do not contribute to the page limit or word count.
You are expected to have both an introduction and a conclusion in your report.
No particular layout is specified, but you should follow use no smaller than 11 point typeface and
stay within the maximum specified page count. Page margins, heading sizes, paragraph breaks and
so forth are not specified but a professional style must be maintained. Text beyond the page limit
will be treated as non-existent.
This is a single-person assignment and should be completed on your own. Make certain you
carefully reference all the material that you use, although the nature of this assignment suggests few
references will be needed. It is unacceptable to cut and paste another author's work and pass it off
2
as your own. Anyone found doing this, from whatever source, will get a mark of zero for the
assignment and, in addition, CECS procedures for plagiarism will apply.
No particular referencing style is required. However, you are expected to reference conventionally,
conveniently, and consistently. References are not included in the page limit. Due to the context in
which this assignment is placed, you may refer to the course notes or course software where
appropriate (e.g. “For this experiment Rattle was used”), without formal reference to original
sources, unless you copy text which always requires a formal reference to the source.
An assessment rubric is provided. The rubric will be used to mark your assignment. You are advised
to use it to supplement your understanding of what is expected for the assignment and to direct
your effort towards the most rewarding parts of the work.
Your assignment submission will be treated confidentially. It will be available to ANU staff involved
in the course for the purposes of marking. It may be shared, de-identified, as an exemplar for other
students.
Task
You are to study the supplied data set and to apply data mining processes and techniques to
discover interesting things about the data. You are to write a short report that justifies and explains
your methods in detail, presents your results, and evaluates and interprets the results you find. In
the following, the task is described in terms of what your report should contain, not in terms of the
steps you should take to carry out the assignment. In your report, similarly, you should describe the
methods used in terms of the language of data mining, not in the terms of commands you typed or
buttons you selected.