COMP8410 Data Mining
Assignment 2
Maximum marks: 100
Weight: 25% of the total marks for the course
Length: Maximum of 10 pages excluding cover sheet, bibliography and appendices.
Layout: A4 margin, at least 11 point type size, use of typeface, margins and headings consistent with a professional style.
Submission deadline: 9:00am, Monday, 9 May
Submission mode: Electronic, via Wattle
Estimated time: 15 hours
Penalty for lateness: 100% after the deadline has passed
Questions to: Wattle Discussion Forum
This assignment specification may be updated to reflect clarifications and modifications after it is first issued. It is strongly suggested that you start working on the assignment right away. You can submit as many times as you like. Only the most recent submission at the due date will be assessed.

In this assignment, you are required to submit a single report in the form of a PDF file. You may also attach supporting information (appendices) as one or more identified sections at the end of the same PDF file. Appendices will not be marked but may be treated as supporting information to your report. Please use a cover sheet at the front that identifies you as the author of the work using your u-number and name, and identifies this as your submission for COMP8410 Assignment 2. The cover sheet and appendices do not contribute to the page limit or word count.

You are expected to have both an introduction and a conclusion in your report. No particular layout is specified, but you should use a typeface no smaller than 11 point and stay within the maximum specified page count. Page margins, heading sizes, paragraph breaks and so forth are not specified, but a professional style must be maintained. Text beyond the page limit will be treated as non-existent.

This is a single-person assignment and should be completed on your own. Make certain you carefully reference all the material that you use, although the nature of this assignment suggests few references will be needed. It is unacceptable to cut and paste another author's work and pass it off as your own. Anyone found doing this, from whatever source, will get a mark of zero for the assignment and, in addition, CECS procedures for plagiarism will apply.

No particular referencing style is required. However, you are expected to reference conventionally, conveniently, and consistently. References are not included in the page limit. Due to the context in which this assignment is placed, you may refer to the course notes or course software where appropriate (e.g. “For this experiment Rattle was used”) without formal reference to original sources, unless you copy text, which always requires a formal reference to the source.

An assessment rubric is provided. The rubric will be used to mark your assignment. You are advised to use it to supplement your understanding of what is expected for the assignment and to direct your effort towards the most rewarding parts of the work.

Your assignment submission will be treated confidentially. It will be available to ANU staff involved in the course for the purposes of marking. It may be shared, de-identified, as an exemplar for other students.
Task
You are to study the supplied data set and to apply data mining processes and techniques to discover interesting things about the data. You are to write a short report that justifies and explains your methods in detail, presents your results, and evaluates and interprets the results you find. In the following, the task is described in terms of what your report should contain, not in terms of the steps you should take to carry out the assignment. Similarly, in your report you should describe the methods used in the language of data mining, not in terms of the commands you typed or the buttons you selected.