Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: THEend8_
Subject description
Statistical thinking is the foundational mindset in data science, emphasizing the use of statistical principles and methods to understand, analyze, and derive meaningful insights from data. It serves as the core of data science. This subject equips students with essential skills and concepts for applying statistical thinking in the context of applied data science. Initially, students are introduced to fundamental statistical principles, developing a simultaneous understanding of modern methods for statistical inference, and gaining valuable hands-on experience with real-world data. Subsequently, they delve into a range of statistical models and estimation techniques, applying their acquired knowledge to engage in a complete data science research cycle. Collaborating in teams, students learn how to formulate research inquiries, employ formal statistics and real-world datasets to address them, and effectively communicate their findings through both oral presentations and written reports.
The progression of this subject starts with more teaching-intensive methods such as workshops and lectures to give students the technical and conceptual know-how to work as practicing data scientists. As the subject progresses, students increasingly move towards an individually driven learning mode, allowing both teams and individuals the flexibility to enhance their statistical thinking and skills.
Upon completion of the subject, students possess a robust foundation in technical, conceptual, and practical aspects, empowering them to continue their development as Data Scientists.
Subject learning objectives (SLOs)
Upon successful completion of this subject students should be able to:
1. Manage the complexity of real data science projects and their inevitable compromises
2. Formulate authentic data science questions precise enough to be answered by valid statistical techniques
3. Justify the use of different statistical concepts and tools to audiences from a wide range of backgrounds
4. Find, clean, and merge datasets from a range of sources to answer real world data science problems
5. Apply statistical methods that are appropriate to a dataset and stakeholder requirements
6. Interpret the results of a statistical analysis correctly, visualizing and reporting upon them in ways that create value for, and are sensitive to the needs of, a wide range of stakeholders
7. Collaborate with and contribute to the professional community of data scientists, both local and global
Course intended learning outcomes (CILOs)
This subject also contributes specifically to the development of the following course outcomes:
Exploring and testing models and describing behaviours of complex systems
Explore and test models and generalisations for describing the behaviour of sociotechnical systems and selecting data sources, taking into account the needs and values of different contexts and stakeholders (1.2)
Making the invisible visible
Use transdisciplinary approaches to seeing and doing to uncover underrepresented, or misrepresented, elements of a system (1.4)
● Exploring, interpreting and visualising data
Explore, analyse, manipulate, interpret and visualise data using data science techniques, software and technologies to make sense of data rich environments (2.2)
. Designing and managing data investigations
Apply and assess data science concepts, theories, practices and tools for designing and managing data discovery investigations in professional environments that draw upon diverse data sources, including efforts to shed light on underrepresented components (2.4)