SIT731
SIT731
Trimester 1, 2025
CONTENTS
Welcome ...................................................................................................................................................................... 2
Who is the unit team? .................................................................................................................................................. 2
Unit chair details .......................................................................................................................................................... 2
Other members of the team and how to contact them .............................................................................................. 2
Administrative queries ................................................................................................................................................. 2
About this unit ............................................................................................................................................................. 2
Unit development in response to student feedback ................................................................................................... 2
Learning Outcomes ...................................................................................................................................................... 3
Assessing your achievement of the unit learning outcomes ....................................................................................... 3
Hurdle requirements .................................................................................................................................................... 3
Summative assessment task 1 ..................................................................................................................................... 4
Your learning experiences in this unit .......................................................................................................................... 6
Educator-facilitated (scheduled) learning activities - on-campus unit enrolment ....................................................... 6
Educator-facilitated (scheduled) learning activities - online unit enrolment .............................................................. 6
Typical study commitment ........................................................................................................................................... 6
Note ............................................................................................................................................................................. 6
Unit learning resources ................................................................................................................................................ 7
Essential learning resources ......................................................................................................................................... 7
Recommended learning resources .............................................................................................................................. 7
Where to access unit resources ................................................................................................................................... 7
Key dates for this study period .................................................................................................................................... 7
Unit weekly activities ................................................................................................................................................... 8
24 February 2025
Deakin University, Faculty of Science, Engineering and Built Environment
SIT731 Data Wrangling - Trimester 1, 2025
Welcome
Data Science (DS) and Artificial Intelligence (AI) are popular fields in making sense of data that have been collected in large
quantities from various sources. Performing accurate exploration and modelling using DS and AI heavily rely on appropriately
prepared data. Data wrangling is the process of preparing the raw data appropriately for modelling purposes. The aim of this
unit is to learn various data wrangling methodologies and programming techniques to perform them. This include
programming in Python for performing various data wrangling tasks, learning data extraction methods from different
sources, working with different types of data, storing and retrieving them, applying sampling techniques and inspecting
them, cleaning them by identifying outliers/anomalies, handling missing data, transforming, selecting and extracting
features, performing exploratory analysis, visualisation using various tools, summarising data appropriately, performing basic
statistical analysis and modelling using basic machine learning. Further, techniques for maintaining data privacy and
exercising ethics in data manipulation will be covered in this unit.
This Unit Guide provides you with the key information about this unit. Please read it carefully and refer to it frequently
throughout the study period. Your unit site also provides information about your rights and responsibilities. We will assume
you have read this before the unit commences, and we expect you to refer to it throughout the study period.
read all materials in preparation for your learning activities and follow up each with further study and research on the
topic
start your assessment tasks well ahead of the due date
read or listen to all feedback carefully and use it in your future work
attend and engage in all educator facilitated (scheduled) learning activities and other learning experiences as part of
the unit design
Unit chair: leads the teaching team and is responsible for overall delivery of this unit
Unit chair details
Details of other teaching staff will be provided on the unit site at the start of the trimester.
Administrative queries
For additional support information, please see the Rights and Responsibilities section under 'Content' in your unit site.
About this unit
Every trimester, we ask students to tell us, through eVALUate, what helped and hindered their learning in each Unit. You are
strongly encouraged to provide constructive feedback for this Unit when eVALUate opens (you will be emailed a link).
In the previous editions of the unit, students have told us that these aspects of the unit have helped them to achieve the
learning outcomes:
the availability of engaging, step-by-step tutorial videos and many exercises in Python that emphasised on developing
The following aspects of the unit have been introduced, enhanced or retained in response to feedback from students who
have undertaken this unit in previous trimesters:
the feedback for this unit has remained positive, with only minor suggestions for improvement, which were
implemented in the course of the unit update for this year.
If you have any concerns about the Unit during the trimester, please contact the unit teaching team - preferably early in the
trimester - so we can discuss your concerns, and make adjustments, if appropriate.
Learning Outcomes
Each Unit in your course is a building block towards Deakin's Graduate Learning Outcomes - not all Units develop and assess
every Graduate Learning Outcome (GLO).
ULO These are the Unit Learning Outcomes (ULOs) for this unit. At the completion Alignment to Deakin Graduate
of this unit, successful students can: Learning Outcomes (GLOs)
ULO1 Undertake data wrangling tasks by using appropriate programming and GLO1: Discipline-specific knowledge
scripting languages to extract, clean, consolidate, and store data of different and capabilities
data types from a range of data sources. GLO3: Digital literacy
ULO2 Research data discovery and extraction methods and tools and apply resulting GLO3: Digital literacy
learning to handle extracting data based on project needs. GLO5: Problem solving
ULO3 Design, implement, and explain the data model needed to achieve project GLO1: Discipline-specific knowledge
goals, related security considerations, and the processes that can be used to and capabilities
convert data from data sources to both technical and nontechnical audiences. GLO2: Communication
GLO5: Problem solving
ULO4 Use both statistical and machine learning techniques to perform exploratory GLO1: Discipline-specific knowledge
analysis on data extracted, and communicate results to technical and non- and capabilities
technical audiences. GLO2: Communication
GLO5: Problem solving
ULO5 Apply and reflect on techniques for maintaining data privacy and exercising GLO8: Global citizenship
ethics in data handling.
Hurdle requirements
To be eligible to obtain a pass in this unit, students must meet certain milestones as part of the portfolio.
Brief summary of the hurdle requirements Rationale
To be eligible to obtain a pass in this unit, students must meet The pass tasks in this unit provide students with the
certain milestones as part of the portfolio. opportunity to develop and demonstrate achievement of
the Unit Learning Outcomes at the minimum expected
standards.
1. Unit Tasks (Learning Portfolio) These tasks are included as hurdle requirements so that
Students are required to submit tasks, collaborate with their students are able to provide evidence of achievement of
tutor to resolve any issues identified, and discuss their these ULOs through their portfolio. The portfolio artefact
understanding of the associated concepts by each task's that they submit is used to measure their performance
indicated due date. against the minimum standards as well as their ability to
justify the outcomes that they have achieved through
• By the end of week 3, students are required to have self-assessment and reflection. The hurdle requirement
completed at least the Pass Tasks 1 and 2 also provides a mechanism for student-staff interaction to
• By the end of week 11, students are required to have check progress and address educational and motivational
submitted all pass tasks to demonstrate achievement of the issues before it is too late in the trimester.
unit learning outcomes.
Deakin has a universal assessment submission time of 8 pm AEDT/AEST. A late penalty will apply to assessments submitted
after 11.59 pm AEDT/AEST.
Summative assessment task 1
Brief description of assessment This is an individual task. Students will develop their portfolio weekly by submitting
task tasks and receiving feedback through the OnTrack system. At the end of the Trimester
these tasks, along with a reflective learning summary report, will form the submission
for grading. Grades awarded will take into consideration the quality of the work
submitted, depth of learning demonstrated, evidence of achievement of unit learning
outcomes, timeliness of submissions, and ability to receive and incorporate feedback.
Online independent and collaborative learning including 1 x 2 hour online seminar per week, weekly meetings.
Typical study commitment
Students will on average spend 150 hours over the teaching period undertaking the teaching, learning and assessment
activities for this unit.
This will include educator guided online learning activities within the unit site.
The learning and assessment activities for this unit will include watching weekly videos, attending and participating in group
activities (online or in class), completing the learning and assessment tasks, and associated reading and study time.
All unit learning materials are provided in the unit site and in OnTrack. This includes the following resources organised for 11
weeks of learning to enable you achieve the unit learning outcomes:
You are expected to complete the prescribed readings, watch the concept and demonstration videos in the unit site, and
complete unit tasks in OnTrack. As you complete the tasks, you will be able to collect evidence for justifying how you have
met the unit learning outcomes through your portfolio. The process of developing your portfolio is simple and easy, so keep
that in mind as you read the assessment instructions below.
Engage with the learning resources each week on the Unit site before attending seminar or weekly group activities online.
These resources will help you prepare for the group activities. Each week’s seminar, or online group activity, will help develop
your understanding of unit concepts and skills.
Unit tasks within the portfolio are designed to help scaffold your learning, as well as helping you demonstrate achievement
of unit learning outcomes. You will have opportunities to receive and incorporate feedback from your tutor, allowing you to
freely ask questions and clarify misunderstandings without the risk of losing marks.
Make sure that you take advantage of the formative feedback process associated with the learning portfolio and weekly
tasks. During this interaction you can ask questions and clarify doubts to make sure that you will be fully prepared for this
assessment.
Note
At Deakin, courses are delivered within a learning environment that provides all students with equitable and consistent
access to facilities, infrastructure, resources, and support to assist student progress and achievement of learning outcomes.
We have introduced new terms to reflect learning activities to enhance your learning experience, aligning with our innovative
DeakinDesign learning principles and practices. The new terms better reflect how teaching teams will guide you through
your learning journey and the types of learning experiences you will have.
‘Lectures’ are the activities where teaching staff engage you through presentations with student participation.
In ’seminars’, an educator will guide you in a smaller group of students through highly interactive discussions and activities.
Your units may also include ‘practical experiences’ such as ‘laboratory‘, ‘workshops‘, ‘clinical skills‘ and more. These hands-
on activities typically take place in specialised facilities with industry tools, equipment or technology to allow you to apply
your knowledge practically.
‘Assessments‘ or ‘team-based learning‘ indicate an in-class evaluation of your skills or knowledge. A ‘pre-assessment
practice‘ could be scheduled to prepare you for these assessments.
Take a look at the Learning activities webpage for a full list of the terminology changes and reasons they were changed.
Unit learning resources
Your unit learning resources can be accessed from your unit site.
There is no prescribed text. Unit materials are provided via the unit site. This includes unit topic readings and references to
further information.
The texts and reading list for SIT731 can be found via the University Library.
Note: Select the relevant trimester reading list. Please note that a future teaching period's reading list may not be available
until a month prior to the start of that teaching period so you may wish to use the relevant trimester's prior year reading list
as a guide only.
Essential learning resources
Essential learning resources will be provided via the unit site. Further, relevant technical documentation will be linked to in
the relevant modules on the unit site.
Recommended learning resources
The Deakin Software Library provides students with access to software that you may need or find useful for your study at
Deakin.
Students looking for additional assistance should consult the following textbooks. These books match some of the content
being covered in the unit:
Minimalist Data Wrangling with Python, Zenodo, Marek Gagolewski, 2022, Melbourne, URL:
https://datawranglingpy.gagolewski.com/ (open-access textbook)
Wes McKinney, Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, O’Reilly, 3rd Edn, 2022
(This book is available online through the Deakin library).
Textbooks can be sourced from various outlets including direct from the publisher, online bookshops, or retailers. Limited
copies of textbooks are also available on loan from the University Library.
Key dates for this study period