Title: | Programming for Data Analytics |
Long Title: | Programming for Data Analytics |
Field of Study: |
Computer Software
|
Valid From: |
Semester 1 - 2017/18 ( September 2017 ) |
Module Coordinator: |
Sean McSweeney |
Module Author: |
Ted Scully |
Module Description: |
Data analytics is a set of techniques and processes that can be used to provide identify and analyze patterns from data. In this module the learner will be provided with the skills to import and manipulate various forms of data and perform exploratory data analysis. The learner will also be equipped with the skills to effectively visualize different aspects of data and perform basic classification and clustering techniques. |
Learning Outcomes |
On successful completion of this module the learner will be able to: |
LO1 |
Apply programming techniques to facilitate the importation, manipulation and cleaning of data. |
LO2 |
Implement exploratory data analysis techniques and interpret results. |
LO3 |
Choose and employ appropriate visualization techniques for depicting data. |
LO4 |
Select and apply basic classification and clustering techniques to a range of datasets. |
LO5 |
Evaluate the accuracy and interpret the results of classification algorithms. |
Pre-requisite learning |
Module Recommendations
This is prior learning (or a practical skill) that is strongly recommended before enrolment in this module. You may enrol in this module if you have not acquired the recommended learning but you will have considerable difficulty in passing (i.e. achieving the learning outcomes of) the module. While the prior learning is expressed as named MTU module(s) it also allows for learning (in another module or modules) which is equivalent to the learning specified in the named module(s).
|
|
Incompatible Modules
These are modules which have learning outcomes that are too similar to the learning outcomes of this module. You may not earn additional credit for the same learning and therefore you may not enrol in this module if you have successfully completed any modules in the incompatible list. |
No incompatible modules listed |
Co-requisite Modules
|
No Co-requisite modules listed |
Requirements
This is prior learning (or a practical skill) that is mandatory before enrolment in this module is allowed. You may not enrol on this module if you have not acquired the learning specified in this section.
|
No requirements listed |
Module Content & Assessment
Indicative Content |
Introduction
Overview of the terminology and applications in the area of data science and analytics. Importance of data analytics in industry.
|
Data Extraction and Manipulation
Importing data from different sources in different formats. Applying data manipulation techniques such as reshaping, pivoting, array-based indexing, joining, cleaning and munging, grouping, aggregation.
|
Visualization and Exploratory Data Analysis
Overview of a range of visualization techniques such as histograms, scatter plots, heatmaps, clustered matrices, boxplots, regression plots. Critically evaluate and assess the applicability of different visualization techniques for a specific task. Obtaining basic summary statistics such as mean, median, standard deviation, variance and range. Using programming techniques to perform more advanced analysis such as multi-collinearity analysis.
|
Clustering Analysis
Introduction to the concept and motivation for clustering. Examine the operation of a standard clustering algorithm such as k-means. Case study illustrating the advantages and limitations of applying k-means to a dataset form a specific application domain.
|
Classification
Overview of the concept of classification and its role solving real-world problems. Introduction to a standard category of classification algorithm such as decision trees. Case study to illustrate the application of classification to a dataset. Overview of k-fold cross validation as an appropriate method of evaluation.
|
Assessment Breakdown | % |
Course Work | 100.00% |
Course Work |
Assessment Type |
Assessment Description |
Outcome addressed |
% of total |
Assessment Date |
Open-book Examination |
Perform importation, cleaning and manipulation of a dataset and perform exploratory data analysis. |
1,2 |
20.0 |
Week 6 |
Project |
Complete a comprehensive analysis of a real-world dataset and produce a report documenting findings and incorporating appropriate visualizations. |
1,2,3 |
30.0 |
Week 8 |
Project |
Select and apply appropriate classification techniques to a dataset from a specific application domain. Findings should be documented and supported with appropriate visualisations. |
3,4,5 |
50.0 |
Sem End |
No End of Module Formal Examination |
Reassessment Requirement |
Coursework Only
This module is reassessed solely on the basis of re-submitted coursework. There is no repeat written examination.
|
The institute reserves the right to alter the nature and timings of assessment
Module Workload
Workload: Full Time |
Workload Type |
Workload Description |
Hours |
Frequency |
Average Weekly Learner Workload |
Lecture |
Lecture delivering theory underpinning learning outcomes. |
2.0 |
Every Week |
2.00 |
Lab |
Practical computer-based lab supporting learning outcomes. |
2.0 |
Every Week |
2.00 |
Independent Learning |
Independent Student Learning. |
3.0 |
Every Week |
3.00 |
Total Hours |
7.00 |
Total Weekly Learner Workload |
7.00 |
Total Weekly Contact Hours |
4.00 |
Workload: Part Time |
Workload Type |
Workload Description |
Hours |
Frequency |
Average Weekly Learner Workload |
Lab |
Practical computer-based lab supporting learning outcomes. |
2.0 |
Every Week |
2.00 |
Lecture |
Lecture delivering theory underpinning learning outcomes. |
2.0 |
Every Week |
2.00 |
Independent Learning |
Independent Student Learning. |
3.0 |
Every Week |
3.00 |
Total Hours |
7.00 |
Total Weekly Learner Workload |
7.00 |
Total Weekly Contact Hours |
4.00 |
Module Resources
Recommended Book Resources |
---|
- Wes McKinney 2012, Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 1st Ed., O'Reilly Media [ISBN: 9781449319793]
- John V. Guttag 2016, Introduction to Computation and Programming Using Python: With Application to Understanding Data, 2nd Ed., MIT Press [ISBN: 9780262529624]
- Joel Grus 2015, Data Science from Scratch: First Principles with Python, 1st Ed., O'Reilly Media [ISBN: 9781491901427]
| Supplementary Book Resources |
---|
- Clinton W. Brownley 2015, Foundations for Analytics with Python, 1st Ed., O'Reilly Media [ISBN: 9781491922538]
- Fabio Nelli 2015, Python Data Analytics: Data Analysis and Science using pandas, matplotlib and the Python Programming Language, 1st Ed., Apress [ISBN: 9781484209592]
| This module does not have any article/paper resources |
---|
Other Resources |
---|
- Website: Python Documentation
- Website: Python Data Analysis Library
- Website: Numerical Python Library
- Website: Python Interactive Computing
- Website: Seaborn Visualization
- Website: Matplotlib Visualization
|
Module Delivered in
|