#REQUEST.pageInfo.pagedescription#

Site Navigation

SOFT8032 - Programming for Data Analytics

banner1
Title:Programming for Data Analytics
Long Title:Programming for Data Analytics
Module Code:SOFT8032
 
Duration:1 Semester
Credits: 5
NFQ Level:Advanced
Field of Study: Computer Software
Valid From: Semester 1 - 2017/18 ( September 2017 )
Module Delivered in 4 programme(s)
Module Coordinator: Sean McSweeney
Module Author: Ted Scully
Module Description: Data analytics is a set of techniques and processes that can be used to provide identify and analyze patterns from data. In this module the learner will be provided with the skills to import and manipulate various forms of data and perform exploratory data analysis. The learner will also be equipped with the skills to effectively visualize different aspects of data and perform basic classification and clustering techniques.
Learning Outcomes
On successful completion of this module the learner will be able to:
LO1 Apply programming techniques to facilitate the importation, manipulation and cleaning of data.
LO2 Implement exploratory data analysis techniques and interpret results.
LO3 Choose and employ appropriate visualization techniques for depicting data.
LO4 Select and apply basic classification and clustering techniques to a range of datasets.
LO5 Evaluate the accuracy and interpret the results of classification algorithms.
Pre-requisite learning
Module Recommendations

This is prior learning (or a practical skill) that is strongly recommended before enrolment in this module. You may enrol in this module if you have not acquired the recommended learning but you will have considerable difficulty in passing (i.e. achieving the learning outcomes of) the module. While the prior learning is expressed as named MTU module(s) it also allows for learning (in another module or modules) which is equivalent to the learning specified in the named module(s).

Incompatible Modules
These are modules which have learning outcomes that are too similar to the learning outcomes of this module. You may not earn additional credit for the same learning and therefore you may not enrol in this module if you have successfully completed any modules in the incompatible list.
No incompatible modules listed
Co-requisite Modules
No Co-requisite modules listed
Requirements

This is prior learning (or a practical skill) that is mandatory before enrolment in this module is allowed. You may not enrol on this module if you have not acquired the learning specified in this section.

No requirements listed
 

Module Content & Assessment

Indicative Content
Introduction
Overview of the terminology and applications in the area of data science and analytics. Importance of data analytics in industry.
Data Extraction and Manipulation
Importing data from different sources in different formats. Applying data manipulation techniques such as reshaping, pivoting, array-based indexing, joining, cleaning and munging, grouping, aggregation.
Visualization and Exploratory Data Analysis
Overview of a range of visualization techniques such as histograms, scatter plots, heatmaps, clustered matrices, boxplots, regression plots. Critically evaluate and assess the applicability of different visualization techniques for a specific task. Obtaining basic summary statistics such as mean, median, standard deviation, variance and range. Using programming techniques to perform more advanced analysis such as multi-collinearity analysis.
Clustering Analysis
Introduction to the concept and motivation for clustering. Examine the operation of a standard clustering algorithm such as k-means. Case study illustrating the advantages and limitations of applying k-means to a dataset form a specific application domain.
Classification
Overview of the concept of classification and its role solving real-world problems. Introduction to a standard category of classification algorithm such as decision trees. Case study to illustrate the application of classification to a dataset. Overview of k-fold cross validation as an appropriate method of evaluation.
Assessment Breakdown%
Course Work100.00%
Course Work
Assessment Type Assessment Description Outcome addressed % of total Assessment Date
Open-book Examination Perform importation, cleaning and manipulation of a dataset and perform exploratory data analysis. 1,2 20.0 Week 6
Project Complete a comprehensive analysis of a real-world dataset and produce a report documenting findings and incorporating appropriate visualizations. 1,2,3 30.0 Week 8
Project Select and apply appropriate classification techniques to a dataset from a specific application domain. Findings should be documented and supported with appropriate visualisations. 3,4,5 50.0 Sem End
No End of Module Formal Examination
Reassessment Requirement
Coursework Only
This module is reassessed solely on the basis of re-submitted coursework. There is no repeat written examination.

The institute reserves the right to alter the nature and timings of assessment

 

Module Workload

Workload: Full Time
Workload Type Workload Description Hours Frequency Average Weekly Learner Workload
Lecture Lecture delivering theory underpinning learning outcomes. 2.0 Every Week 2.00
Lab Practical computer-based lab supporting learning outcomes. 2.0 Every Week 2.00
Independent Learning Independent Student Learning. 3.0 Every Week 3.00
Total Hours 7.00
Total Weekly Learner Workload 7.00
Total Weekly Contact Hours 4.00
Workload: Part Time
Workload Type Workload Description Hours Frequency Average Weekly Learner Workload
Lab Practical computer-based lab supporting learning outcomes. 2.0 Every Week 2.00
Lecture Lecture delivering theory underpinning learning outcomes. 2.0 Every Week 2.00
Independent Learning Independent Student Learning. 3.0 Every Week 3.00
Total Hours 7.00
Total Weekly Learner Workload 7.00
Total Weekly Contact Hours 4.00
 

Module Resources

Recommended Book Resources
  • Wes McKinney 2012, Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 1st Ed., O'Reilly Media [ISBN: 9781449319793]
  • John V. Guttag 2016, Introduction to Computation and Programming Using Python: With Application to Understanding Data, 2nd Ed., MIT Press [ISBN: 9780262529624]
  • Joel Grus 2015, Data Science from Scratch: First Principles with Python, 1st Ed., O'Reilly Media [ISBN: 9781491901427]
Supplementary Book Resources
  • Clinton W. Brownley 2015, Foundations for Analytics with Python, 1st Ed., O'Reilly Media [ISBN: 9781491922538]
  • Fabio Nelli 2015, Python Data Analytics: Data Analysis and Science using pandas, matplotlib and the Python Programming Language, 1st Ed., Apress [ISBN: 9781484209592]
This module does not have any article/paper resources
Other Resources
 

Module Delivered in

Programme Code Programme Semester Delivery
CR_ESMPR_8 Bachelor of Engineering (Honours) in Smart Product Engineering 8 Group Elective 2
CR_KSDEV_8 Bachelor of Science (Honours) in Software Development 5 Mandatory
CR_KDNET_8 Bachelor of Science (Honours) in Computer Systems 5 Elective
CR_KCOMP_7 Bachelor of Science in Software Development 5 Mandatory

Cork Institute of Technology
Rossa Avenue, Bishopstown, Cork

Tel: 021-4326100     Fax: 021-4545343
Email: help@cit.edu.ie