Students

STAT302 – Graphics, Multivariate Methods and Data Mining

2016 – S2 Day

General Information

Download as PDF
Unit convenor and teaching staff Unit convenor and teaching staff
A/Prof Ayse Aysin Bilgin
Contact via E-mail
AHH Level 2, Room 2.367
Mon 4-5pm & Thuesday 11am -12pm
Tutor
Brett Baczelis
Contact via e-mail
Credit points Credit points
3
Prerequisites Prerequisites
6cp at 200 level including (STAT270(P) or STAT271(P) or BIOL235(P) or PSY222(P) or PSY248(P))
Corequisites Corequisites
Co-badged status Co-badged status
Unit description Unit description
This unit introduces statistical tools for multivariate data analysis such as statistical graphics, discriminant analysis, principal component analysis, cluster analysis and an introduction to data mining, especially classification. Statistical packages are used extensively to illustrate the concepts in lectures and tutorials. Students are given opportunities to share their learning with their peers in tutorials, by presenting to their peers such as solutions to a problem posed in an earlier class, or summary of a specific weeks’ learning in one or two slides, similar to 3 minute thesis presentations.

Important Academic Dates

Information about important academic dates including deadlines for withdrawing from units are available at https://www.mq.edu.au/study/calendar-of-dates

Learning Outcomes

On successful completion of this unit, you will be able to:

  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Choose the appropriate statistical analysis, for a given data set, from a wide range of methods based on multivariate methods and data mining;
  • Choose appropriate graphical techniques for displaying data;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

General Assessment Information

Assessment submissions All assessments should be submitted electronically on iLearn (Turnit-in), by the given due date and time. For hand written solutions, students need to scan their work and include the hand written part within the document (such as word or PDFs) submitted within ilearn.

Extensions and penalties No extensions will be granted, except for cases where a student has a serious and unavoidable disruption to studies. In this case, an application for disruption of studies is required from the student on the ask.mq.edu.au system and needs to be approved by the Lecturer in charge. Late submissions without an approved application will not be marked and students will be given zero marks for the assessment task hat was late.

Assessment Tasks

Name Weighting Due
Participation 5% Weekly
Presentation 5% Weekly
Assignments (x2) 30% Week 6 & Week 11
Final Examination 60% Exam Timetable

Participation

Due: Weekly
Weighting: 5%

*Weeks 2 – 12 inclusive. 

Every week lecture and tutorial participation will be monitored and most weeks there will be set homework to submit to the lecturer at the start of the following lecture.


On successful completion you will be able to:
  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Presentation

Due: Weekly
Weighting: 5%

Each student will be given an opportunity to present to class either in the Lecture or in the Tutorial. The presentations will start on Week 2 until each student presents at least once.

Each week there will be three student presentations. Students are encouraged to volunteer for these presentations so that a timetable could be created, in the case of no one volunteering a random selection will be used. The three presentations will in the form of no more than two power point slides. The specifics of the presentations are:

1. Summary of the previous week's lecture (at the beginning of the lecture)

2. The most important aspects of the previous week's lecture and/or tutorial (at the beginning of the tutorial)

3. The most important aspects of the current week's lecture and/or tutorial (at the end of the tutorial)


On successful completion you will be able to:
  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assignments (x2)

Due: Week 6 & Week 11
Weighting: 30%

There will be two individual assignments due in weeks 6 and 11. The assignment questions will be made available through iLearn.

There is no “group work” assessment in this unit, however students are encouraged to work together and help each other to learn.


On successful completion you will be able to:
  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Choose the appropriate statistical analysis, for a given data set, from a wide range of methods based on multivariate methods and data mining;
  • Choose appropriate graphical techniques for displaying data;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Final Examination

Due: Exam Timetable
Weighting: 60%

The examination will examine any material covered throughout the course. Students may bring one A4 sized sheet of hand written notes, formulae, etc., which may be written on both sides and is easily readable. This summary must be submitted with your exam paper and is marked. No other materials such as lecture notes and textbooks are permitted. 

Calculators will be needed but must not be of the text/programmable type.


On successful completion you will be able to:
  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Choose the appropriate statistical analysis, for a given data set, from a wide range of methods based on multivariate methods and data mining;
  • Choose appropriate graphical techniques for displaying data;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Delivery and Resources

Lectures begin in Week 1. 

Tutorials begin in Week 2.

The timetable for classes can be found on the University web site at: http://www.timetables.mq.edu.au/

Recommended Texts and/or Materials 

There are no required texts for this unit, but the following list provides useful references, which are available in the Library.

Weeks 1-6 Material

Chambers J M et al (1983) Graphical Methods for Data Analysis. (QA276.3 .G73/1983)

Cleveland W S (1994) Elements of Graphing Data. (QA90 .C54/1994)

Cleveland W S & McGill M E (1988) Dynamic Graphics for Statistics. (QA276.3.D96/1988)

Du Toit S H C et al (1986) Graphical Exploratory Data Analysis. (QA276.3 .D778/1986)

Ehrenberg A S C (1982)Primer in Data Reduction. (QA276.12 .E37/1982)

Tufte E R (2001) The Visual Display of Quantitative Information. (QA276.3 T8 2001)

Weeks 7-13 Material

Everitt B S et al (2001) Applied multivariate data analysis. (QA278 .E914/2001)

Johnson, D.E (1998) Applied Multivariate Methods for Data Analysts. (QA278.J615/1998)

Johnson, R.A. & Wichern, D.W. (2002) Applied Multivariate Statistical Analysis.(QA278 .J63/2002)

Manly, B F J (2004) Multivariate Statistical Methods - A Primer. (QA278 .M35 2004)

iLearn

There is an iLearn site for this subject that will contain all the required course materials and allows communication between students. You can access the unit iLearn site from the address http://learn.mq.edu.au using your Student ID number and myMQ Portal password. You can only access the material if you are enrolled in the unit. If you have any problems accessing this website, go to the Online Teaching Facility support web page at http://online.mq.edu.au/docs/tecinf.html

The lecturer will make announcements via the iLearn. You should regularly log in and read the posts at least twice a week.

The Discussion Board on iLearn can be used to communicate with other students. 

Software

The main software packages that will be used are IBM SPSS Analytics, IBM SPSS Modeler, R (it is freely available from http://cran.r-project.org/), Excel & Microsoft Word. We might also use Mondrian (open source, available from http://rosuda.org/mondrian/Mondrian.html).

Unit Schedule

Week

Topic

Introduction & presenting data numerically

Good and bad graphical displays

Choosing different graphic displays

Displaying multivariate data

Similarities and distances

Hierarchical cluster analysis

7

K-means clustering

 

Mid-semester break - two weeks

Public Holiday

Eigenvalues and Eigenvectors

Principal component analysis

10 

Discriminant analysis

11

Multiple discriminant analysis

12 

Classification and regression trees

13 

Review

The order of the lectures might change.

Policies and Procedures

Macquarie University policies and procedures are accessible from Policy Central. Students should be aware of the following policies in particular with regard to Learning and Teaching:

Academic Honesty Policy http://mq.edu.au/policy/docs/academic_honesty/policy.html

New Assessment Policy in effect from Session 2 2016 http://mq.edu.au/policy/docs/assessment/policy_2016.html. For more information visit http://students.mq.edu.au/events/2016/07/19/new_assessment_policy_in_place_from_session_2/

Assessment Policy prior to Session 2 2016 http://mq.edu.au/policy/docs/assessment/policy.html

Grading Policy prior to Session 2 2016 http://mq.edu.au/policy/docs/grading/policy.html

Grade Appeal Policy http://mq.edu.au/policy/docs/gradeappeal/policy.html

Complaint Management Procedure for Students and Members of the Public http://www.mq.edu.au/policy/docs/complaint_management/procedure.html​

Disruption to Studies Policy http://www.mq.edu.au/policy/docs/disruption_studies/policy.html The Disruption to Studies Policy is effective from March 3 2014 and replaces the Special Consideration Policy.

In addition, a number of other policies can be found in the Learning and Teaching Category of Policy Central.

Student Code of Conduct

Macquarie University students have a responsibility to be familiar with the Student Code of Conduct: https://students.mq.edu.au/support/student_conduct/

Results

Results shown in iLearn, or released directly by your Unit Convenor, are not confirmed as they are subject to final approval by the University. Once approved, final results will be sent to your student email address and will be made available in eStudent. For more information visit ask.mq.edu.au.

Student Support

Macquarie University provides a range of support services for students. For details, visit http://students.mq.edu.au/support/

Learning Skills

Learning Skills (mq.edu.au/learningskills) provides academic writing resources and study strategies to improve your marks and take control of your study.

Student Services and Support

Students with a disability are encouraged to contact the Disability Service who can provide appropriate help with any issues that arise during their studies.

Student Enquiries

For all student enquiries, visit Student Connect at ask.mq.edu.au

IT Help

For help with University computer systems and technology, visit http://www.mq.edu.au/about_us/offices_and_units/information_technology/help/

When using the University's IT, you must adhere to the Acceptable Use of IT Resources Policy. The policy applies to all who connect to the MQ network including students.

Graduate Capabilities

Creative and Innovative

Our graduates will also be capable of creative thinking and of creating knowledge. They will be imaginative and open to experience and capable of innovation at work and in the community. We want them to be engaged in applying their critical, creative thinking.

This graduate capability is supported by:

Learning outcomes

  • Choose appropriate graphical techniques for displaying data;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assessment tasks

  • Participation
  • Presentation
  • Assignments (x2)

Capable of Professional and Personal Judgement and Initiative

We want our graduates to have emotional intelligence and sound interpersonal skills and to demonstrate discernment and common sense in their professional and personal judgement. They will exercise initiative as needed. They will be capable of risk assessment, and be able to handle ambiguity and complexity, enabling them to be adaptable in diverse and changing environments.

This graduate capability is supported by:

Learning outcomes

  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Choose the appropriate statistical analysis, for a given data set, from a wide range of methods based on multivariate methods and data mining;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assessment tasks

  • Participation
  • Presentation
  • Assignments (x2)
  • Final Examination

Discipline Specific Knowledge and Skills

Our graduates will take with them the intellectual development, depth and breadth of knowledge, scholarly understanding, and specific subject content in their chosen fields to make them competent and confident in their subject or profession. They will be able to demonstrate, where relevant, professional technical competence and meet professional standards. They will be able to articulate the structure of knowledge of their discipline, be able to adapt discipline-specific knowledge to novel situations, and be able to contribute from their discipline to inter-disciplinary solutions to problems.

This graduate capability is supported by:

Learning outcomes

  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Choose the appropriate statistical analysis, for a given data set, from a wide range of methods based on multivariate methods and data mining;
  • Choose appropriate graphical techniques for displaying data;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assessment tasks

  • Participation
  • Presentation
  • Assignments (x2)
  • Final Examination

Critical, Analytical and Integrative Thinking

We want our graduates to be capable of reasoning, questioning and analysing, and to integrate and synthesise learning and knowledge from a range of sources and environments; to be able to critique constraints, assumptions and limitations; to be able to think independently and systemically in relation to scholarly activity, in the workplace, and in the world. We want them to have a level of scientific and information technology literacy.

This graduate capability is supported by:

Learning outcomes

  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Choose the appropriate statistical analysis, for a given data set, from a wide range of methods based on multivariate methods and data mining;
  • Choose appropriate graphical techniques for displaying data;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assessment tasks

  • Participation
  • Presentation
  • Assignments (x2)
  • Final Examination

Problem Solving and Research Capability

Our graduates should be capable of researching; of analysing, and interpreting and assessing data and information in various forms; of drawing connections across fields of knowledge; and they should be able to relate their knowledge to complex situations at work or in the world, in order to diagnose and solve problems. We want them to have the confidence to take the initiative in doing so, within an awareness of their own limitations.

This graduate capability is supported by:

Learning outcomes

  • Choose the appropriate statistical analysis, for a given data set, from a wide range of methods based on multivariate methods and data mining;
  • Choose appropriate graphical techniques for displaying data;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assessment tasks

  • Participation
  • Presentation
  • Assignments (x2)
  • Final Examination

Effective Communication

We want to develop in our students the ability to communicate and convey their views in forms effective with different audiences. We want our graduates to take with them the capability to read, listen, question, gather and evaluate information resources in a variety of formats, assess, write clearly, speak effectively, and to use visual communication and communication technologies as appropriate.

This graduate capability is supported by:

Learning outcomes

  • Choose appropriate graphical techniques for displaying data;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assessment tasks

  • Participation
  • Presentation
  • Assignments (x2)
  • Final Examination

Engaged and Ethical Local and Global citizens

As local citizens our graduates will be aware of indigenous perspectives and of the nation's historical context. They will be engaged with the challenges of contemporary society and with knowledge and ideas. We want our graduates to have respect for diversity, to be open-minded, sensitive to others and inclusive, and to be open to other cultures and perspectives: they should have a level of cultural literacy. Our graduates should be aware of disadvantage and social justice, and be willing to participate to help create a wiser and better society.

This graduate capability is supported by:

Learning outcomes

  • Choose the appropriate statistical analysis, for a given data set, from a wide range of methods based on multivariate methods and data mining;
  • Use a statistical computer package to carry out chosen analyses and interpret the results with understanding; present the results of analyses in a form which is suitable for publication;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assessment tasks

  • Participation
  • Presentation
  • Assignments (x2)

Socially and Environmentally Active and Responsible

We want our graduates to be aware of and have respect for self and others; to be able to work with others as a leader and a team player; to have a sense of connectedness with others and country; and to have a sense of mutual obligation. Our graduates should be informed and active participants in moving society towards sustainability.

This graduate capability is supported by:

Learning outcomes

  • Understand the principles underlying graphics, multivariate methods and data mining;
  • Apply statistical techniques to problems arising from diverse fields of research.

Assessment tasks

  • Participation
  • Assignments (x2)
  • Final Examination

Changes from Previous Offering

The due date of the first assignment in 2015 was in Week 5, it is going to be in Week 6 this year.

Teaching and Learning Strategy

 

All unit related queries should be directed to the unit convenor A/Prof Ayse Bilgin using the Macquarie University e-mail system.

Lectures begin in Week 1.

Tutorials (1 x 2 hour tutorial) will start in the second week. In weeks 2 to 13 you will be required to submit homework and this work will also count towards your assessment.   

The timetable for classes can be found on the University web site at https://timetables.mq.edu.au/

Students are expected to:

  • attend all the lectures (beginning in Week 1) and tutorials (beginning in Week 2);
  • collect their marked assessments and have a discussion with their peer and/or lecturer to improve their learning based on the feedback provided on the assessment.