Unit convenor and teaching staff |
Unit convenor and teaching staff
Convenor, lecturer
Diego Molla-Aliod
Contact via diego.molla-aliod@mq.edu.au
E6A331
See: http://web.science.mq.edu.au/~diego/
Lecturer
Amin Beheshti
Contact via amin.beheshti@mq.edu.au
E6A340
Thursday 9-10am
|
---|---|
Credit points |
Credit points
3
|
Prerequisites |
Prerequisites
39cp at 100 level or above including COMP257
|
Corequisites |
Corequisites
ISYS358
|
Co-badged status |
Co-badged status
|
Unit description |
Unit description
Even simple tasks like counting elements can seem impossible when the amount of data to process is huge. This unit explores some of the key
aspects related to processing and mining information from large volumes of data. We present technology commonly used in industry such as map-reduce, and show how a range of data processing methods can be realised using map-reduce. Especial emphasis will be placed in the adaptation of data mining techniques for large volumes of data and for data streaming.
|
Information about important academic dates including deadlines for withdrawing from units are available at https://www.mq.edu.au/study/calendar-of-dates
On successful completion of this unit, you will be able to:
All assignments will be submitted using iLearn. The results of all assignments will be available via iLearn.
Late submission to the assignments will be penalised with the following deductions:
The final exam is a hurdle assessment. This means that:
The final mark of the unit will be obtained by summing the marks of all the assessment tasks for a total mark of 100. In order to pass the unit:
Name | Weighting | Hurdle | Due |
---|---|---|---|
Assignment 1 | 5% | No | Week 3 |
Assignment 2 | 20% | No | Week 8 |
Assignment 3 | 15% | No | Week 12 |
Final Exam | 60% | Yes | Examination period |
Due: Week 3
Weighting: 5%
In this assignment you will acquire hands-on experience in designing, implementing and querying a NoSQL database, i.e. MongoDB.
Due: Week 8
Weighting: 20%
In this assignment you will implement MapReduce techniques for the processing of Big Data. You will build your assignment on top of Hadoop (i.e. an open-source version of MapReduce written in Java).
Due: Week 12
Weighting: 15%
In this assignment you will implement a non-trivial problem that processes Big Data.
Due: Examination period
Weighting: 60%
This is a hurdle assessment task (see assessment policy for more information on hurdle assessment tasks)
The final exam will focus on the theoretical aspects of the unit, including algorithms and implementation issues.
This is a hurdle assessment. This means that you need to pass the exam in order to pass the unit.
Much of the contents of the unit will be based on the following books:
Additional material including lecture notes will be made available during the semester. See the unit schedule for a listing of the most relevant reading for each week.
The following software is used in COMP336:
This software is installed in the labs; you should also ensure that you have working copies of all the above on your own machine. Note that some of this software requires internet access.
Many packages come in various versions; to avoid potential incompatibilities, you should install versions as close as possible to those used in the labs.
The unit web page will be hosted in iLearn, where you will need to login using your Student One ID and password. The unit will make extensive use of discussion boards also hosted in iLearn. Please post questions there, they will be monitored by the staff on the unit.
Below is the tentative unit schedule.
Week | Topic | Reading |
---|---|---|
1 | Data and Big Data | Leskovec et al. Ch 1 |
2 | Organizing Big Data | |
3 | Curating Big Data | |
4 | Processing Big Data: Cloud computing | |
5 | Processing Big Data: MapReduce (I) | Leskovec et al. Ch 2 |
6 | Processing Big Data: MapReduce (II) | Leskovec et al. Ch 2 |
7 | Finding Similar Items | Leskovec et al. Ch 3 |
RECESS | ||
8 | Mining Data Streams | Leskovec et al. Ch 4 |
9 | Link Analysis | Leskovec et al. Ch 5 |
10 | Frequent Itemsets | Leskovec et al. Ch 7 |
11 | Large-scale Machine Learning (I) | Leskovec et al. Ch 12 |
12 | Large-scale Machine Learning (II) | Leskovec et al. Ch 12 |
Macquarie University policies and procedures are accessible from Policy Central (https://staff.mq.edu.au/work/strategy-planning-and-governance/university-policies-and-procedures/policy-central). Students should be aware of the following policies in particular with regard to Learning and Teaching:
Undergraduate students seeking more policy resources can visit the Student Policy Gateway (https://students.mq.edu.au/support/study/student-policy-gateway). It is your one-stop-shop for the key policies you need to know about throughout your undergraduate student journey.
If you would like to see all the policies relevant to Learning and Teaching visit Policy Central (https://staff.mq.edu.au/work/strategy-planning-and-governance/university-policies-and-procedures/policy-central).
Macquarie University students have a responsibility to be familiar with the Student Code of Conduct: https://students.mq.edu.au/study/getting-started/student-conduct
Results shown in iLearn, or released directly by your Unit Convenor, are not confirmed as they are subject to final approval by the University. Once approved, final results will be sent to your student email address and will be made available in eStudent. For more information visit ask.mq.edu.au.
If you receive special consideration for the final exam, a supplementary exam will be scheduled in the interval between the regular exam period and the start of the next session. By making a special consideration application for the final exam you are declaring yourself available for a resit during the supplementary examination period and will not be eligible for a second special consideration approval based on pre-existing commitments. Please ensure you are familiar with the policy prior to submitting an application. You can check the supplementary exam information page on FSE101 in iLearn (bit.ly/FSESupp) for dates, and approved applicants will receive an individual notification one week prior to the exam with the exact date and time of their supplementary examination.
If you are given a second opportunity to sit the final examination as a result of failing to meet the minimum mark required, you will be offered that chance during the same supplementary examination period and will be notified of the exact day and time after the publication of final results for the unit.
Macquarie University provides a range of support services for students. For details, visit http://students.mq.edu.au/support/
Learning Skills (mq.edu.au/learningskills) provides academic writing resources and study strategies to improve your marks and take control of your study.
Students with a disability are encouraged to contact the Disability Service who can provide appropriate help with any issues that arise during their studies.
For all student enquiries, visit Student Connect at ask.mq.edu.au
For help with University computer systems and technology, visit http://www.mq.edu.au/about_us/offices_and_units/information_technology/help/.
When using the University's IT, you must adhere to the Acceptable Use of IT Resources Policy. The policy applies to all who connect to the MQ network including students.
Our graduates will also be capable of creative thinking and of creating knowledge. They will be imaginative and open to experience and capable of innovation at work and in the community. We want them to be engaged in applying their critical, creative thinking.
This graduate capability is supported by:
We want our graduates to have emotional intelligence and sound interpersonal skills and to demonstrate discernment and common sense in their professional and personal judgement. They will exercise initiative as needed. They will be capable of risk assessment, and be able to handle ambiguity and complexity, enabling them to be adaptable in diverse and changing environments.
This graduate capability is supported by:
Our graduates will have enquiring minds and a literate curiosity which will lead them to pursue knowledge for its own sake. They will continue to pursue learning in their careers and as they participate in the world. They will be capable of reflecting on their experiences and relationships with others and the environment, learning from them, and growing - personally, professionally and socially.
This graduate capability is supported by:
Our graduates will take with them the intellectual development, depth and breadth of knowledge, scholarly understanding, and specific subject content in their chosen fields to make them competent and confident in their subject or profession. They will be able to demonstrate, where relevant, professional technical competence and meet professional standards. They will be able to articulate the structure of knowledge of their discipline, be able to adapt discipline-specific knowledge to novel situations, and be able to contribute from their discipline to inter-disciplinary solutions to problems.
This graduate capability is supported by:
We want our graduates to be capable of reasoning, questioning and analysing, and to integrate and synthesise learning and knowledge from a range of sources and environments; to be able to critique constraints, assumptions and limitations; to be able to think independently and systemically in relation to scholarly activity, in the workplace, and in the world. We want them to have a level of scientific and information technology literacy.
This graduate capability is supported by:
Our graduates should be capable of researching; of analysing, and interpreting and assessing data and information in various forms; of drawing connections across fields of knowledge; and they should be able to relate their knowledge to complex situations at work or in the world, in order to diagnose and solve problems. We want them to have the confidence to take the initiative in doing so, within an awareness of their own limitations.
This graduate capability is supported by:
We want to develop in our students the ability to communicate and convey their views in forms effective with different audiences. We want our graduates to take with them the capability to read, listen, question, gather and evaluate information resources in a variety of formats, assess, write clearly, speak effectively, and to use visual communication and communication technologies as appropriate.
This graduate capability is supported by:
This is the first offering of the unit.