Students

COMP8430 – Advanced Computer Vision and Action

2024 – Session 1, In person-scheduled-weekday, North Ryde

General Information

Download as PDF
Unit convenor and teaching staff Unit convenor and teaching staff Convenor, Lecturer
Diego Molla-Aliod
4RPD 358
Lecturer, Tutor
Xiaohan Yu
4RPD 323
Credit points Credit points
10
Prerequisites Prerequisites
COMP6420
Corequisites Corequisites
Co-badged status Co-badged status
Unit description Unit description

Computer vision is at the centre of AI technology, and the ability of an AI agent to take actions depends on it. Robots need it to navigate in a dynamic environment and self-driving cars need it to navigate on a road without causing harm to itself and others. This unit will expose students to fundamentals of computer and human vision, image formation, low-level image processing, and reinforcement learning techniques. Students will also gain an understanding of various computer vision tasks such as object detection and image style transfer, and in linking computer vision to other modalities like language (such as in image captioning) and to AI agents learning actions (such as in game-playing or instruction-following). In doing so they will gain advanced skills involving cutting-edge deep learning models and related technologies.

Important Academic Dates

Information about important academic dates including deadlines for withdrawing from units are available at https://www.mq.edu.au/study/calendar-of-dates

Learning Outcomes

On successful completion of this unit, you will be able to:

  • ULO1: Identify the key computer vision applications that match current and emerging industry needs.
  • ULO2: Explain the main techniques that are used to develop and implement computer vision applications, and how they link to the ability of AI to act in the world.
  • ULO3: Implement computer vision applications using common tools and libraries used in industry.
  • ULO4: Design AI agents that can act in environments using reinforcement learning techniques.
  • ULO5: Design computer vision applications using advanced deep learning techniques.
  • ULO6: Apply computer vision methods and techniques to industry applications using real data.
  • ULO7: Apply good practice in the development, monitoring, and deployment of computer vision systems

General Assessment Information

The assessment of this unit consists of three assignments and a final exam. You will submit the solutions to the three assignments via iLearn by the due date. The final examination is a closed book examination, and will be taken in person during the exam period.

Late Assessment Submission Penalty

Unless a Special Consideration request has been submitted and approved, a 5% penalty (of the total possible mark of the task) will be applied for each day a written report or presentation assessment is not submitted, up until the 7th day (including weekends). After the 7th day, a grade of ‘0’ will be awarded even if the assessment is submitted. The submission time for all uploaded assessments is 11:55 pm. A 1-hour grace period will be provided to students who experience a technical concern.  For any late submission of time-sensitive tasks, such as scheduled tests/exams, performance assessments/presentations, and/or scheduled practical assessments/labs, please apply for Special Consideration

Assessments where Late Submissions will be accepted

  • Assignment 1 - YES, Standard Late Penalty applies
  • Assignment 2 - YES, Standard Late Penalty applies
  • Major Project - YES, Standard Late Penalty applies
  • Exam - NO, unless Special Consideration is Granted

Supplementary Exam

If you receive Special Consideration for the final exam, a supplementary exam will be scheduled after the normal exam period, following the release of marks. By making a special consideration application for the final exam you are declaring yourself available for a resit during the supplementary examination period and will not be eligible for a second special consideration approval based on pre-existing commitments. Please ensure you are familiar with the policy prior to submitting an application. Approved applicants will receive an individual notification one week prior to the exam with the exact date and time of their supplementary examination.

Requirements to Pass this Unit

To pass this unit, you must achieve a total mark equal or greater than 50%. This unit does not have hurdle assessments.

Special Consideration

The Special Consideration Policy aims to support students who have been impacted by short-term circumstances or events that are serious, unavoidable and significantly disruptive, and which may affect their performance in assessment. If you experience circumstances or events that affect your ability to complete the assessments in this unit on time, please inform the convenor and submit a Special Consideration request through ask.mq.edu.au.  

Assessment Tasks

Name Weighting Hurdle Due
Assignment 1 20% No Week 8
Assignment 2 20% No Week 11
Major Project 40% No Week 12
Examination 20% No Examination period

Assignment 1

Assessment Type 1: Programming Task
Indicative Time on Task 2: 20 hours
Due: Week 8
Weighting: 20%

 

Implement a practical computer vision application using deep learning techniques.

 


On successful completion you will be able to:
  • Identify the key computer vision applications that match current and emerging industry needs.
  • Explain the main techniques that are used to develop and implement computer vision applications, and how they link to the ability of AI to act in the world.
  • Design computer vision applications using advanced deep learning techniques.

Assignment 2

Assessment Type 1: Programming Task
Indicative Time on Task 2: 20 hours
Due: Week 11
Weighting: 20%

 

Implement an AI agent using reinforcement learning

 


On successful completion you will be able to:
  • Explain the main techniques that are used to develop and implement computer vision applications, and how they link to the ability of AI to act in the world.
  • Design AI agents that can act in environments using reinforcement learning techniques.

Major Project

Assessment Type 1: Project
Indicative Time on Task 2: 40 hours
Due: Week 12
Weighting: 40%

 

Design, implement, deploy, evaluate, and implement monitor an industrial grade computer vision application that uses realistic data and requires advanced deep learning techniques.

 


On successful completion you will be able to:
  • Identify the key computer vision applications that match current and emerging industry needs.
  • Explain the main techniques that are used to develop and implement computer vision applications, and how they link to the ability of AI to act in the world.
  • Implement computer vision applications using common tools and libraries used in industry.
  • Design computer vision applications using advanced deep learning techniques.
  • Apply computer vision methods and techniques to industry applications using real data.
  • Apply good practice in the development, monitoring, and deployment of computer vision systems

Examination

Assessment Type 1: Examination
Indicative Time on Task 2: 2 hours
Due: Examination period
Weighting: 20%

 

Demonstrate an understanding of a selection of topics covered in the unit.

 


On successful completion you will be able to:
  • Identify the key computer vision applications that match current and emerging industry needs.
  • Explain the main techniques that are used to develop and implement computer vision applications, and how they link to the ability of AI to act in the world.
  • Implement computer vision applications using common tools and libraries used in industry.
  • Design AI agents that can act in environments using reinforcement learning techniques.
  • Design computer vision applications using advanced deep learning techniques.
  • Apply computer vision methods and techniques to industry applications using real data.

1 If you need help with your assignment, please contact:

  • the academic teaching staff in your unit for guidance in understanding or completing this type of assessment
  • the Writing Centre for academic skills support.

2 Indicative time-on-task is an estimate of the time required for completion of the assessment task and is subject to individual variation

Delivery and Resources

During most of the weeks, there will be 2 hours of lectures and 2 hours ot Practicals. All the required software will be installed in the computers of the PC labs allocated for the Practicals but you are free to bring your own device and install the software.

Lectures and practicals start on Week 1.

Delivery Modes

All lectures and Practicals are delivered in campus. The lectures will also be recorded and recordings of the lecture will be available via iLearn. There will not be recordings of the Practical sessions.

Methods of Communication

We will communicate with you via your university email or through announcements in iLearn. Queries to convenors can be made via the Contact tool in iLearn or sent to diego.molla-aliod@mq.edu.au from your university email address.

Reading

Every week there will be a list of required and recommended readings. The list will be maintained in iLearn.

Most of the contents of the unit will be based on the following books:

  • François Chollet (2021). Deep Learning with Python, 2nd Edition. Manning Publications. Available in the library.
  • Valliappa Lakshmanan, Martin Görner, Ryan Gillard (2021), Practical Machine Learning for Computer Vision. O'Reilly. Available in the library.

  • Mohammed Elgendy (2020), Deep Learning for Vision Systems. Manning Publications. Available in the library.

  • Rajalingappaa Shanmugamani (2018), Deep Learning for Computer Vision: expert techniques to train advanced neural networks using TensorFlow and Keras. Pakt Publishing. Available in the library.

Software

The main software for this unit is Anaconda for Python 3.11 with the following packages:

  1. numpy
  2. scipy
  3. pandas
  4. scikit-learn
  5. scikit-image
  6. gensim
  7. tensorflow
  8. opencv
  9. pillow

Unit Web Page

Note that the majority of the unit materials is publicly available while some material requires you to log in to iLearn to access it.

The unit will make extensive use of discussion boards hosted within iLearn. Please post questions there, they will be monitored by the staff on the unit.

COVID Information

For the latest information on the University’s response to COVID-19, please refer to the Coronavirus infection page on the Macquarie website: https://www.mq.edu.au/about/coronavirus-faqs. Remember to check this page regularly in case the information and requirements change during semester. If there are any changes to this unit in relation to COVID, these will be communicated via iLearn.

Unit Schedule

The following schedule is tentative and is only an indication of the actual contents. The final schedule will be available in iLearn.

Week Topic
1 Review of Basic Computer Vision
2 Managing your Computer Vision Project
3 Transfer Learning
4 Generative Adversarial Networks
5 DeepDream and Neural Style Transfer
6 Visual Embeddings
7 Introduction to Reinforcement Learning
8 Deep Reinforcement Learning
  RECESS
9 Advanced Reinforcement Learning for Computer Vision and Action
10 Advanced Object Detection
11 Virtual Agents
12 Relationship between vision and action
13 Review for exam

 

Policies and Procedures

Macquarie University policies and procedures are accessible from Policy Central (https://policies.mq.edu.au). Students should be aware of the following policies in particular with regard to Learning and Teaching:

Students seeking more policy resources can visit Student Policies (https://students.mq.edu.au/support/study/policies). It is your one-stop-shop for the key policies you need to know about throughout your undergraduate student journey.

To find other policies relating to Teaching and Learning, visit Policy Central (https://policies.mq.edu.au) and use the search tool.

Student Code of Conduct

Macquarie University students have a responsibility to be familiar with the Student Code of Conduct: https://students.mq.edu.au/admin/other-resources/student-conduct

Results

Results published on platform other than eStudent, (eg. iLearn, Coursera etc.) or released directly by your Unit Convenor, are not confirmed as they are subject to final approval by the University. Once approved, final results will be sent to your student email address and will be made available in eStudent. For more information visit ask.mq.edu.au or if you are a Global MBA student contact globalmba.support@mq.edu.au

Academic Integrity

At Macquarie, we believe academic integrity – honesty, respect, trust, responsibility, fairness and courage – is at the core of learning, teaching and research. We recognise that meeting the expectations required to complete your assessments can be challenging. So, we offer you a range of resources and services to help you reach your potential, including free online writing and maths support, academic skills development and wellbeing consultations.

Student Support

Macquarie University provides a range of support services for students. For details, visit http://students.mq.edu.au/support/

The Writing Centre

The Writing Centre provides resources to develop your English language proficiency, academic writing, and communication skills.

The Library provides online and face to face support to help you find and use relevant information resources. 

Student Services and Support

Macquarie University offers a range of Student Support Services including:

Student Enquiries

Got a question? Ask us via AskMQ, or contact Service Connect.

IT Help

For help with University computer systems and technology, visit http://www.mq.edu.au/about_us/offices_and_units/information_technology/help/

When using the University's IT, you must adhere to the Acceptable Use of IT Resources Policy. The policy applies to all who connect to the MQ network including students.

Changes from Previous Offering

This is the first offering of this unit.


Unit information based on version 2024.02 of the Handbook