CSI5387 Concept Learning Systems/Machine Learning


Instructor

Nathalie Japkowicz
Office: STE 5-029
Phone: 562-5800 ext. 6693
E-mail: nat@site.uottawa.ca

Meeting Times and Locations

Office Hours and Locations

Overview

Machine Learning is the area of Artificial Intelligence concerned with the problem of building computer programs that automatically improve with experience. The intent of this course is to present a broad introduction to the principles and paradigms underlying machine learning, including presentations of its main approaches, discussions of its major theoretical issues, and overviews of its most important research themes.

Course Format

The course will consist of a mixture of regular lectures and student presentations. The regular lectures will cover descriptions and discussions of the major approaches to Machine Learning as well as of its major theoretical issues. The student presentations will focus on the most important themes we survey. This year, the focus will be on Big Data Analysis.

Evaluation

Students will be evaluated on short written commentaries and oral presentations of research papers (15%), on two homework assignments (25%), on a final exam (20%), and on a final class project of the student's choice (40%). For the class project, students can propose their own topic or choose from a list of suggested topics which will be made available at the beginning of the term. Project proposals will be due in mid-semester. For the research paper commentaries, students will work in teams of 3 or 4. However, homework, presentations and projects must be submitted/done individually.

Pre-Requisites

Students should have reasonable exposure to Artificial Intelligence and some programming experience in a high level language.

Required Textbooks

Additional References .

 

Class Notes

Class notes are available here.

Other Reading Material

Articles for the students to critique and present will be provided in a zipped directory by e-mail, or as they become available.

List of Major Approaches Surveyed

List of Theoretical Issues Considered

List of Major Themes Surveyed

All the themes surveyed in this course pertain to the currently very popular extension of machine learning known as “Big Data Analysis”. The themes surveyed will belong to the following list:

·         Graph Mining

·         Mining Social Networks

·         Data Streams Mining

·         Unstructured or Semi-Structured Data Mining

·         Data Mining with Heterogeneous Sources

·         Spatio-Temporal Data Mining

·         Issues of Trust and Provenance in Data Mining

·         Privacy in Data Mining

 

Homework Related material:

·         List of Themes/Papers for this year: The papers read by the class this year will be distributed by e-mail or on Blackboard. The theme for this year is Big Data Analysis. Each team of 3 or 4 students must submit their summary and critique every week that a chapter is read.

·        Presentations: Presentations will be done individually. Each student will speak for approximately 15 minutes and answer some of the students/instructor’s questions. The schedule for presentations is available here.

·        Assignment 1: Supervised Learning Part I, Unsupervised Learning, Evaluation Techniques Part I. Handed out on Week 4; to return on Week 7.

·        Assignment 2: Supervised Learning Part II, Evaluation Techniques Part II, Association Mining. Handed out on Week 7; to return on Week 10

·        Final Exam: The final exam will take place in class, on the Monday of Week 14. (Last day of classes)

 

Course Support:

· Suggested Outline for Paper Commentaries

· Final Exam Review Sheet

· Project Description

· Guidelines for the Final Project Report

 

Machine Learning Ressources on the Web:

· David Aha's Machine Learning Resource Page

· UCI Machine Learning

· WEKA

· Free Book: Information Theory, Inference, and Learning Algorithms, David MacKay

· R Code from the Japkowicz and Shah Evaluation Book



Syllabus:

Week

Topics

Readings

Week 1:

Sep 9

Organizational Meeting, Theoretical and Practical Overview of Machine Learning, Philosophical roots of machine Learning 

Texts:

 

·         Flach: Prologue, Chapters 1, 2

·         Japkowicz & Shah : Chapter 1

·         Class Notes : Week 1

·           




Week 2:

Sep 16

 Versions Space Learning, Decision Tree Learning

Texts:  

·         Flach: Chapters  4, 5

·         Japkowicz & Shah: Chapter 2

·         Class Notes: Week 2 – Part I & Part II

Week 3:

Sep 23


Artificial Neural Networks, Bayesian Learning

Texts:

·         Flach: Sections 7.1-2, Chapter 9

·         Class Notes: Week 3 – Part I & Part II


 

 

Week 4:

Sep 30





Experimental Evaluation of Learning Algorithms


Homework 1: Handed out today; Due: Wednesday, Oct 21

Texts:

·         Japkowicz & Shah: Chapters 3-6

·         Flach: Chapter 12

·         Class Notes: Week 4

Week 5

Oct 7

·         Instance-Based Learning and unsupervised learning

·         Weekly Theme: Introduction to Big Data Analysis

Texts:

·         Flach: Chapter 8

·         Class Notes: Week 5 – Part I & Part II

·         Weekly Theme papers

Week 6:

Oct 14

·         Rule Learning/Association Mining,

·         Weekly Theme: Data Stream Mining



STUDY BREAK  

Week 7

Oct 21

·         Support Vector Machines,

·         Weekly Theme: Mining Social Networks

Homework 1 DUE today

Project Proposal DUE today

Homework 2: Handed out today; Due: Wednesday, Nov 11

Texts:

·         Flach: Chapter 6

·         Class Notes: Week 7

·         Weekly Theme papers

Week 8:

Oct 28

STUDY BREAK

Texts:

·         Flach: Chapter 7

·         Class Notes: Week 8

·         Weekly Theme papers

Week 9:

Nov 4

·         Classifier Ensembles,

·         Weekly Theme: Trust, Provenance and Privacy

Texts:

·         Flach: Chapter 11

·         Class Notes: Week 9

·         Weekly Theme papers

Week 10:

Nov 11

·         Features

·         Weekly Theme: Applications of Big Data Analysis I: Science, Medicine and Cyber Security

Homework 2: DUE today

Texts:

·         Flach: Chapter 10

·         Class Notes: Week 10

·         Weekly Theme papers

Week 11:

Nov 18

·         Genetic Algorithms,

·         Weekly Theme: Applications of Big Data Analysis II: Industry and Business

Texts:

·         Class Notes: Week 11

·         Weekly Theme papers

Week 12:

Nov 25


FINAL PROJECT PRESENTATIONS

Week 13:

Dec 2

FINAL EXAM

FINAL PROJECT PRESENTATIONS