EE6485 Computer Vision

Course Description

Can computers understand the visual world as we do? This course treats vision as a process of inference from noisy and uncertain data and emphasizes probabilistic, statistical, data-driven approaches. Topics include image processing; segmentation, grouping, and boundary detection; recognition and detection; motion estimation and structure from motion. This class will also lead you to the discussion of applications regarding state-of-the-art techniques in recognition, detection, and video analysis.
The course will consist of five programming projects, one final project, and a few self-tutorial sessions (12 minutes for each team of 5 students). Please find information about projects and self-tutorial sessions in the syllabus.

Prerequisites

This course requires programming experience (mainly Python) as well as linear algebra, basic calculus, and basic probability. Previous knowledge of visual computing will be helpful.

Textbook

Readings will be assigned in "Computer Vision: Algorithms and Applications" by Richard Szeliski. The book is available for free online or available for purchase.

Resource

Awesome computer vision GitHub link
Awesome deep learning GitHub link

Grading

Your final grade will be made up from

60% 5 programming projects
30% final projects (includes proposal, midtern report, project pitch, project presentation, and project report). 5 students maximum (Project Ideas)
10% self-tutorial + class participation

You will lose 10% each day for late projects. However, you have up to three "late days" for the whole course. That is to say, the first 24 hours after the due date and time counts as 1 day, up to 48 hours is two and 72 for the third late day. "Late days" can be used across projects or consecutively on a single one. This will not be reflected in the initial grade reports for your assignment, but they will be factored in and distributed at the end of the semester so that you get the most points possible.

Contact Info and Office Hours

You can contact the professor with any of the following:

Office Hours

Min Sun / Delta (台達館) 962 · Appointment via email
趙浚宏 / EECS (資電館) 711 · Fri. 15:30 to 16:30
鄭欽安 / EECS (資電館) 722 · Fri. 15:30 to 16:30

Tentative Syllabus

Lecture	Class Dates	Topic	Slides	Reading	Extra Info (e.g., Homework/Exam)
1	F, Sept. 14	Python tutorial Git and Github	pdf1		homework 0 out
2	F, Sept. 21, Video Lecture	Introduction to computer vision and cameras & optics Light and color	pdf1 pdf2 pdf3	Szeliski 1, 2.1 (especially 2.1.5) Szeliski 2.2, 2.3, and 3.2
3	F, Sept. 28, Video Lecture	Image filtering	pdf1	Szeliski 3.4 Szeliski 3.5.2 and 8.1.1	homework 0 due homework 1 (hybrid image) out
4	F, Oct. 5	Thinking in frequency Image pyramids and applications	pdf1 pdf2	Szeliski 4.2
5	F, Oct. 12	Edge detection Interest points, corners, and local image features	pdf1 pdf2 pdf3	Szeliski 4.3
6	F, Oct. 19, Video Lecture	Feature matching and hough transform Model fitting and RANSAC self-tutorial	pdf1 pdf2	Szeliski 9 Szeliski 7	homework 1 due
7	F, Oct. 26, Video Lecture	self-tutorial		Szeliski 4.1.4 and 8.4	project proposal due homework 2 (image stitching) out
8	F, Nov. 2	Panorama Stitching Stereo and Structure from Motion self-tutorial	pdf1 pdf2
9	F, Nov. 9, Video Lecture	SfM & cere-solver Feature Tracking and Optical Flow self-tutorial		Szeliski 14 Szeliski 14.3.2	homework 2 due
10	F, Nov. 16	Machine learning intro and clustering Machine learning: classification		Szeliski 14.1	homework 3 (scene recognition) out
11	F, Nov. 23	Recognition overview, bag of features Large-scale instance recognition			homework 4 (face detection) out
12	F, Nov. 30	Detection with sliding windows: Viola Jones and Dalal Triggs Mixture of Gaussians and advanced feature encoding			homework 3 due midterm project report due
13	F, Dec. 7, Video Lecture	Modern Object Detection: DPM Modern Object Detection: Selective Search			homework 4 due homework 5 (deep classification) out
14	F, Dec. 14	Deep Learning: introduction Deep Learning: CNN
15	F, Dec. 21	Deep Learning: recent work Deep Learning: object detector			homework 5 due
16	F, Dec. 28	Project presentation: 9 teams
17	F, Jan. 4	Project presentation: 9 teams
18	F, Jan. 11	Project presentation: 10 teams
	F, Jan. 18				final project report due

EE6485 Computer Vision