Foundations of Computer Vision

Course project of COMP4901L, a clone of the CMU 16-385 course.

Team: Daniel Cheung, Dipsy Wong

Repo: https://github.com/dipsywong98/COMP4901L

One of the hardest and most intensive undergraduate Computer Science courses at HKUST: 7 programming projects in 13 weeks. Thank god our super difficult final was cancelled.

  0. Matlab warm-up
  1. Image filtering and Hough transform
  2. Augmented reality and planar homography
  3. 3D reconstruction
  4. Physics-based vision
  5. Scene recognition with bag of words
  6. Digit recognition with CNN
  7. Tracking objects in videos

1. Image filtering and Hough transform

A pipeline that applies filter kernels and the Hough transform to detect straight-line edges

  1. Original image
  2. Apply the Sobel operator
  3. Thresholding
  4. Hough transform
  5. Detected lines
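
The steps above can be sketched in a few lines of plain Python. This is a minimal illustration, not the Matlab code from the project: the synthetic 20x20 image, the threshold of 2.0, and the 1-degree theta bins are all assumptions chosen to keep the example small.

```python
import math

def sobel_magnitude(img):
    """Gradient magnitude from the 3x3 Sobel kernels (border pixels stay 0)."""
    h, w = len(img), len(img[0])
    kx = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
    ky = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]
    mag = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = sum(kx[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            gy = sum(ky[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            mag[y][x] = math.hypot(gx, gy)
    return mag

def hough_accumulator(edges, n_theta=180):
    """Each edge pixel votes for every line rho = x*cos(theta) + y*sin(theta)."""
    h, w = len(edges), len(edges[0])
    max_rho = int(math.ceil(math.hypot(h, w)))
    acc = [[0] * n_theta for _ in range(2 * max_rho + 1)]
    for y in range(h):
        for x in range(w):
            if edges[y][x]:
                for t in range(n_theta):
                    theta = math.pi * t / n_theta
                    rho = int(round(x * math.cos(theta) + y * math.sin(theta)))
                    acc[rho + max_rho][t] += 1  # offset so negative rho fits
    return acc, max_rho

# Assumed test image: dark on the left, bright from column x = 10 onwards
img = [[1.0 if x >= 10 else 0.0 for x in range(20)] for y in range(20)]
mag = sobel_magnitude(img)
edges = [[m > 2.0 for m in row] for row in mag]   # thresholding step
acc, max_rho = hough_accumulator(edges)

peak_votes = max(max(row) for row in acc)
# The bin (rho = 10, theta = 0) describes the vertical line x = 10
vertical_votes = acc[max_rho + 10][0]
print(peak_votes, vertical_votes)
```

On an image this small, several neighbouring bins tie for the peak, so a real detector would normally add non-maximum suppression before reading off the lines.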

2. Augmented reality and planar homography

Feature detectors (FAST / Harris corner detection), feature descriptors (BRIEF, filter bank), homography, RANSAC, panorama

  1. Extract and match the features

  2. Compute the homography and map the new image

Panorama: left image, right image, and result
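
The homography step can be illustrated with a small sketch. It assumes exactly four matched points and fixes the bottom-right entry of H to 1, so the problem becomes an 8x8 linear system. The project itself used RANSAC over many matches, which is not shown here; the helper names and the point coordinates are made up for illustration.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def homography_from_4_points(src, dst):
    """Each match (x, y) -> (u, v) contributes two linear equations in h."""
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        a.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        a.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], h[6:8] + [1.0]]  # assumption: H[2][2] = 1

def apply_homography(H, pt):
    """Map a point through H and divide by the homogeneous coordinate."""
    x, y = pt
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return ((H[0][0] * x + H[0][1] * y + H[0][2]) / w,
            (H[1][0] * x + H[1][1] * y + H[1][2]) / w)

# Hypothetical matches: unit square mapped to a general quadrilateral
src = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
dst = [(0.0, 0.0), (2.0, 0.0), (3.0, 2.0), (0.0, 1.0)]
H = homography_from_4_points(src, dst)
mapped = [apply_homography(H, p) for p in src]
```

With noisy matches, a RANSAC loop would call a solver like this on random 4-point subsets and keep the H with the most inliers.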

3. 3D reconstruction

Sparse reconstruction

Dense reconstruction

Pose estimation

4. Physics-based vision

Given the albedo and surface normals, render the model under different light sources

Given pictures taken under different light sources, compute the albedo and surface normals
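
Both directions can be sketched for a single pixel. The example assumes a Lambertian surface (intensity = albedo times the dot product of normal and light direction) and exactly three known light directions, which the source does not spell out; the numeric values are made up.

```python
import math

def render(albedo, normal, light):
    """Lambertian shading for one pixel: albedo * max(0, n . s)."""
    return albedo * max(0.0, sum(n * s for n, s in zip(normal, light)))

def det3(m):
    """Determinant of a 3x3 matrix given as a list of rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def photometric_stereo(intensities, lights):
    """Solve S g = I for g = albedo * normal using Cramer's rule (3 lights)."""
    d = det3(lights)
    g = []
    for k in range(3):
        mk = [list(row) for row in lights]
        for r in range(3):
            mk[r][k] = intensities[r]   # replace column k with the intensities
        g.append(det3(mk) / d)
    albedo = math.sqrt(sum(v * v for v in g))   # |g| is the albedo
    normal = [v / albedo for v in g]            # g / |g| is the unit normal
    return albedo, normal

# Assumed ground truth for one pixel and three unit light directions
true_albedo = 0.5
true_normal = [0.0, 0.6, 0.8]
lights = [[0.0, 0.0, 1.0], [0.6, 0.0, 0.8], [0.0, 0.6, 0.8]]

# Forward direction: render the pixel under each light
intensities = [render(true_albedo, true_normal, s) for s in lights]
# Inverse direction: recover albedo and normal from the three intensities
albedo, normal = photometric_stereo(intensities, lights)
```

With more than three lights, the same system would be solved in the least-squares sense for every pixel; shadowed pixels break the linear model and need separate handling.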

5. Scene recognition with bag of words

Given a set of filters and a set of training images, sample points from each image (at random or at detected corners), compute their filter responses, and cluster the responses with k-means to form the dictionary. Then map each pixel of every image to a word in the dictionary; the resulting word map is converted into a histogram of the relative frequency of each word. Finally, classify the images using image feature distance with NN or kNN, or with an SVM.

  1. Original image
  2. Filter responses in the Lab color space
  3. Collect sampling points
  4. Image to word map
  5. Classification: kNN 57.5% accuracy (k=14), LibSVM 63.75% accuracy, HOG+SVM 80% accuracy
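
The word-map, histogram, and nearest-neighbour steps described above can be sketched as follows. The example assumes the dictionary has already been learned with k-means, and uses made-up 2-D "filter responses", a 2-word dictionary, and hypothetical class labels to stay readable.

```python
from collections import Counter

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def word_histogram(responses, dictionary):
    """Map each response to its nearest word, then count relative frequencies."""
    counts = [0] * len(dictionary)
    for r in responses:
        word = min(range(len(dictionary)),
                   key=lambda k: sq_dist(r, dictionary[k]))
        counts[word] += 1
    total = len(responses)
    return [c / total for c in counts]

def knn_classify(query_hist, train_hists, train_labels, k=1):
    """Majority vote among the k training histograms closest to the query."""
    order = sorted(range(len(train_hists)),
                   key=lambda i: sq_dist(query_hist, train_hists[i]))
    votes = Counter(train_labels[i] for i in order[:k])
    return votes.most_common(1)[0][0]

# Hypothetical dictionary (cluster centres) and per-image filter responses
dictionary = [(0.0, 0.0), (1.0, 1.0)]
train_responses = {
    "sky":   [(0.1, 0.0), (0.0, 0.2), (0.1, 0.1), (0.9, 1.0)],
    "grass": [(1.0, 0.9), (0.8, 1.1), (0.9, 0.9), (0.1, 0.0)],
}
train_labels = list(train_responses)
train_hists = [word_histogram(train_responses[label], dictionary)
               for label in train_labels]

query = [(0.0, 0.1), (0.2, 0.0), (1.0, 1.0), (0.1, 0.1), (0.0, 0.0)]
query_hist = word_histogram(query, dictionary)
predicted = knn_classify(query_hist, train_hists, train_labels, k=1)
print(query_hist, predicted)
```

Squared Euclidean distance between histograms is only one possible choice here; other histogram distances would slot into `knn_classify` the same way.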

6. Digit recognition with CNN

7. Tracking objects in videos

Lucas-Kanade tracker and Matthews-Baker tracker

https://youtu.be/0fQug-VUdEk (car tracking using robust LK)

https://youtu.be/UvSW91Z5msA (landing tracking using pyramid LK)
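
For readers unfamiliar with Lucas-Kanade, here is a minimal sketch of the basic idea. It assumes a pure-translation warp and the plain forward-additive update on a synthetic Gaussian blob; it does not implement the Matthews-Baker, robust, or pyramid variants used for the videos above, and all names and sizes are illustrative.

```python
import math

def make_blob(cx, cy, size=32, sigma=4.0):
    """Synthetic frame: a Gaussian blob centred at (cx, cy)."""
    return [[math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))
             for x in range(size)] for y in range(size)]

def bilinear(img, x, y):
    """Sample img at a sub-pixel location with bilinear interpolation."""
    x0 = min(max(int(math.floor(x)), 0), len(img[0]) - 2)
    y0 = min(max(int(math.floor(y)), 0), len(img) - 2)
    fx, fy = x - x0, y - y0
    top = img[y0][x0] * (1 - fx) + img[y0][x0 + 1] * fx
    bot = img[y0 + 1][x0] * (1 - fx) + img[y0 + 1][x0 + 1] * fx
    return top * (1 - fy) + bot * fy

def lucas_kanade_translation(template, image, rect, n_iters=50, tol=1e-4):
    """Find (px, py) so that image(x + px, y + py) matches template in rect."""
    x1, y1, x2, y2 = rect
    px = py = 0.0
    for _ in range(n_iters):
        a11 = a12 = a22 = b1 = b2 = 0.0
        for y in range(y1, y2 + 1):
            for x in range(x1, x2 + 1):
                wx, wy = x + px, y + py
                err = template[y][x] - bilinear(image, wx, wy)
                # Image gradient at the warped location (central differences)
                gx = (bilinear(image, wx + 1, wy) - bilinear(image, wx - 1, wy)) / 2
                gy = (bilinear(image, wx, wy + 1) - bilinear(image, wx, wy - 1)) / 2
                a11 += gx * gx; a12 += gx * gy; a22 += gy * gy
                b1 += gx * err; b2 += gy * err
        # Solve the 2x2 normal equations for the parameter update
        det = a11 * a22 - a12 * a12
        dx = (a22 * b1 - a12 * b2) / det
        dy = (a11 * b2 - a12 * b1) / det
        px += dx
        py += dy
        if math.hypot(dx, dy) < tol:
            break
    return px, py

frame0 = make_blob(15, 15)          # object in the first frame
frame1 = make_blob(17, 16)          # same object moved by (+2, +1)
rect = (8, 8, 22, 22)               # tracked window in frame0
px, py = lucas_kanade_translation(frame0, frame1, rect)
print(round(px, 2), round(py, 2))
```

The estimate should land close to the true shift of (2, 1). Larger motions are typically where a pyramid helps, since the linearisation only holds for small displacements.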

Scores

I ranked 6th, with an A+.