Visual Learning and Recognition: Difference between revisions

Visual Learning and Recognition (view source)

Revision as of 17:46, 1 October 2020

1,847 bytes added , 1 October 2020

→‎Object Detection and Instance Segmentation

David

Bureaucrats, Interface administrators, Administrators

5,337

edits

@@ Line 245: / Line 245: @@
 * Ensembles of networks
-==Object Detection and Instance Segmentation==
+==Object Detection and Segmentation==
+Beginning of Lecture 10 (Oct 1)
+===Edge Templates + Nearest Neighbor===
+Gavrila & Philomen (1999)
+# From a raw image, do feature extraction and calculate distance transform.
+# Do nearest neighbor search.
+Cons:
+* Templates are hand-made.
+===Haar Wavelets + SVM===
+A Trainable System for Object Detection. (Papageorgou & Poggio, 2000)
+# Extract Overcomplete Representation
+#* Called Haar Wavelets. Similar to CNN filters.
+#* Wavelet features can be calculated by averaging all faces. Similar to CNN features.
+# Do SVM Classifier
+;+ Parts (2001)
+Trained an SVM for face, legs, left arm, right arm.
+When detecting a person, make sure all parts are in the correct location and shape with some constraints.
+===YYY + adaBoost===
+Basically, do the same as before (Haar Wavelets) but replace SVM with adaBoost.
+;Rectangular differential features (Viola & Jones 2001)
+Use fast features to throw out parts of the image.
+Then do processing on the remainder.
+Became the standard object detection system in OpenCV.
+;Learnt wavelets + adaBoost
+Works on more than just faces.
+Ensemble face detection.
+===Dynamic Programming===
+Efficient matching of pictorial structures (Felzenszwalb & Huttenlocher, 2000)
+Basically have a cartoon model and match the position & orientation of each part.
+Probabilistic Methods for Finding People (Ioffe & Forsyth, 1999)
+===More Techniques===
+How to detect objects at different scale?
+One trick is to detect the horizon line and scale based on the horizon line.
+Sliding Window:
+Create multiple scales of the image and detect at each scale.
+This is done by building a feature pyramid using an image pyramid.
+===Histograph of Gradients (HoG)===
+How many octaves? However many octaves to reduces the image size to the template size + 1 for 2x2 upscaling.
+How many levels? Generally people try 10 levels.
 ==Will be on the exam==