Slide16-Machine Learning For Computer Vision
Min H. Kim
KAIST School of Computing
Lecturer: Min H. Kim (KAIST) CS484: Introduction to Computer Vision Shin Yoo 9
Dimensionality Reduction
Eigenfaces
The AT&T face database (formerly the ORL database): 10 pictures of each of 40 subjects
Mean face
50x50 patch
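The mean face and eigenfaces can be computed with principal component analysis. The following is a minimal sketch, assuming NumPy and synthetic random data in place of the real face images; the function names (`eigenfaces`, `project`, `reconstruct`) and the choice of 20 components are illustrative assumptions, not taken from the slides. The array shape mirrors the numbers on the slide (400 images, 50x50 pixels).

```python
import numpy as np

def eigenfaces(images, k):
    """Return (mean_face, components) for a stack of equally sized images.

    images: array of shape (n_images, height, width)
    k: number of principal components (eigenfaces) to keep
    """
    n = images.shape[0]
    X = images.reshape(n, -1).astype(np.float64)   # one flattened image per row
    mean_face = X.mean(axis=0)
    centered = X - mean_face
    # Rows of vt are orthonormal principal directions, sorted by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean_face, vt[:k]

def project(image, mean_face, components):
    """Coordinates of one image in the k-dimensional eigenface basis."""
    return components @ (image.reshape(-1).astype(np.float64) - mean_face)

def reconstruct(coeffs, mean_face, components, shape):
    """Approximate image rebuilt from its k coefficients."""
    return (mean_face + coeffs @ components).reshape(shape)

# Synthetic stand-in data (NOT the real face database): 400 random 50x50 images.
rng = np.random.default_rng(0)
faces = rng.random((400, 50, 50))

mean_face, components = eigenfaces(faces, k=20)      # k=20 is an arbitrary choice
coeffs = project(faces[0], mean_face, components)    # 2500 pixels -> 20 numbers
approx = reconstruct(coeffs, mean_face, components, faces[0].shape)
```

Each image is reduced from 2500 pixel values to 20 coefficients; on real faces the leading components capture most of the variation, which random data does not show.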
Superpixels!
Oversegmentation Undersegmentation
Hierarchical Segmentations
Clustering
Key Challenges:
1) What makes two points/images/patches similar?
2) How do we compute an overall grouping from pairwise similarities?
• Mean-shift clustering
– Estimate modes of pdf
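Mean-shift finds the modes of a density estimate by repeatedly moving each point to the kernel-weighted mean of the data around it. The following is a minimal sketch, assuming NumPy, a Gaussian kernel, and a fixed iteration count; the function name `mean_shift_modes` and the parameters `bandwidth`, `n_iter`, and `merge_tol` are illustrative assumptions, not taken from the slides.

```python
import numpy as np

def mean_shift_modes(points, bandwidth, n_iter=50, merge_tol=1e-2):
    """Shift every point uphill on a Gaussian kernel density estimate.

    Returns (modes, labels): the distinct modes found and, for each input
    point, the index of the mode it converged to.
    """
    points = np.asarray(points, dtype=np.float64)
    shifted = points.copy()
    for _ in range(n_iter):
        # Squared distances from every current position to every data point.
        d2 = ((shifted[:, None, :] - points[None, :, :]) ** 2).sum(axis=2)
        w = np.exp(-d2 / (2.0 * bandwidth ** 2))            # Gaussian kernel weights
        shifted = (w @ points) / w.sum(axis=1, keepdims=True)  # weighted means

    # Merge positions that ended up at (nearly) the same place into one mode.
    modes = []
    labels = np.empty(len(points), dtype=int)
    for i, p in enumerate(shifted):
        for j, m in enumerate(modes):
            if np.linalg.norm(p - m) < merge_tol:
                labels[i] = j
                break
        else:
            modes.append(p)
            labels[i] = len(modes) - 1
    return np.array(modes), labels

# Synthetic example: two well-separated 2-D blobs.
rng = np.random.default_rng(0)
data = np.vstack([
    rng.normal(loc=(0.0, 0.0), scale=0.3, size=(50, 2)),
    rng.normal(loc=(5.0, 5.0), scale=0.3, size=(50, 2)),
])
modes, labels = mean_shift_modes(data, bandwidth=1.0)
```

The number of clusters is not specified in advance; it follows from the bandwidth, which controls how smooth the estimated density is.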
ImageNet Competition
• Top 5 error
• Krizhevsky, 2012
• Google, Microsoft, 2015
– Beat the best human score in the ImageNet challenge.
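The "top 5 error" used to score the ImageNet challenge counts a prediction as correct when the true label is among the five highest-scoring classes. The following is a minimal sketch of that metric, assuming NumPy and a hypothetical score matrix with one row per image and one column per class; the function name `top_k_error` and the toy scores are made up for illustration.

```python
import numpy as np

def top_k_error(scores, labels, k=5):
    """Fraction of examples whose true label is NOT among the k top-scoring classes.

    scores: array of shape (n_examples, n_classes), higher means more likely
    labels: array of shape (n_examples,) with the true class index
    """
    scores = np.asarray(scores)
    labels = np.asarray(labels)
    # Indices of the k highest scores per row (order within the k does not matter).
    top_k = np.argpartition(-scores, k - 1, axis=1)[:, :k]
    hit = (top_k == labels[:, None]).any(axis=1)
    return 1.0 - hit.mean()

# Toy scores for 3 images and 6 classes (made-up numbers).
scores = [
    [0.10, 0.50, 0.20, 0.90, 0.00, 0.30],
    [0.70, 0.10, 0.60, 0.05, 0.20, 0.30],
    [0.20, 0.30, 0.10, 0.15, 0.80, 0.60],
]
labels = [1, 4, 5]

top1 = top_k_error(scores, labels, k=1)   # no image has its true class ranked first
top2 = top_k_error(scores, labels, k=2)   # only the second image misses
top5 = top_k_error(scores, labels, k=5)   # every true class is within the top five
```

Top-5 error is never larger than top-1 error on the same predictions, because enlarging the set of accepted guesses can only turn misses into hits.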
Testing
Test Image
Slide credit: D. Hoiem and L. Lazebnik
Features
• Raw pixels
• Histograms
• Templates
• SIFT descriptors
– GIST
– ORB
– HOG…
• Concision
– Minimize number of features without sacrificing coverage
• Directness
– Ideal features are independently useful for prediction
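To make the trade-off between raw pixels and a more concise feature concrete, the following minimal sketch computes both for one patch. It assumes NumPy, an 8-bit grayscale image, and 16 histogram bins; the function name `intensity_histogram` and the random test image are illustrative assumptions.

```python
import numpy as np

def intensity_histogram(image, bins=16):
    """Normalized histogram of 8-bit intensities: a short, fixed-length feature."""
    counts, _ = np.histogram(image, bins=bins, range=(0, 256))
    return counts / counts.sum()

# Synthetic 50x50 grayscale patch with integer intensities in [0, 255].
rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(50, 50))

raw_feature = image.reshape(-1) / 255.0       # raw pixels: 2500 numbers
hist_feature = intensity_histogram(image)     # histogram: 16 numbers

# Shuffling the pixels destroys the spatial layout but leaves the histogram unchanged.
shuffled = rng.permutation(image.reshape(-1)).reshape(image.shape)
same_histogram = np.allclose(hist_feature, intensity_histogram(shuffled))
```

The histogram is far shorter than the raw pixel vector, but it discards where each intensity occurs, so concision is gained at some cost in what the feature can distinguish.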
Slide credit: D. Hoiem and L. Lazebnik
Recognition task and supervision
• Label: Images in the training set must be annotated with the “correct answer” that the model is expected to produce
Contains a motorbike
Slide credit: D. Hoiem and L. Lazebnik
The machine learning framework
• Apply a prediction function to a feature representation of the image to get the desired output:
f([image]) = “apple”
f([image]) = “tomato”
f([image]) = “cow”
Slide credit: L. Lazebnik
The machine learning framework
f(x) = y
f: prediction function (or classifier)
x: image feature
y: output (label)
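One simple choice of prediction function f is a nearest-neighbor classifier, which returns the label of the closest training feature. The following is a minimal sketch, assuming NumPy and Euclidean distance; the function name `make_nearest_neighbor_classifier` and the 2-D feature values are made up for illustration and do not come from the slides.

```python
import numpy as np

def make_nearest_neighbor_classifier(train_features, train_labels):
    """Build f(x) = label of the training example whose feature is closest to x."""
    train_features = np.asarray(train_features, dtype=np.float64)

    def f(x):
        diffs = train_features - np.asarray(x, dtype=np.float64)
        dists = np.linalg.norm(diffs, axis=1)      # Euclidean distance to each example
        return train_labels[int(np.argmin(dists))]

    return f

# Hypothetical 2-D image features with their labels (values are invented).
train_x = [[0.9, 0.8], [0.8, 0.9], [0.2, 0.1]]
train_y = ["apple", "tomato", "cow"]

f = make_nearest_neighbor_classifier(train_x, train_y)
y1 = f([0.85, 0.75])   # closest to the first training feature
y2 = f([0.10, 0.20])   # closest to the third training feature
```

Here x is the feature vector extracted from an image and y is the returned label, matching the roles in f(x) = y.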
Slide credit: D. Hoiem and L. Lazebnik
Supervised Learning Framework
• Features and distance measures define visual similarity.
• Training labels dictate that examples are the same or different.
• Classifiers learn weights (or parameters) of features and distance measures so that visual similarity predicts label similarity.
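As one possible instance of a classifier that learns feature weights from training labels, the following minimal sketch fits a logistic regression by gradient descent. Logistic regression is chosen here for illustration and is not named on the slides; NumPy, the synthetic 2-D features, and the names `train_logistic_regression`, `predict`, `lr`, and `n_steps` are all assumptions.

```python
import numpy as np

def train_logistic_regression(X, y, lr=0.1, n_steps=2000):
    """Learn weights w and bias b so that sigmoid(X @ w + b) matches labels y in {0, 1}."""
    X = np.asarray(X, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(n_steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))   # predicted probability of class 1
        grad_w = X.T @ (p - y) / len(y)          # gradient of the mean log loss
        grad_b = (p - y).mean()
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

def predict(X, w, b):
    """Predict class 1 where the weighted feature sum is positive, else class 0."""
    return (np.asarray(X, dtype=np.float64) @ w + b > 0).astype(int)

# Synthetic training set: two classes of 2-D features with different means.
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(loc=-2.0, scale=1.0, size=(100, 2)),   # class 0
    rng.normal(loc=2.0, scale=1.0, size=(100, 2)),    # class 1
])
y = np.array([0] * 100 + [1] * 100)

w, b = train_logistic_regression(X, y)
train_accuracy = (predict(X, w, b) == y).mean()
```

The learned weights say how much each feature contributes to the decision, so examples with similar features receive similar predicted labels.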