1 Introduction
Vision
Gaoang Wang
Education
2009-2013, Fudan University, B.S.
2013-2015, University of Wisconsin-Madison, M.S.
2015-2019, University of Washington, Seattle, Ph.D.
Working Experience
2019.06-2019.11, Research Scientist, Megvii
2019.11-2020.07, Research Scientist II, Wyze Labs
2020.09-present, Assistant Professor, Zhejiang University
2021.03-present, Adjunct Assistant Professor, UIUC
Email: gaoangwang@intl.zju.edu.cn
Homepage: https://person.zju.edu.cn/gaoangwang
Teaching
Machine Learning, Data Mining, Advanced Image Processing, Intro to Computer Vision
Research Areas
Computer Vision, Machine Learning, Artificial Intelligence
Every image tells a story
• Goal of computer vision: perceive the “story” behind the picture
• But what does “story” mean?
• Depends on what we want to do with it
The goal(s) of computer vision
• What is the image about?
• What objects are in the image?
• Where are they?
• How are they oriented?
• What is the layout of the scene in 3D?
• What is the shape of each object?
Recent progress
• Depth cameras
https://realsense.intel.com/stereo/
Microsoft Kinect
• Shape capture
Source: S. Seitz
• Established technology: 3D Models of the world
Mask R-CNN. Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick. ICCV 2017
• Species recognition
[iNaturalist]
• Recognizing rare concepts
• Recovering 3D structure from limited views
• Integrating Vision and Action
[Figure labels: Map, Embedding, Features, Networks]
[Figure panels: Shape variation, Occlusion, Viewpoint variation, Scale, Background clutter, Illumination]
Hard examples
• Concepts are subtle
https://www.allaboutbirds.org
Challenges
• Local ambiguity