Professional Documents
Culture Documents
3D With Consumer Depth Sensors s1 Introduction 23T
3D With Consumer Depth Sensors s1 Introduction 23T
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 2
Course contents
I. Introduction to 3D sensors
1.1 Standards & Applications
1.2 3D scene representation: images, depth, 3D sensors and multi-view
1.3 From 2D to 3D: multi-view, projection, reconstruction, scan
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 3
Course contents (cont)
III. 3D data analysis
3.1 Filtering, registration, spurious removal
3.2 Estimation of simple geometric objects with RANSAC.
Comparison to geometry estimation from photometric data
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 4
Course organization
K3D seminar @telecosBCN
Introduction to concepts and technologies related to depth sensors and data
Check bibliography
C/C++ course
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 5
Initial References
[Machanick 2003] P. Machanick, C and C++ in Five Days, Univ. of Queensland 2011 (static link)
[Mutto 2012] C. dal Mutto, et al, Time-of-flight Cameras and Microsoft Kinect, Springer, 2012
[OpenNI] OpenNI, Natural Interaction Devices, structure.io/openni (formerly openni.org, now Apple!)
[Rusu 2011] R.B. Rusu, S. Cousins, 3D is here: Point Cloud Library (PCL), ICRA, 2011 pointclouds.org
[Zhang 2012] Z. Zhang, Microsoft Kinect Sensor and Its Effect, IEEE Multimedia 19(2), 2012
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 6
Technology evolution
Images
Video
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 8
Fundamental ambiguity
9
• Any point P on the ray OP projects to the same point p in the image plane
Source: R. Collins
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas
Camera Models
• The relation between an image point and its line of sight is given by the
camera model (e.g. pinhole camera) and the calibration parameters:
These parameters are called intrinsic camera parameters,
whereas the position & orientation of the camera are called extrinsic parameters
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas
Two eyes help!
• A second camera can resolve the ambiguity, enabling depth measurement via
triangulation
Camera projection parameters (intrinsic & extrinsic) are known
Source: R. Collins
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas
Motivation and fundamental questions
Motto:
Computer Vision means “Teaching computers to see”
Antonio Torralba (MIT)
Talk@CVC 20th Anniversary, Barcelona July 9th, 2015
Exciting time for CV: new architectures, DBs, productivization, future
Fundamental problems: reconstruct 3D world, recognize…
Questions related to 2D/3D:
Q1: Is “projective vision” a natural way to capture the 3D world?
Q2: Do we need photometry to get geometry?
Q3: Does 3D vision mean the same than 3D geometry?
Q4: Does 2D/3D matter for “Teaching computers to see”?
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 12
On the rise…
Depth (range) sensors and technologies
Framos (website)
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 13
On the rise…
Point Cloud datasets for 3D processing
Range detection and people classification on time of flight measurement nuScenes (website)
Analog Devices (website)
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 14
Introduction to 3D Sensors and Range Data
3D vision comes from the concepts of:
• Projective transformations
perspective projection, projectivities (3D, plane-to-plane,
n-dimensional spaces, homography, invariants…)
• Multiple-View geometry
reconstruct real world scenes from several cameras (projections)
…or from a moving camera (SfM)
That’s a natural way into 3D vision as humans come equipped with:
• Two ‘projective’ sensors
• Multi-view (stereoscopic) vision
• And ‘Free Viewpoint View’ (FVV or rather “FV Acquisition”)
Q1: Is “projective vision” a natural way to capture the 3D world?
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 15
Projective vision ambiguities…
• Look twice
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 25
3D/range/scan sensors
Advantages 3D/range/scan Disadvantages
•… • …
…
…
• Actually not 3D but 2,5D!
• Direct measure of RGB+D
3D geometry Surface vs 3D (interface air-matter vs
volumetric 3D scanner, i.e. PET)
• Independent of photometry • Single viewpoint
(active sensors) Neither panoramic, nor surround view
Doesn’t hold for reflectivity still need MultiView!!!
• Accuracy/resolution? Processing!
• From scan to CAD blueprints vs MPix cams!
Ideally! vs OpenCV!
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 26
Symmetric paradigm of 3D Data
Double nature of 3D data
Data
Geometric information 3D data
Pure data, measures.
Numerical representation of objects
calculation, simulation, measurements
Rendering
… illumination + reflectivity
3D capture
Photometric information (images)
Can be rendered, displayed and presented visually,
by exploiting our perception capabilities
Visual exploration, analysis and understanding Images
(same than looking at the physical world)
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 27
Example 1: Terrestrial Laser Scanning
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 28
Example 2: Digital Photogrammetry
Photo-mosaics
Photo Tourism: 2006 Savelly & Seitz UWA, Szeliski MS
Offline technique
(computationally intensive)
Photosynth (MS)
My3Dscanner
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 29
Photogrammetry
Photogrammetry refers to the practice of deriving 3D
measurements from photographs.
• Advances in digital cameras, processors, and computation,
yield extremely dense and accurate 3D surface data
from a limited number of photos with standard digital photography
• Structure from Motion (SfM) and sub-pixel image matching yield
3D-dimensional structure by analyzing projected 2D motion fields created by a
sequential change of position of the camera sensor relative to the object.
Photographic sequences are captured to maximize information available from the
change in viewpoint
• Resulting data sets are software platform-independent and can be reused
Disadvantage: computation time…
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 30
Photogrammetry examples
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 31
Example 3: new consumer depth sensors
SDK
YouTube
Gestoos use depth sensors raw data. Use Gestoos with your favorite sensor.
Robust hand tracking without initialization, no matter your body pose.
Static gestures with fast and robust detection. Create your own gesture.
Useful hand gestures, at various distances, without skeleton involved.
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 32
Commercial depth sensors: a revolution?
• Motivation
• Fact: Kinect had 10 million sales 3 months after launch Oct.2010
• New Guinness World Record:
’Fastest-Selling Consumer Electronics Device’
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 34
Depth sensors’ (Kinect’s) impact
• Understanding ’users’:
• when they talk
• who they are
• interpret their movements
• translate them into a format that developers can use to build ’new experiences’
• Applications in multiple industrial sectors
• entertainment
• education (edutainment?)
• healthcare
• retail
• transportation
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 35
Application examples
• Minority report/Oblong technologies
• Microsoft Project Natal
• CES2012 Samsung Smart TV
• FascinatE gesture control: dynamic & static
• Leap motion
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 36
Now…
• Is this the same 3D
as in Avatar?
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 37
What it does?
Instead of a 2D video like this one... ... it outputs pointclouds (“stream data”)
“cutting demo cloud”, Jeremie Papon PhD (c) 2014 k3d@k3d:~>> k3d-viewer-app Data/capture.oni
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 38
References (3D sensors)
[Bhandari 2016] A. Bhandari, R. Raskar, Signal processing for TOF imaging sensors: An
introduction to inverse problems in computational 3-D imaging, IEEE SPM 33(5), 2012
[Bishop 2012] E. Bishop, P. Favaro: The Light Field Camera: Extended Depth of Field,
Aliasing, and Superresolution. IEEE PAMI 34(5), 2012
[Foix 2011] S. Foix et al, Lock-in Time-of-Flight (ToF) Cameras: A Survey, IEEE Sensors
11(9), 2011
[McManamon 2012] P. McManamon, “Review of ladar: a historic, yet emerging, sensor
technology with rich phenomenology,” OE 51(6), 2012
[Salvi 2010] J. Salvi et al, A state of the art in structured light patterns for surface
profilometry, Pattern Recognition 43(8), 2010
[Smisek 2011] J. Smisek, et al, 3D with Kinect, ICCV Workshops 2011
[Xiong 2017] Z. Xiong, et al, Computational Depth Sensing : Toward high-performance
commodity depth cameras, IEEE SPM 34(3), 2017
See also [Mutto 2012] [Zhang 2012] from previous refs. page
3D with Consumer Depth Sensors telecosBCN - UPC Jan 22nd – 26th 2024 Josep R. Casas 39