Professional Documents
Culture Documents
A Review of Stereo-Photogrammetry Method For 3-D Reconstruction in Computer Vision
A Review of Stereo-Photogrammetry Method For 3-D Reconstruction in Computer Vision
(ISCIT)
1Department of Mechatronics, Ho Chi Minh City University of Technology, Ho Chi Minh City, Vietnam
2 Department of Information Technology, Saigon Institute of Technology, Ho Chi Minh City, Vietnam
978-1-7281-5009-3/19/$31.00
Authorized licensed use limited to: UNIVERSITAETSBIBLIOTHEK 138
©2019 IEEE CHEMNITZ. Downloaded on January 30,2022 at 20:17:58 UTC from IEEE Xplore. Restrictions apply.
2019 19th International Symposium on Communications and Information Technologies
(ISCIT)
139
Authorized licensed use limited to: UNIVERSITAETSBIBLIOTHEK CHEMNITZ. Downloaded on January 30,2022 at 20:17:58 UTC from IEEE Xplore. Restrictions apply.
2019 19th International Symposium on Communications and Information Technologies
(ISCIT)
140
Authorized licensed use limited to: UNIVERSITAETSBIBLIOTHEK CHEMNITZ. Downloaded on January 30,2022 at 20:17:58 UTC from IEEE Xplore. Restrictions apply.
2019 19th International Symposium on Communications and Information Technologies
(ISCIT)
bottom right of the DSI, which as a result, is the disparity map Lr ( p, D) C ( p, D) min( Lr ( p r , D),
of the points on that scanline [31]. However, this method
introduces streaking because the relationships between DSIs Lr ( p r, D 1) P1 ,
(6)
of the whole image are not considered. Figure 10 show a DSI Lr ( p r, D 1) P1,
example.
min Lr ( p r,i) P2 ) min Lr ( p r ,k).
i k
S ( p, D) Lr ( p, D). (7)
r
141
Authorized licensed use limited to: UNIVERSITAETSBIBLIOTHEK CHEMNITZ. Downloaded on January 30,2022 at 20:17:58 UTC from IEEE Xplore. Restrictions apply.
2019 19th International Symposium on Communications and Information Technologies
(ISCIT)
III. CONCLUSIONS [8] R. Szeliski, and S. B. Kang, “Recovering 3D Shape and Motion from
Image Streams Using Nonlinear Least Squares,” in Journal of Visual
Middlebury benchmark, being mentioned earlier, is a Communication and Image Representation, vol. 5, no. 1, 1994, pp. 10–
reliable online platform to keep track of the continuous 28.
developments in stereo-matching. The newest method is [9] A. Azarbayejani and A. P. Pentland, Recursive Estimation of Motion,
Structure, and Focal Length, in IEEE Transactions on Pattern
dated to 2019 (the same publishing year of this paper) Analysis and Machine Intelligence, vol. 17, no. 6, June 1995, pp. 562–
showing that this is currently an active research topic. 575.
This review shows the possibility of stereo-photogrametry [10] S. M. Seitz, and C. M. Dyer, “Photorealistic Scene Reconstruction by
to build the 3D vision systems. We realized that in local Voxel Coloring,” in International Journal of Computer Vision, vol. 35,
techniques such as SAD, the matching costs are redundantly no. 2, 1999, pp. 151–173.
calculated many times, and it is unnecessary to search for the [11] K. N. Kutulakos, and S. M. Seitz, “A Theory of Shape by Space
depth of objects’ background. In addition, researches show Carving,” in International Journal of Computer Vision, vol. 38, no. 3,
2000, pp. 199–218.
that global approaches will take more computation cost than
[12] R. Jain, R. Kasturi, and B. G. Schunck, “Introduction,” in Machine
local methods. Although there is an OpenCV library for semi- Vision. New York, USA: McGraw-Hill, 1995.
global method StereoSGBM() and based on Hirschmuller’s
[13] M. Siudak and P. Rokita, “A Survey of Passive 3D Reconstruction
work [22], it does not make use of the Mutual Information Methods on the Basis of more than One Image,” in Machine Graphics
(MI) to calculate the matching cost (which, as being proved and Vision, vol. 23, no. 3, 2014, pp. 57–117.
by the author, yields good matching results even under [14] X. Li and Y. Shi, Computer Vision Imaging Based on Artificial
changing intensities). Therefore, the investigation on the Intelligence, in International Conference on Virtual Reality and
accuracy and performance of the stereo-photogrametry Intelligent Systems (ICVRIS), Changsha, 2018, pp. 22–25.
method should be continued. [15] R. F. Peter, M. Lucas, T. Russ, “Dense Object Nets: Learning Dense
Visual Object Descriptors by and for Robotic Manipulation,” in
Some stereo-photogrametry methods were reviewed in Proceedings of The 2nd Conference on Robot Learning, PMLR, vol.
this paper are dated a decade ago. However, those are the 87, 2018, pp. 373–385.
fundamental knowledge that has led to many state-of-the-art [16] R. Khilar, S. Chitrakala and S. SelvamParvathy, 3D Image
algorithms. For example, in 2012, Sah Jotwani showed an Reconstruction: Techniques, Applications and Challenges, in
International Conference on Optical Imaging Sensor and Security
improved stereo matching method which depended on (ICOSS), Coimbatore, 2013, pp. 1–6.
correlation method [54]. Or in [55], published in 2018, a new [17] B. Julius, G. Iñigo, G. C. Luis and F. E. Carlos, “3D Reconstruction
method was introduced and called Cyclops 2 that minimized Methods, a Survey,” in Proceedings of the First International
a weight function by applying the idea of calculating SAD Conference on Computer Vision Theory and Applications, 2006, pp.
(which method was discussed earlier in this paper). Even in a 457–463.
few-month-old paper (Mar. 2019), [56], the authors still using [18] S. Lee, Depth Camera Image Processing and Applications, in 19th
dynamic programming in order to calculate the disparity. IEEE International Conference on Image Processing, Orlando, FL,
2012, pp. 545–548.
Overall, most of the new stereo-matching methods are
[19] A. Tsai, J.W. Fisher, C. Wible, W.M. Wells, J. Kim, and A.S. Willsky,
improving the calculation time because stereo-matching is “Analysis of Functional MRI Data Using Mutual Information,” in
“computationally intensive” [55]. One of the known solutions, Taylor C., Colchester A. (eds) Medical Image Computing and
parallel computation with the graphics processing unit (GPU) Computer-Assisted Intervention – MICCAI’99, MICCAI 1999. Lecture
technology, is also our focus. Some other current researches Notes in Computer Science, Springer, Berlin, Heidelberg, vol 1679,
focus on improving the performance of the stereo-matching doi: 10.1007/10704282_51.
by applying convolutional neural networks (CNN), such as [20] Z. Kowalczuk and D. Wesierski, “Vision Guided Robot Gripping
Systems,” in Automation and Robotics. Juan Manuel Ramos Arreguin,
MC-CNN methods that uses CNN to compute matching cost Rijeka: IntechOpen, 2008, pp. 41–72. doi: 10.5772/6264.
but still using Semi-Global Matching (SGM-which was first
[21] L. Pérez, Í. Rodríguez, N. Rodríguez, R. Usamentiaga, and D. F.
introduced in 2008 [22]) to perform the matching [55]. García, “Robot Guidance Using Machine Vision Techniques in
However, those topics are not in the scope of this paper. Industrial Environments: A Comparative Review,” in Sensors (Basel),
vol. 16, no. 3, Mar. 2016, pp. 335–361.
REFERENCES [22] H. Hirschmuller, Stereo Processing by Semiglobal Matching and
[1] R. Szeliski, “Introduction,” in Computer Vision: Algorithms and Mutual Information, in IEEE Transactions on Pattern Analysis and
Applications. England: Springer, 2011. Machine Intelligence, vol. 30, no. 2, Feb. 2008, pp. 328–341.
[2] G. J. Agin and T. O. Binford, Computer Description of Curved [23] L. Yan, X. Zhao and H. Du, Research on 3D Measuring Based
Objects, in IEEE Transactions on Computers, vol. C-25, no. 4, April Binocular Vision, in IEEE International Conference on Control
1976, pp. 439–449. Science and Systems Engineering, Yantai, 2014, pp. 18–22.
[3] O. D. Faugeras and M. Hebert, “The Representation, Recognition and [24] C. Loop and Z. Zhang, “Computing Rectifying Homographies for
Positioning of 3-D Shapes from Range Data,” in Kanade, T. (ed.), Stereo Vision,” in Computer Vision and Pattern Recognition, vol. 1,
Three-Dimensional Machine Vision, Boston, MA, USA: Kluwer 1999, pp. 1125–1131.
Academic Publishers, 1987, pp. 301–353. [25] V. Nozick, “Multiple View Image Rectification,” in 1st IEEE-
[4] B. Curless and M. Levoy, “A Volumetric Method for Building International Symposium on Access Spaces (IEEE-ISAS’11), Japan,
Complex Models from Range Images,” in ACM SIGGRAPH June 2011, pp.277–282.
Conference Proceedings, New Orleans, 1996, pp. 303–312. [26] L. Robert, C. Zeller, O. Faugeras, and M. Hebert, “Applications of
[5] P. J. Besl and R. C. Jain, “Three-Dimensional Object Recognition,” in Non-Metric Vision to some Visually Guided Robotics Tasks,” in
Computing Surveys, vol. 17, no. 1, 1985 pp. 75–145. INRIA, Tech. Rep. RR-2584, June 1995.
[6] A. Banno, T. Masuda, T. Oishi and K. Ikeuchi, “Flying Laser Range [27] R.I. Hartley, “Theory and Practice of Projective Rectification,” in
Sensor for Large Scale Site Modeling and its Applications in Bayon International Journal of Computer Vision, vol. 35, 1999, pp. 115–127.
Digital Archival Project,” in International Journal of Computer Vision, [28] G. Fuhr, G. P. Fickel, L. P. Dal’Aqua, C. R. Jung, T. Malzbender, and
vol. 78, no. 2–3, 2008, pp. 207–222. R. Samadani, “An Evaluation of Stereo Matching Methods for View
[7] C. J. Taylor, D. J. Kriegman and P. Anandan, Structure and Motion in Interpolation,” in IEEE International Conference on Image
Two Dimensions from Multiple Images: a Least Squares Approach, Processing, 2013, pp. 403–407.
in Proceedings of the IEEE Workshop on Visual Motion, Princeton, NJ, [29] D. Scharstein, R. Szeliski, R. Zabih, “A Taxonomy and Evaluation
USA, 1991, pp. 242–248. of Dense Two-Frame Stereo Correspondence Algorithms,” in
142
Authorized licensed use limited to: UNIVERSITAETSBIBLIOTHEK CHEMNITZ. Downloaded on January 30,2022 at 20:17:58 UTC from IEEE Xplore. Restrictions apply.
2019 19th International Symposium on Communications and Information Technologies
(ISCIT)
Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision [51] L. Matthies, R. Szeliski, and T. Kanade, “Kalman Filter-Based
(SMBV 2001), pp. 131–140. Algorithms for Estimating Depth from Image Sequences,” in
[30] Tri. Priyambodo, “Grid-Edge-Depth Map Building Employing SAD International Journal of Computer Vision, vol. 3, 1989, pp. 209–236.
with Sobel Edge Detector,” in International Journal on Smart Sensing [52] Q. Tian and M. N. Huhns, “Algorithms for Subpixel Registration,” in
and Intelligent System, vol. 10, no. 13, Sep. 2017, pp. 551–566. Computer Vision, Graphics, and Image Processing, vol. 35, 1986, pp.
[31] C.-H. Kim, H.-K. Lee, and Y.-H. Ha, Disparity Space Image-Based 220–233.
Stereo Matching Using Optimal Path Searching, in Proc. SPIE 5022, [53] M. Shimizu and M. Okutomi, “Precise Sub-Pixel Estimation on Area-
Image and Video Communications and Processing, May 2003, pp. Based Matching,” in Proceedings Eighth IEEE International
752–760. Conference on Computer Vision— ICCV 2001, vol. 1, 2001, pp. 90–
[32] M. Bleyer and M. Gelautz, “A Layered Stereo Matching Algorithm 97.
Using Image Segmentation and Global Visibility Constraints,” in [54] S. Sah and N. Jotwani, “Stereo Matching using Multi-resolution
ISPRS J. Photogrammetry and Remote Sensing, vol. 59, no. 3, 2005, Images on CUDA,” in International Journal of Computer
pp. 128–150. Applications, vol. 56, no. 12, 2012, pp. 47–55.
[33] V. Kolmogorov and R. Zabih, “Computing Visual Correspondence [55] A. Ivanavičius, H. Simonavičius, J. Gelšvartas, A. Lauraitis, R.
with Occlusions Using Graph Cuts,” in Proc. Int’l Conf. Computer Maskeliūnas, P. Cimmperman and P. Serafinavičius, “Real-time
Vision, vol. 2, 2001, pp. 508–515. CUDA-based stereo matching using Cyclops2 algorithm,” in
[34] Q. Yang, L. Wang, R. Yang, H. Stewenius, and D. Nister, “Stereo EURASIP Journal on Image and Video Processing, 2018. DOI:
Matching with Color-Weighted Correlation, Hierarchical Belief /10.1186/s13640-018-0253-2
Propagation and Occlusion Handling,” in Proc. IEEE Conf. Computer [56] M. Hallek, F. Smach and M. Atri, “Real-time stereo matching on
Vision and Pattern Recognition, June 2006, pp. 492–504. CUDA using Fourier descriptors and dynamic programming,” in
[35] C. Lei, J. Selzer, and Y.-H. Yang, “Region-Tree Based Stereo Using Computational Visual Media, vol. 5, no. 1, Mar. 2019, pp. 59–71.
Dynamic Programming Optimization,” in Proc. IEEE Conf. Computer
Vision and Pattern Recognition, June 2006.
[36] J. Sun, Y. Li, S. Kang, and H.-Y. Shum, “Symmetric Stereo Matching
for Occlusion Handling,” in Proc. IEEE Conf. Computer Vision and
Pattern Recognition, vol. 2, June 2005, pp. 399–406.
[37] C. L. Zitnick, S. B. Kang, M. Uyttendaele, S. Winder, and R. Szeliski,
“High-Quality Video View Interpolation Using a Layered
Representation,” in Proc. ACM SIGGRAPH ’04, vol. 23, no. 3, 2004,
pp. 600–608.
[38] A. Blake and A. Zisserman, Visual Reconstruction. London, England:
The MIT Press, 1987.
[39] S. T. Barnard, “Stochastic Stereo Matching over Scale,” in IJCV, vol.
3, no. 1, 1989, pp. 17–32.
[40] S. Geman and D. Geman, “Stochastic relaxation, Gibbs distribution,
and the Bayesian restoration of images,” in IEEE Transactions on
Pattern Analysis and Machine Intelligence, vol. PAMI-6, no. 6, 1984,
pp. 721–741.
[41] J. Marroquin, S. Mitter, and T. Poggio, “Probabilistic Solution of Ill-
Posed Problems in Computational Vision,” in J. Am. Stat. Assoc., vol.
82, no. 397, 1987, pp. 76–89.
[42] P. B. Chou and C. M. Brown, “The Theory And Practice Of Bayesian
Image Labeling,” in International Journal of Computer Vision, vol. 4,
no. 3, 1990, pp. 185–210.
[43] D. Geiger and F. Girosi, “Mean Field Theory for Surface
Reconstruction,” in IEEE Transactions on Pattern Analysis and
Machine Intelligence, vol. 13, no. 5, 1991, pp. 617–630.
[44] Y. Boykov, O. Veksler, and R. Zabih, “Fast Approximate Energy
Minimization via Graph Cuts,” in IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 23, no. 11, 2001, pp. 1222–
1239.
[45] H. Ishika and D. Geiger, “Occlusions, Discontinuities, and Epipolar
Lines in Stereo,” in Burkhardt H., Neumann B. (eds) Computer Vision
— ECCV 98, Springer, Berlin, Heidelberg, vol. 1406, 1998, pp. 232–
248.
[46] V. Kolmogorov and R. Zabih, “Computing Visual Correspondence
with Occlusions Using Graph Cuts,” in Proceedings Eighth IEEE
International Conference on Computer Vision. ICCV 2001, vol. 2,
2001, pp. 508–515.
[47] S. Roy and I. J. Cox, “A Maximum-Flow Formulation of the N-Camera
Stereo Correspondence Problem,” in Sixth International Conference on
Computer Vision (IEEE Cat. No.98CH36271), 1998, pp. 492–499.
[48] O. Veksler, “Efficient Graph-based Energy Minimization Methods in
Computer Vision,” PhD thesis, Cornell University, USA, 1999.
[49] J. Kim, V. Kolmogorov and R. Zabih, Visual Correspondence Using
Energy Minimization and Mutual Information, in Proceedings Ninth
IEEE International Conference on Computer Vision, Nice, France,
vol.2, 2003, pp. 1033–1040.
[50] T. Kanade and M. Okutomi, “A Stereo Matching Algorithm with an
Adaptive Window: Theory and Experiment,” in IEEE Transactions on
Pattern Analysis and Machine Intelligence, vol. 16, no. 9, 1994, pp.
920–932.
143
Authorized licensed use limited to: UNIVERSITAETSBIBLIOTHEK CHEMNITZ. Downloaded on January 30,2022 at 20:17:58 UTC from IEEE Xplore. Restrictions apply.