01_CV2425_Intro to CV_LR.pdf

Full Transcript

Introduction to Computer Vision
Dr. Eng. Laksmita Rahadianti, Muhammad Febrian Rachmadi, Ph.D., Dr. Dina Chahyati, Prof. Dr. Aniati M. Arymurthy
CSCE604133 Computer Vision
Fakultas Ilmu Komputer, Universitas Indonesia

Acknowledgements
These slides are created with reference to:
Computer Vision: Algorithms and Applications, 2nd ed., Richard Szeliski. https://szeliski.org/Book/
Digital Image Processing, Gonzalez and Woods, 3rd ed., 2008.
Course slides for CSCE604133 Image Processing, Faculty of Computer Science, Universitas Indonesia.
Introduction to Computer Vision, Cornell Tech. https://www.cs.cornell.edu/courses/cs5670/2024sp/lectures/lectures.html
Computer Vision, University of Washington. https://courses.cs.washington.edu/courses/cse576/08sp/
Computer Vision 2024 - Intro to CV

Human vision can understand this image effortlessly
Image courtesy of Tomas Anton Escobar, Unsplash.

Recognition

Segmentation

Reconstruction

How does human vision do it? How would a computer do it?

What is Computer Vision?
Mathematical techniques for recovering the three-dimensional shape and appearance of objects in imagery.

A Brief History of Computer Vision
Szeliski, 2010.

A Brief History of Computer Vision (2)
Decades shown: 1980s, 1990s.
Figure panels:
  Shape from Shading (Freeman and Adelson, 1991; image from Yu and Chen, 2018)
  Dense Stereo Matching (Boykov, Veksler, and Zabih, 2001; image from Li, GitHub, 2024)
  3D Modelling (Schaffalitzky and Zisserman, 2002)
  Edge Detection (Freeman and Adelson, 1991; image from Yu and Chen, 2018)

Computer Vision Applications Now
Image Registration

Computer Vision Applications Now (2)
Augmented and Virtual Reality

Computer Vision Applications Now (3)
ETLE (Electronic Traffic Law Enforcement), NTMC KORLANTAS POLRI

Computer Vision Applications Now (4)
Autonomous vehicles
https://waymo.com/waymo-driver/

Computer Vision Applications Now (5)
Medical Applications
  Computer Aided Diagnosis (Nurhayati, et al, 2021)
  3D visualization (Noshadi, et al, 2017)

The 3 R's of Computer Vision
Recognition, Reconstruction, and Reorganization (or Registration).
Malik, Jitendra, et al. "The three R's of computer vision: Recognition, reconstruction and reorganization." Pattern Recognition Letters 72 (2016): 4-14.

The 3 R's of Computer Vision: Recognition
Recognition is about attaching semantic category labels to objects and scenes as well as to events and activities. (Malik et al., 2016)
Figure sources:
  https://medium.com/analytics-vidhya/yolo-explained-5b6f4564f31
  Zhao, Hengshuang, et al. "Icnet for real-time semantic segmentation on high-resolution images." Proceedings of the European conference on computer vision (ECCV). 2018.

The 3 R's of Computer Vision: Reconstruction
Reconstruction is traditionally about recovering information from images or reconstructing images to a different form. (Malik et al., 2016)
Figure sources:
  Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
  Fu, Jiacheng, et al. "Fast 3D reconstruction via event-based structured light with spatio-temporal coding." Optics Express 31.26 (2023).

The 3 R's of Computer Vision: Reorganization
Reorganization, or Registration, is usually called "perceptual organization" in human vision. (Malik et al., 2016)
Figure source:
  Zhu, Bai, et al. "Robust registration of aerial images and LiDAR data using spatial constraints and Gabor structural features." ISPRS Journal of Photogrammetry and Remote Sensing 181 (2021): 129-147.

3D Scene vs 2D Image
Forward Process
  Models the physical process by which a 3D scene (movement, light) is projected onto a 2D image.
  Studied in the fields of physics/optics and computer graphics.
Inverse Process
  Takes the 2D image and extracts the information, the 3D scene, and other properties.
  This is usually what is called computer vision.
Image courtesy of Alina Fiene, Unsplash.

How does Human Vision do it?
Diagram (Szeliski, 2010): Geometry (3D) + Photometry (appearance), Images (2D).
Human vision can understand a lot from so little.
Torralba, 2008. Karungaru, 2007.

Why is 2D to 3D difficult for computers?
Diagram (Szeliski, 2010): Geometry (3D) + Photometry (appearance), Images (2D).
Human vision does this effortlessly. How can computer vision do the same?
Human vision is teamwork between eye and brain.
Computer vision has cameras as eyes, but lacks the brain.
Computers can be better at "easy" things; humans are better at "hard" things.
The process from 2D to 3D is severely underdetermined, with many unknowns.

Computer Vision Topics
Diagram (Szeliski, 2010): Geometry (3D) + Photometry (appearance), Images (2D).
Image Formation
  Geometry and photometry
  Image acquisition
  Image anatomy
Vision
  Human Vision
  Computer Vision
Image Processing
  Image enhancement
  Image transform

Machine Learning for Computer Vision
Diagram: Geometry (3D) + Photometry (appearance), Images (2D).
In recent years, computational capacity has enabled computer vision research to develop solutions using machine learning models.
Long, et al, 2015.

Drivers and Challenges in Computer Vision
https://www.aiacceleratorinstitute.com/10-top-drivers-and-challenges-in-computer-vision-in-2023/
Drivers
  Growing demand for computer vision solutions
  Increasing need for automation
  Increasing demand for vision-guided robotic systems
  Growing adoption of Industry 4.0
  Growing demand for AI in computer vision
  Increased manufacturing of autonomous vehicles
Challenges
  High costs
  Lack of experienced professionals
  Size of required data sets
  Need for regular monitoring

Computer Vision in Industry
Processing power for real-time, data-intensive applications
  Running computer vision in the cloud heavily limits real-time applications.
  Computer vision solutions will need to be deployed on edge endpoints for most use cases.
Hardware limitations
  Solutions require hardware to run: cameras to provide the visual input, and computing hardware for AI inference.
Scaling problems
  What works here may not work at bigger scales.
  There are too many variations of scenarios to consider them all.

Computer Vision in Real Life
Highest accuracy is not always the goal
  Business-driven objectives
Data is not always available
  Data design and data preparation are limited
  Data annotation is a nightmare

Case Study: Count the motorcycles on Jalan Margonda
What type of camera should I use?
What about lighting and other mitigation factors?
After obtaining the image, is it good enough?
What do I need to find?
How do I find them?
Which ones are motorcycles?

Case Study: Count the motorcycles on Jalan Margonda
Image Acquisition
Image Enhancement
Image Segmentation
Feature Detection
Recognition

Case Study: Online Exam Proctoring System (for example, in SIMAK Online)
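
The five stages named in the motorcycle-counting case study (Image Acquisition, Image Enhancement, Image Segmentation, Feature Detection, Recognition) can be illustrated with a toy sketch. This is not the lecturers' method: the synthetic frame, the threshold, the area limits, and every function name below are assumptions chosen only to make each stage concrete. A real system would use camera footage and a trained recognizer.

```python
from collections import deque

import numpy as np


def acquire_image():
    """Image acquisition: a synthetic grayscale frame stands in for a camera (assumption)."""
    frame = np.full((40, 60), 50, dtype=np.uint8)  # dim road surface
    frame[5:9, 5:15] = 120      # 4 x 10 blob, pretend motorcycle
    frame[20:24, 30:40] = 110   # 4 x 10 blob, pretend motorcycle
    frame[10:22, 45:58] = 130   # 12 x 13 blob, pretend car
    return frame


def enhance(frame):
    """Image enhancement: linear contrast stretch to the full 0..255 range."""
    lo, hi = int(frame.min()), int(frame.max())
    if hi == lo:
        return np.zeros_like(frame)
    stretched = (frame.astype(np.float64) - lo) * 255.0 / (hi - lo)
    return stretched.astype(np.uint8)


def segment(frame, threshold=128):
    """Image segmentation: a global threshold separates objects from the road."""
    return frame > threshold


def detect_regions(mask):
    """Feature detection: find 4-connected regions and describe each by its pixel area."""
    rows, cols = mask.shape
    visited = np.zeros(mask.shape, dtype=bool)
    areas = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r, c] or visited[r, c]:
                continue
            # Breadth-first flood fill over one connected region.
            queue = deque([(r, c)])
            visited[r, c] = True
            area = 0
            while queue:
                y, x = queue.popleft()
                area += 1
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and mask[ny, nx] and not visited[ny, nx]:
                        visited[ny, nx] = True
                        queue.append((ny, nx))
            areas.append(area)
    return areas


def recognize(areas, min_area=20, max_area=80):
    """Recognition: a toy size rule (assumption) labels mid-sized regions as motorcycles."""
    return ["motorcycle" if min_area <= a <= max_area else "other" for a in areas]


def count_motorcycles(frame):
    """Run the stages in order and count regions labelled as motorcycles."""
    labels = recognize(detect_regions(segment(enhance(frame))))
    return labels.count("motorcycle")


if __name__ == "__main__":
    print(count_motorcycles(acquire_image()))  # prints 2
```

Each function maps to one stage, so a stage can be swapped out independently, for example replacing the size rule in `recognize` with a learned classifier. The questions on the case-study slide (camera choice, lighting, image quality) correspond to the assumptions hard-coded into `acquire_image` and `enhance`.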
