Podcast
Questions and Answers
Stereopsis allows for the perception of what?
Stereopsis allows for the perception of what?
- Color consistency across various lighting conditions.
- Two dimensional images with no depth perception.
- Object permanence despite occlusions.
- Depth and three-dimensional structure. (correct)
Which of the following is NOT typically an application of computer-implemented stereoscopic perception?
Which of the following is NOT typically an application of computer-implemented stereoscopic perception?
- Medical diagnostics based on thermal imaging. (correct)
- Aerial reconnaissance for mapping.
- Industrial inspection of 3D objects.
- Visual robot navigation.
In single-view geometry, what makes determining structure and depth challenging?
In single-view geometry, what makes determining structure and depth challenging?
- Excessive lighting variations.
- Inherent ambiguity. (correct)
- Insufficient image resolution.
- Lack of texture in the scene.
What is the primary benefit of using multiple views (cameras) to determine the location of points P1 and P2 in stereo vision?
What is the primary benefit of using multiple views (cameras) to determine the location of points P1 and P2 in stereo vision?
Which cue presents the illusion of depth?
Which cue presents the illusion of depth?
In the context of depth perception, what effect does harsher lighting with fast falloff have on an image?
In the context of depth perception, what effect does harsher lighting with fast falloff have on an image?
What are the two key processes involved in estimating depth from stereo vision?
What are the two key processes involved in estimating depth from stereo vision?
In a simple stereo system geometry, what does the baseline represent?
In a simple stereo system geometry, what does the baseline represent?
What assumption is made about the optical axes of the cameras?
What assumption is made about the optical axes of the cameras?
In a simple stereo system, if the disparity is equal to zero, what does this imply about the depth of the point?
In a simple stereo system, if the disparity is equal to zero, what does this imply about the depth of the point?
What is the epipolar plane defined as in epipolar geometry?
What is the epipolar plane defined as in epipolar geometry?
What does the epipolar constraint allow?
What does the epipolar constraint allow?
How does image rectification simplify stereo calculations?
How does image rectification simplify stereo calculations?
What key assumption is made for simplifying stereo vision with parallel camera planes?
What key assumption is made for simplifying stereo vision with parallel camera planes?
Besides the epipolar constraint, what other soft constraints are used in stereo correspondence to find the best matches?
Besides the epipolar constraint, what other soft constraints are used in stereo correspondence to find the best matches?
What is the 'uniqueness' constraint in stereo correspondence?
What is the 'uniqueness' constraint in stereo correspondence?
What does the 'ordering' constraint imply?
What does the 'ordering' constraint imply?
What is the purpose of comparing every pixel in one image of a stereo pair with every pixel on the same epipolar line in the other?
What is the purpose of comparing every pixel in one image of a stereo pair with every pixel on the same epipolar line in the other?
What is the 'sum of square differences' used for in dense correspondence search?
What is the 'sum of square differences' used for in dense correspondence search?
What may occur in stereo vision when some surface points are not visible in one of the images?
What may occur in stereo vision when some surface points are not visible in one of the images?
How does dynamic programming address the stereo correspondence problem?
How does dynamic programming address the stereo correspondence problem?
What is a scanline?
What is a scanline?
What is indicated by polygons that join two profiles indicate matches between successive intervals?
What is indicated by polygons that join two profiles indicate matches between successive intervals?
What challenge do you face when performing stereo corrospondence?
What challenge do you face when performing stereo corrospondence?
When searching for corresponding values, how does the pixel look?
When searching for corresponding values, how does the pixel look?
In regards to Matlab, if the dispairty is 0, what's the implication?
In regards to Matlab, if the dispairty is 0, what's the implication?
In Matlab what does the code rgb2gray()
do?
In Matlab what does the code rgb2gray()
do?
What tool is used to make an image 3-D
What tool is used to make an image 3-D
Name a topic that is dedicated to the second part of a course.
Name a topic that is dedicated to the second part of a course.
Why is submission required through Fenix
Why is submission required through Fenix
In stereo images, what might the ordering constraint reveal?
In stereo images, what might the ordering constraint reveal?
What process involves finding corresponding points?
What process involves finding corresponding points?
In order to have simple stereo images, what is assumed about the optical axes?
In order to have simple stereo images, what is assumed about the optical axes?
What does project 2 replicate?
What does project 2 replicate?
Flashcards
Stereopsis
Stereopsis
Perception of depth and 3D structure through combined visual input from two eyes.
Single View Ambiguity
Single View Ambiguity
Structure and depth perception is ambiguous from single point of view.
3D Inference
3D Inference
Inferring depth from different clues like perspective, focus, size, lighting and occlusion in a single image
Stereo Vision Processes
Stereo Vision Processes
Signup and view all the flashcards
Stereo System Parameters
Stereo System Parameters
Signup and view all the flashcards
Disparity
Disparity
Signup and view all the flashcards
Disparity Map
Disparity Map
Signup and view all the flashcards
Epipolar Plane
Epipolar Plane
Signup and view all the flashcards
Epipolar Line
Epipolar Line
Signup and view all the flashcards
Epipole
Epipole
Signup and view all the flashcards
Epipolar Constraint
Epipolar Constraint
Signup and view all the flashcards
Two View Geometry Constraint
Two View Geometry Constraint
Signup and view all the flashcards
Image Rectification
Image Rectification
Signup and view all the flashcards
Similarity Constraint
Similarity Constraint
Signup and view all the flashcards
Uniqueness Constraint
Uniqueness Constraint
Signup and view all the flashcards
Ordering Constraint
Ordering Constraint
Signup and view all the flashcards
Limited Disparity Gradient
Limited Disparity Gradient
Signup and view all the flashcards
Occlusion
Occlusion
Signup and view all the flashcards
Dense Correspondence Search
Dense Correspondence Search
Signup and view all the flashcards
Textureless Regions
Textureless Regions
Signup and view all the flashcards
Brightness Constancy Violations
Brightness Constancy Violations
Signup and view all the flashcards
Large Baseline Issues
Large Baseline Issues
Signup and view all the flashcards
Camera Calibration Errors
Camera Calibration Errors
Signup and view all the flashcards
Study Notes
- Stereoscopic vision uses the principles of computational vision.
- Stereo vision topics include sections 19.6-19.7 and chapter 7 Stereopsis.
- Stereopsis can be visually referenced via Wikipedia.
Stereopsis
- Stereopsis is the perception of depth and 3D structure via visual input from both eyes.
- Stereoscopic perception's computer implementation has applications in visual robot navigation, aerial reconnaissance, and Cartography.
- Stereoscopic perception implementation is also used in 3D sensing/reconstruction, and industrial inspection.
NASA Curiosity Rover
- NASA's Curiosity Rover uses stereo vision for Mars exploration.
- The rover has 23 cameras total.
- 9 cameras are for engineering purposes.
- 7 cameras are for science purposes.
- 7 cameras are for entry, descent and landing purposes.
Stereo Vision
- Structure and depth are inherently ambiguous from single views.
- The goal is to determine where points P1 and P2 actually are in a 3D.
- This is resolved in stereo vision.
- Depth is inferred from different clues in an image.
- Perspective can give a notion of depth with converging lines.
- Shallow focus/blur cues depth.
- Size and scale provide depth information; smaller objects are farther away.
- Lighting/shading can give a sense of depth; harsher shadows with fast falloff create this illusion.
- Objects are closer the more they occlude things that are further.
Basic Stereo Geometry
- To determine actual location, use the different views from cameras.
- Estimating depth from stereo vision includes image point correspondence, and information on camera pose (calibration).
Simple Stereo System Geometry
- Parallel optical axis and known calibrated camera parameters are assumed.
- Baseline is the line connecting optical centres (B), and focal length is f.
- Point P is at distance Z in the camera coordinate systems (virtual images).
- The assumption of similar triangles is used for calculations for the stereo vision geometry.
- The formula for the disparity calculation is Z is equal to focal length multiplied by baseline, and divided by XL - XR.
- Disparity is equal to the difference between XL - XR.
- 0 depth means infinite disparity.
Disparity Map
- A ground truth disparity map and computed disparity map are used.
Epipolar Geometry
- The baseline is a line joining the cameras centres.
- The epipolar plane contains the baseline and a world point P.
- Epipolar lines (pairs) are the intersection of the epipolar plane with the image plane, potentially outside the image.
- An epipole is a point of intersection of the baseline with the image plane.
- Geometry of two views constraints where the corresponding pixel for some image point in the first view must occur in the second view.
- Epipolar constraint: with a calibrated stereo rig, a point p is constrained to lie on the associated epipolar line l'.
Converging Cameras
- Epipolar constraint reduces the correspondence problem to a 1D search along epipolar line for converging cameras.
- Two images with superimposed corresponding points have epipolar lines in white which can be used to analyse.
Parallel Cameras
- For parallel image planes, the baseline's intersection with the image plane is at infinity.
- The epipoles are at infinity and epipolar lines are parallel.
- An epipolar geometry is used for motion parallel to the image plane.
- Superimposed corresponding points and their epipolar lines (in white) can be used to analyse a pair of parallel images.
Image Rectification
- Stereo algorithm calculations are simplified when images of interest are rectified – replaced by two equivalent pictures with a common image plane parallel to the baseline joining the two optical centres.
- The two image planes II and II' are reprojected onto a common plane parallel to the baseline.
- Epipolar lines l and l' associated with points p and p in the two pictures map onto a common scanline also parallel to/passing through reprojected points.
- Simplified parallel planes are needed, since they are assumed to be co-planar with the same focal lengths.
- The epipolar lines are horizontal and at the same y location in the image.
Stereo Correspondence
- The epipolar constraint is a hard constraint.
- Additional soft constraints include similarity where corresponding pixels should look alike.
- Also, there should be uniqueness (no more than one match).
- Ordering makes pixels appear in the same order in both images.
- Limited disparity gradient means that depth doesn't change rapidly.
- To find matches in an image pair, it's assumed that scene points are visible in views, and that image regions for the matches are similar in appearance.
Dense Correspondence Search
- Each pixel/window in the left image must be compared with every pixel/window on same epipolar line in right image.
- The pick position must have a minimum match cost which include sum of squared differences, and normalized correlation, for different intensity images.
Occlusion Problem
- On the left, the order of feature points along the two (oriented) epipolar lines is the same.
- On the right, surface points aren't visible in one image and the image points' order are not the same in the two pictures.
Dynamic Programming Formulation
- Instead of individual correspondences to estimate disparities, optimise correspondence assignments joinly.
- This can be done via scanline, and through a full 2D grid (graph cuts).
Stereo Correspondence Challenges
- What defines good stereo correspondence includes having match quality, and smoothness.
- Challenges with stereo correspondence includes low contrast and textureless image regions.
- Other challenges consist of occlusions, violations of brightness constancy, and really large baselines (foreshortening/appearance change).
- Final challenges include camera calibration errors.
Disparity map - MATLAB Example
- To load a rectified stereo pair image, use I1=imread('rectified_left.png') and I2=imread('rectified_right.png').
- The Anaglyph function creates the stereo anaglyph of the image and allows it to be used with stereo glasses, and can be called by A=stereoAnaglyph(I1,I2).
- Convert rectified input color images to grayscale images using: J1 = rgb2gray(I1) and J2 = rgb2gray(I2).
- Specify disparity range and the uniqueness threshold to compute with disparitySGM function.
- The disparityMap = disparitySGM(J1,J2, 'DisparityRange',disparityRange, 'UniquenessThreshold',20) command calculates a disparity map through semi-global matching.
- The display disparity map sets display range to same value as range: imshow(disparityMap,disparityRange).
Project 2
- Course project consist of 7 topics, and use MATLAB examples, replicate the example and apply the same strategy to a case.
- Project due on April 22nd.
- Project submission through Fenix.
Next Class
- Next class will discuss projective geometry.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.