Podcast
Questions and Answers
Which type of light source is commonly used as a substitute for realistically calculating inter-reflections in a scene?
Which type of light source is commonly used as a substitute for realistically calculating inter-reflections in a scene?
- Distant point source
- Ambient light (correct)
- Area source
- Global illumination
A pixel's brightness uniquely identifies the material properties of the surface it represents.
A pixel's brightness uniquely identifies the material properties of the surface it represents.
False (B)
Besides light source characteristics, surface orientation, and material properties, list one additional factor that affects a pixel's brightness.
Besides light source characteristics, surface orientation, and material properties, list one additional factor that affects a pixel's brightness.
Reflected light and shadows from surrounding surfaces, Sensor gain
The fraction of light that a surface reflects is known as its ______.
The fraction of light that a surface reflects is known as its ______.
Match the following concepts with their descriptions:
Match the following concepts with their descriptions:
Which of the following is the main goal of color constancy?
Which of the following is the main goal of color constancy?
Computers generally perform color constancy tasks as well as humans.
Computers generally perform color constancy tasks as well as humans.
Describe the 'White World Assumption' approach to color correction.
Describe the 'White World Assumption' approach to color correction.
The 'Gray World Assumption' corrects colors by presuming that the average color in an image should be ______.
The 'Gray World Assumption' corrects colors by presuming that the average color in an image should be ______.
What is the purpose of color correction techniques in image processing?
What is the purpose of color correction techniques in image processing?
A distant point source, such as the sun, provides multiple illumination directions.
A distant point source, such as the sun, provides multiple illumination directions.
Name one example of an 'area source' of light.
Name one example of an 'area source' of light.
A global illumination model accounts for ______ in a modeled scene.
A global illumination model accounts for ______ in a modeled scene.
Match the color correction technique to its normalizing process:
Match the color correction technique to its normalizing process:
Which of the following is primarily responsible for the sensation of depth and 3D structure from a 2D image?
Which of the following is primarily responsible for the sensation of depth and 3D structure from a 2D image?
Which of the following is true regarding how image filtering forms a new image?
Which of the following is true regarding how image filtering forms a new image?
Image transformations focus on changing the overall appearance or layout of an image by directly adjusting individual pixel values.
Image transformations focus on changing the overall appearance or layout of an image by directly adjusting individual pixel values.
Which of the following describes the role of filters in information extraction?
Which of the following describes the role of filters in information extraction?
What is the primary function of filters in image enhancement?
What is the primary function of filters in image enhancement?
In linear filtering, the weights by which the values of pixels in the neighborhood are multiplied are known as filter ________.
In linear filtering, the weights by which the values of pixels in the neighborhood are multiplied are known as filter ________.
Linear filtering involves adding up the multiplied values of the neighborhood pixels to calculate the final value of the output image.
Linear filtering involves adding up the multiplied values of the neighborhood pixels to calculate the final value of the output image.
What is the alternative term used when the filter in linear filtering is rotated?
What is the alternative term used when the filter in linear filtering is rotated?
Which process is being optimized when separable filtering is applied?
Which process is being optimized when separable filtering is applied?
What is the primary advantage of using separable filtering in image processing?
What is the primary advantage of using separable filtering in image processing?
Match the following filters with their descriptions:
Match the following filters with their descriptions:
What type of filter is best suited to smooth an image while preserving edges?
What type of filter is best suited to smooth an image while preserving edges?
Which parameter controls the spread of the kernel in a Gaussian filter?
Which parameter controls the spread of the kernel in a Gaussian filter?
A larger kernel size in a Gaussian filter leads to less pronounced blurring.
A larger kernel size in a Gaussian filter leads to less pronounced blurring.
To achieve a sharpening effect, what process is typically applied following a blurring operation?
To achieve a sharpening effect, what process is typically applied following a blurring operation?
Which action do band-pass filters perform on the frequency spectrum of an image?
Which action do band-pass filters perform on the frequency spectrum of an image?
What do high frequencies in images primarily correspond to?
What do high frequencies in images primarily correspond to?
What aspect of pixel intensity does the first derivative measure?
What aspect of pixel intensity does the first derivative measure?
Where are points where intensity changes more suddenly typically detected?
Where are points where intensity changes more suddenly typically detected?
How is the Laplacian of Gaussian (LoG) filter created?
How is the Laplacian of Gaussian (LoG) filter created?
Filters that can be oriented in any direction, allowing for adjustable responses are known as _______.
Filters that can be oriented in any direction, allowing for adjustable responses are known as _______.
What is the other common term for a 'summed area table'?
What is the other common term for a 'summed area table'?
What is the main purpose of using integral images in computer vision?
What is the main purpose of using integral images in computer vision?
What type of values does a binary image contain?
What type of values does a binary image contain?
What morphological operation expands or thickens objects in a binary image?
What morphological operation expands or thickens objects in a binary image?
Which morphological operation reduces or thins objects in a binary image?
Which morphological operation reduces or thins objects in a binary image?
Opening is a morphological operation that smooths object boundaries and removes small objects by applying dilation followed by erosion.
Opening is a morphological operation that smooths object boundaries and removes small objects by applying dilation followed by erosion.
Closing fills small holes and gaps in objects by applying erosion followed by dilation.
Closing fills small holes and gaps in objects by applying erosion followed by dilation.
What tool calculates the shortest distance from any point in a binary image to the nearest boundary?
What tool calculates the shortest distance from any point in a binary image to the nearest boundary?
Which application mainly applies 'city block' or 'Euclidean distance'?
Which application mainly applies 'city block' or 'Euclidean distance'?
Which concept involves identifying regions in an image where adjacent pixels share the same value or label?
Which concept involves identifying regions in an image where adjacent pixels share the same value or label?
What term describes pixels that are immediately horizontally or vertically next to each other?
What term describes pixels that are immediately horizontally or vertically next to each other?
What does padding refer to in the context of image filtering?
What does padding refer to in the context of image filtering?
What is the purpose of zero padding in image processing?
What is the purpose of zero padding in image processing?
In image processing, what does 'clamp to edge' (or replicate) padding do?
In image processing, what does 'clamp to edge' (or replicate) padding do?
When using mirror padding, how are pixels outside the original image boundary determined?
When using mirror padding, how are pixels outside the original image boundary determined?
In 'extend' padding, the signal is extended by subtracting the mirrored version of the _______ value.
In 'extend' padding, the signal is extended by subtracting the mirrored version of the _______ value.
Non-linear filters use weighted sums of input pixels to generate output images.
Non-linear filters use weighted sums of input pixels to generate output images.
What is a key characteristic of non-linear filters compared to linear filters?
What is a key characteristic of non-linear filters compared to linear filters?
What type of value does median filtering select from a pixel's neighborhood?
What type of value does median filtering select from a pixel's neighborhood?
Median filtering is highly effective at smoothing away Gaussian noise due to its dependency on multiple pixel values.
Median filtering is highly effective at smoothing away Gaussian noise due to its dependency on multiple pixel values.
What is the unique characteristic of bilateral filtering?
What is the unique characteristic of bilateral filtering?
What determines whether bilateral filtering preserves edges better than traditional smoothing?
What determines whether bilateral filtering preserves edges better than traditional smoothing?
What is a primary disadvantage of Bilateral Filtering compared to simpler methods like linear filters?
What is a primary disadvantage of Bilateral Filtering compared to simpler methods like linear filters?
What kind of image is used within a Guided Image Filter to direct the target image filtering?
What kind of image is used within a Guided Image Filter to direct the target image filtering?
In Guided Image Filtering, the filtering effectiveness relies largely on what factor?
In Guided Image Filtering, the filtering effectiveness relies largely on what factor?
Which of the following is NOT a common image transformation technique?
Which of the following is NOT a common image transformation technique?
Linear filtering involves convolving an image with a rotated filter kernel, also known as correlation.
Linear filtering involves convolving an image with a rotated filter kernel, also known as correlation.
What is the primary purpose of image filtering?
What is the primary purpose of image filtering?
The process of decomposing a two-dimensional filter kernel into two one-dimensional filters to optimize convolution operations is known as ______.
The process of decomposing a two-dimensional filter kernel into two one-dimensional filters to optimize convolution operations is known as ______.
Match the image filtering technique with its primary characteristic.
Match the image filtering technique with its primary characteristic.
Which of the following is the primary reason for using image filtering?
Which of the following is the primary reason for using image filtering?
Image transformations modify individual pixel values, while image filtering changes the overall appearance or layout.
Image transformations modify individual pixel values, while image filtering changes the overall appearance or layout.
What is the role of 'coefficients' in the context of linear filtering?
What is the role of 'coefficients' in the context of linear filtering?
In linear filtering, the operation where the filter is rotated before being applied to the image is called ______.
In linear filtering, the operation where the filter is rotated before being applied to the image is called ______.
Match the derivatives with their functionality in image processing:
Match the derivatives with their functionality in image processing:
What is the purpose of a 'linear filter' in image processing?
What is the purpose of a 'linear filter' in image processing?
What is the primary use of 'band-pass filters' in image processing?
What is the primary use of 'band-pass filters' in image processing?
A filter that averages pixel values within a KxK window is known as a ______ filter.
A filter that averages pixel values within a KxK window is known as a ______ filter.
Match each padding technique with its description:
Match each padding technique with its description:
Which filter is most effective at smoothing an image while better preserving edges compared to a box filter?
Which filter is most effective at smoothing an image while better preserving edges compared to a box filter?
The Laplacian of Gaussian (LoG) operator involves blurring an image with a Gaussian filter followed by taking the first derivative.
The Laplacian of Gaussian (LoG) operator involves blurring an image with a Gaussian filter followed by taking the first derivative.
What is the main advantage of using separable filters?
What is the main advantage of using separable filters?
A ______ is a table that contains the sum of all pixel values to the left and top of each pixel in an image, inclusive.
A ______ is a table that contains the sum of all pixel values to the left and top of each pixel in an image, inclusive.
Match the morphological operation with its description:
Match the morphological operation with its description:
Which of the following is the primary function of the Fourier Transform?
Which of the following is the primary function of the Fourier Transform?
What type of filter is designed to pass low frequencies while attenuating high frequencies?
What type of filter is designed to pass low frequencies while attenuating high frequencies?
High-pass filters are typically used for smoothing signals by removing high-frequency noise.
High-pass filters are typically used for smoothing signals by removing high-frequency noise.
What does the frequency response of a filter indicate?
What does the frequency response of a filter indicate?
A filter is analyzed by using signal of a known frequency.
A filter is analyzed by using signal of a known frequency.
If a filter causes a large change in the magnitude of a sinusoid passed through it, what does this indicate?
If a filter causes a large change in the magnitude of a sinusoid passed through it, what does this indicate?
The Discrete Fourier Transform (DFT) is applied to continuous signals, while the Fourier Transform (FT) is used for discrete sampled signals.
The Discrete Fourier Transform (DFT) is applied to continuous signals, while the Fourier Transform (FT) is used for discrete sampled signals.
Which of the following image processing tasks can be achieved using the Fourier Transform?
Which of the following image processing tasks can be achieved using the Fourier Transform?
Which metric reflects human perception when you want to compare a denoised image to the original image?
Which metric reflects human perception when you want to compare a denoised image to the original image?
What is the purpose of 'upsampling' an image?
What is the purpose of 'upsampling' an image?
Which interpolation method is known for potentially introducing a 'ringing' effect in images during upsampling?
Which interpolation method is known for potentially introducing a 'ringing' effect in images during upsampling?
Downsampling an image involves convolving the image with a high-pass filter to prevent aliasing.
Downsampling an image involves convolving the image with a high-pass filter to prevent aliasing.
A commonly used filter in image downsampling that suppresses aliasing well but can introduce ringing is:
A commonly used filter in image downsampling that suppresses aliasing well but can introduce ringing is:
Which statement describes multi-resolution analysis?
Which statement describes multi-resolution analysis?
Which of the following is a primary application of multi-resolution representations in image processing?
Which of the following is a primary application of multi-resolution representations in image processing?
What is the typical structure of an image pyramid?
What is the typical structure of an image pyramid?
Which of the following is a crucial step to prevent aliasing artifacts when downsampling in image pyramids?
Which of the following is a crucial step to prevent aliasing artifacts when downsampling in image pyramids?
Which filter should be used to repeatedly blur an image to build a Gaussian pyramid?
Which filter should be used to repeatedly blur an image to build a Gaussian pyramid?
To get Laplacian image, [blank] a lower-resolution Gaussian level from the higher-resolution level.
To get Laplacian image, [blank] a lower-resolution Gaussian level from the higher-resolution level.
The cubic $a = -1$ filter provides a sharper fall-off but can exhibit some [blank].
The cubic $a = -1$ filter provides a sharper fall-off but can exhibit some [blank].
In the context of image pyramids, selecting the 'best' filter for downsampling depends on:
In the context of image pyramids, selecting the 'best' filter for downsampling depends on:
What is the purpose of noise removal (denoising) in image processing?
What is the purpose of noise removal (denoising) in image processing?
Applying a high-pass filter on an image will amplify edges and details.
Applying a high-pass filter on an image will amplify edges and details.
Match each filter type with its primary function:
Match each filter type with its primary function:
Why is image resizing conducted?
Why is image resizing conducted?
When do you need to use "Finding objects at different scales"?
When do you need to use "Finding objects at different scales"?
What is the interpolation in image processing?
What is the interpolation in image processing?
What is the decimation process in image processing?
What is the decimation process in image processing?
In the image decimation process, what is true for convolution computing?
In the image decimation process, what is true for convolution computing?
For the common filter of decimation, which is better between linear and binomial?
For the common filter of decimation, which is better between linear and binomial?
What is the advantages by using Multi-Resolution analysis?
What is the advantages by using Multi-Resolution analysis?
Which case of application by using Multiresolution representations?
Which case of application by using Multiresolution representations?
Which definition is correct for "the structure of Image Pyramids"?
Which definition is correct for "the structure of Image Pyramids"?
In "Progressive Resolution Reduction", what does it means?
In "Progressive Resolution Reduction", what does it means?
Why do we need to use filting in pyramid image?
Why do we need to use filting in pyramid image?
How Gaussian Pyramids is created?
How Gaussian Pyramids is created?
What do you store in "Laplacian Pyramids"?
What do you store in "Laplacian Pyramids"?
Laplacian Images can fully reconstruction the original image
Laplacian Images can fully reconstruction the original image
In cubic with $a = -1$ filter, the wavelets analisis filters have [blank]
In cubic with $a = -1$ filter, the wavelets analisis filters have [blank]
Why we need to do Coarse-to-Fine Search?
Why we need to do Coarse-to-Fine Search?
The process of reducing the resolution of an image is called ______.
The process of reducing the resolution of an image is called ______.
A sinusoidal signal's ______ is the maximum distance between the horizontal axis and the vertical position of any signal
A sinusoidal signal's ______ is the maximum distance between the horizontal axis and the vertical position of any signal
Which of the folowing statment is about an application of gaussian filter is true?
Which of the folowing statment is about an application of gaussian filter is true?
Which of the following is the primary purpose of using a Fourier Transform in image processing?
Which of the following is the primary purpose of using a Fourier Transform in image processing?
High-pass filters in image processing are designed to attenuate low frequencies and are commonly used for smoothing images by reducing noise.
High-pass filters in image processing are designed to attenuate low frequencies and are commonly used for smoothing images by reducing noise.
What characteristic of a filter is revealed by its 'frequency response'?
What characteristic of a filter is revealed by its 'frequency response'?
The new magnitude A is called the ______ or magnitude of the filer, while the phase difference $Δ = Φ - φ$ is called the shift or ______.
The new magnitude A is called the ______ or magnitude of the filer, while the phase difference $Δ = Φ - φ$ is called the shift or ______.
Match the image processing task with the appropriate filter type.
Match the image processing task with the appropriate filter type.
Flashcards
Inter-reflection
Inter-reflection
Light that comes from surrounding surfaces due to reflections.
Global Illumination Model
Global Illumination Model
Models that account for inter-reflections within a modeled scene to achieve more realistic lighting.
Factors Affecting Pixel Brightness
Factors Affecting Pixel Brightness
Brightness is determined by light source properties, surface orientation, surface material, inter-reflections, and sensor gain.
Color Constancy Goal
Color Constancy Goal
Signup and view all the flashcards
Color Correction
Color Correction
Signup and view all the flashcards
White World Assumption
White World Assumption
Signup and view all the flashcards
Gray World Assumption
Gray World Assumption
Signup and view all the flashcards
White Balancing Technique
White Balancing Technique
Signup and view all the flashcards
What is an image?
What is an image?
Signup and view all the flashcards
Image Filtering
Image Filtering
Signup and view all the flashcards
Image Transformation
Image Transformation
Signup and view all the flashcards
Why Use Filtering?
Why Use Filtering?
Signup and view all the flashcards
Linear Filter
Linear Filter
Signup and view all the flashcards
Correlation (Image)
Correlation (Image)
Signup and view all the flashcards
Convolution
Convolution
Signup and view all the flashcards
Separable Filtering
Separable Filtering
Signup and view all the flashcards
Box Filter (Moving Average)
Box Filter (Moving Average)
Signup and view all the flashcards
Bilinear (Tent) Filter
Bilinear (Tent) Filter
Signup and view all the flashcards
Gaussian Kernel
Gaussian Kernel
Signup and view all the flashcards
Band-pass Filters
Band-pass Filters
Signup and view all the flashcards
First Derivative (Image)
First Derivative (Image)
Signup and view all the flashcards
Sobel Filter
Sobel Filter
Signup and view all the flashcards
Second Derivative (Image)
Second Derivative (Image)
Signup and view all the flashcards
Laplacian Filter
Laplacian Filter
Signup and view all the flashcards
Laplacian of Gaussian (LoG)
Laplacian of Gaussian (LoG)
Signup and view all the flashcards
Steerable Filters
Steerable Filters
Signup and view all the flashcards
Summed Area Table
Summed Area Table
Signup and view all the flashcards
Integral Image
Integral Image
Signup and view all the flashcards
Image Padding
Image Padding
Signup and view all the flashcards
Zero Padding
Zero Padding
Signup and view all the flashcards
Constant Padding
Constant Padding
Signup and view all the flashcards
Clamp Padding
Clamp Padding
Signup and view all the flashcards
Mirror Padding
Mirror Padding
Signup and view all the flashcards
Cyclic Padding
Cyclic Padding
Signup and view all the flashcards
Non-linear Filters
Non-linear Filters
Signup and view all the flashcards
Linear Filters
Linear Filters
Signup and view all the flashcards
Median Filtering
Median Filtering
Signup and view all the flashcards
Bilateral Filtering
Bilateral Filtering
Signup and view all the flashcards
Guided Image Filter
Guided Image Filter
Signup and view all the flashcards
Binary Image
Binary Image
Signup and view all the flashcards
Morphological operations
Morphological operations
Signup and view all the flashcards
Dilation (Morphological)
Dilation (Morphological)
Signup and view all the flashcards
Erosion (Morphological)
Erosion (Morphological)
Signup and view all the flashcards
Majority Rule (Morphological)
Majority Rule (Morphological)
Signup and view all the flashcards
Opening (Morphological)
Opening (Morphological)
Signup and view all the flashcards
Closing (Morphological)
Closing (Morphological)
Signup and view all the flashcards
Distance Transform
Distance Transform
Signup and view all the flashcards
Connected Components
Connected Components
Signup and view all the flashcards
Fourier Transform Definition
Fourier Transform Definition
Signup and view all the flashcards
Frequency-Based Filters
Frequency-Based Filters
Signup and view all the flashcards
Low-Pass Filter
Low-Pass Filter
Signup and view all the flashcards
High-Pass Filter
High-Pass Filter
Signup and view all the flashcards
Frequency Response
Frequency Response
Signup and view all the flashcards
Sinusoidal Signal
Sinusoidal Signal
Signup and view all the flashcards
Time Period
Time Period
Signup and view all the flashcards
Amplitude (A)
Amplitude (A)
Signup and view all the flashcards
Frequency (f)
Frequency (f)
Signup and view all the flashcards
Phase (Φ)
Phase (Φ)
Signup and view all the flashcards
Convolution with a filter
Convolution with a filter
Signup and view all the flashcards
Gain (Filter)
Gain (Filter)
Signup and view all the flashcards
Phase Shift (Filter)
Phase Shift (Filter)
Signup and view all the flashcards
Magnitude
Magnitude
Signup and view all the flashcards
Discrete Fourier Transform (DFT)
Discrete Fourier Transform (DFT)
Signup and view all the flashcards
Fast Fourier Transform (FFT)
Fast Fourier Transform (FFT)
Signup and view all the flashcards
Two-Dimensional Fourier Transforms
Two-Dimensional Fourier Transforms
Signup and view all the flashcards
Image Frequency Content
Image Frequency Content
Signup and view all the flashcards
Sharpening (Image)
Sharpening (Image)
Signup and view all the flashcards
Blur Removal
Blur Removal
Signup and view all the flashcards
Noise Removal (Denoising)
Noise Removal (Denoising)
Signup and view all the flashcards
FLIP
FLIP
Signup and view all the flashcards
Reason for Resizing
Reason for Resizing
Signup and view all the flashcards
Image Resizing Techniques
Image Resizing Techniques
Signup and view all the flashcards
Upsampling
Upsampling
Signup and view all the flashcards
Downsampling
Downsampling
Signup and view all the flashcards
Multi-Resolution Pyramid
Multi-Resolution Pyramid
Signup and view all the flashcards
Linear (Bilinear)
Linear (Bilinear)
Signup and view all the flashcards
Bicubic
Bicubic
Signup and view all the flashcards
Windowed Sinc
Windowed Sinc
Signup and view all the flashcards
Decimation Process
Decimation Process
Signup and view all the flashcards
Common Filters for decimation
Common Filters for decimation
Signup and view all the flashcards
Multi-Resolution Analysis
Multi-Resolution Analysis
Signup and view all the flashcards
Varying Scales
Varying Scales
Signup and view all the flashcards
Image Compression
Image Compression
Signup and view all the flashcards
Feature Detection
Feature Detection
Signup and view all the flashcards
Image Pyramids Structure
Image Pyramids Structure
Signup and view all the flashcards
Downsampling Pyramid
Downsampling Pyramid
Signup and view all the flashcards
Laplacian Pyramids
Laplacian Pyramids
Signup and view all the flashcards
Gaussian Pyramid construction
Gaussian Pyramid construction
Signup and view all the flashcards
Study Notes
- Local illumination models consider light, surface, and camera properties.
- Inter-reflection from surrounding surfaces significantly contributes to light.
Models of Light Sources
- Distant point sources, like the sun, provide illumination from essentially one direction.
- Area sources, such as white walls, diffuser lamps, and the sky, are a type of light source.
- Ambient light serves as a substitute for explicitly modeling inter-reflections.
- Global illumination models account for inter-reflections within a modeled scene.
Pixel Brightness Factors
- A pixel’s brightness hinges on:
- Light source characteristics (strength, direction, color).
- Surface orientation relative to the light source and camera.
- Surface material and albedo (surface reflectance).
- Reflected light and shadows from surrounding surfaces.
- Sensor gain.
- A single pixel's brightness alone reveals limited information.
Intensity Differences and Shape
- Intensity variations offer insights into shape through:
- Changes in surface normal.
- Texture.
- Proximity effects.
- Indents and bumps on the surface.
- Grooves and creases.
Color Constancy
- Color constancy involves interpreting a surface based on its albedo or "true color," rather than the raw observed intensity.
- Humans excel at color constancy, but computers struggle to achieve comparable performance.
Color Correction
- Adjusts colors by multiplying the R, G, and B values by separate constants.
- There are several methods to determine the constants:
- White World Assumption: Assumes the brightest pixel is white and normalizes all channels by the largest value.
- Gray World Assumption: Assumes the average color should be gray and adjusts each channel (e.g., multiply R by avg(R)/avg((R+G+B)/3)).
- White Balancing: Selects a reference point as the "white" or "gray" color for image correction,
Image Filtering
- Forms a new image by modifying pixel values locally using predefined rules (e.g., blurring or edge detection).
- Filtering operations typically consider neighboring pixels to compute the new value for each pixel in the output image.
Image Transformation
- Forms a new image by modifying global spatial properties of an image (e.g., rotation, scaling, or contrast adjustments).
- Transformation operations change the overall appearance or layout of the image without necessarily affecting individual pixel values directly.
Filtering
- Filters are used to extract useful information from images, such as identifying edges or contours to understand the shape of objects.
- Filters help improve image quality by removing noise or sharpening details.
Linear Filtering
- A common type of neighborhood operator.
- Works by taking a small neighborhood of pixels around each pixel, multiplying their values by filter coefficients, and adding them up to become the new pixel value.
- This operation is called correlation.
- Alternatively, rotating the filter results in convolution and impulse response function.
Separable Filtering
- A technique to optimize convolution by breaking down the kernel into two one-dimensional convolutions.
- Convolving with a 2D filter kernel requires significant computation.
- Separable filtering reduces computational load, requiring fewer operations per pixel and resulting in faster processing.
Examples of Linear Filtering
- Box Filter (Moving Average)
- Bilinear (Tent) Filter
- Gaussian Kernel
- Sobel Operator
- Corner Operator
Box Filter (Moving Average)
- Averages pixel values within a KxK window.
Bilinear (Tent) Filter
- Weights are not uniform like the box filter.
- The center pixel has a higher weighting, decreasing linearly towards the edges.
- Used to smooth an image and preserving edges and is commonly used in image resampling and scaling.
Gaussian Kernel
- Commonly used for blurring and smoothing.
- Based on the Gaussian function.
- Kernel size determines the extent of blurring, with larger kernels producing more smoothing.
- Standard deviation controls the spread of the kernel; a larger standard deviation leads to more pronounced blurring.
Smoothing and Sharpening
- Sharpening is achieved by subtracting the blurred version from the sharpened one, highlighting edges and fine details.
Band-pass Filters
- Removes both low- and high-frequency extremes.
- Typically created by smoothing with a Gaussian filter and applying first or second derivatives.
- Low frequencies in an image correspond to large uniform areas and smooth transitions.
- High frequencies correspond to rapid changes in intensity and fine details like edges, textures, or noise.
Derivatives
- Used to measure changes in intensity.
- First Derivative: Measures the rate of change of pixel intensity, highlighting edges and areas where intensity rapidly changes with a Sobel filter.
- Second Derivative: Measures the rate of change of the first derivative, captures finer details like corners and noise, and detects points where intensity changes suddenly with a Laplacian filter.
- Differences:
- First Derivative detects edges.
- Second Derivative detects finer details.
Laplacian of Gaussian
- Blurring an image with a Gaussian filter followed by the Laplacian operator (second derivative).
- This process is equivalent to direct convolution with a Laplacian of Gaussian filter.
- It highlights regions of rapid intensity change.
Steerable Filters
- Filters can be oriented in any direction, allowing for adjustable responses based on the image content.
- Useful for enhancing features like edges in specific orientations.
- Can be used to construct both feature descriptors and edge detectors.
Summed Area Table
- Also known as an integral image.
- Used for accelerating image processing, particularly for images with repeated convolutions with box filters.
Integral Image
- A table holding the sum of all pixel values to the left and top of a given pixel, inclusive.
- Created by iterating through the original image, where each pixel at location (i, j) contains the sum of all pixels above and to the left of (i, j), including the pixel itself.
Padding (Border Effects)
- Handling edges in image convolution can be challenging due to insufficient surrounding pixels.
- Techniques to Overcome this Issue:
- Zero: set all pixels outside the source image to 0.
- Constant (border color): set all pixels outside the source image to a specified border value.
- clamp: repeat edge pixels indefinitely.
- cyclic wrap: loop "around" the image.
- mirror: reflect pixels across the image edge.
- extend: extend the signal by subtracting the mirrored version of the signal from the edge pixel value.
Non-Linear Filters vs Linear Filtering
- Non-Linear Filters
- Adjust, select, or combine pixel values based on complex relationships, resulting in non-linear output images.
- More complex to implement and optimize.
- Behavior isn't easily analyzed via frequency response methods.
- Preserve sharp edges while effectively reducing or removing noise.
- Linear Filters
- Use weighted sums of input pixels to generate output images. Simple to use and analyze with frequency responses.
- Ineffective against certain types of noise; they tend to blur rather than remove noise.
Median Filtering
- Selects the median value from a pixel's neighborhood to filter out extreme values such as spike noises
- Is a robust alternative to averaging filters
- However, this technique is less effective at smoothing away Gaussian noise due to its dependency on a single pixel value.
- To overcome this imitation techniques such as the α- trimmed mean or weighted median can be used.
Bilateral Filtering
- Combines a Gaussian domain filter and a range filter (intensity similarities to center pixel value) to selectively smooth images while preserving edges
- Edge preservation denoising filter
- Preserves edges, unlike traditional filtering for effective smoothing of noise while maintaining the sharpness, particularly useful in color images.
- Cons: Slower than simpler linear, computational resources are required
- Enhancements: Several acceleration techniques have been developed, such as the bilateral grid and permutohedral lattice, to make bilateral filtering applicable for real-time applications.
Binary Image Processing
- Binary images only have pixel values of 0 and 1.
- Involves Morphological Operations that are used to modify the structure or shape of objects in binary images
Operations
- Dilation: Expands (Thickens) By setting a pixel to 1.
- Erosion: Reduces (Thins) Objects by having to retain all pixels at 1.
- Majority: Sets a pixel to 1 if the majority of pixels under the structuring element are 1.
- Opening: Applying erosion then dilation, creating smooth boundaries,
- Closing: Smooths Object boundaries
Distance Transforms
- A tool to calculate the shortest distance from any point nearest boundary, using either Euclidean distance or other distances.
- Applying image stitching, and fast chamfer-matching
- Areas such as object recognition, and scene interpretation
Connected Components
- Connecting pixels in an image where all adjacent components are equal
- It is crucial for applications like recognizing individual characters
- adjacent
Course Information
- DS-473 is a Computer Vision course within the Bachelor of Science in Computing, Computer Science program.
- The course aims to develop deep understanding of signal processing in frequency domain.
- The course also aims to develop proficiency in applying Fourier transforms to real-world image processing challenges.
- The course covers:
- Fourier Transform and Filters: Analysis of signal decomposition and various filtering methods.
- Image Classification and Segmentation: Utilization in medical imaging, photo tagging, and autonomous driving. Challenges and Traditional Approaches: Addressing issues like high variability and similarity in classes.
- Required reading includes Chapter 3, sections 3.4-3.5.3.
- Recommended reading includes Chapter 3, sections 3.1-3.7.
Thinking in Frequency
- The Fourier Transform decomposes a signal into its frequency components, to understand how filters manipulate different frequency ranges.
Filters
- Filters are designed to affect signals based on frequency.
- Low-pass filters:
- Pass low frequencies while attenuating high frequencies.
- Used for smoothing signals and removing high-frequency noise.
- High-pass filters:
- Pass high frequencies and attenuate low frequencies.
- Used in edge detection by removing low frequencies such as hum in audio.
- Band-pass filters:
- Pass a specific range of medium frequencies, and attenuate below and above this range.
- Used in feature extractions and texture analysis.
- A filter's effect on high, medium, and low frequencies is analyzed by passing a sinusoid of known frequency through it and observing the attenuation.
- Each filter has a frequency response, which the filter attenuates different frequencies.
- Each filter frequency response is similar to the output of a Fourier Transform.
Sinusoidal Signals
- Defined as periodic signal with a waveform as that of a sine wave
- The sine wave's parameters are:
- Amplitude: Increases from 0 at an angle of 0° to a maximum of 1 at 90°, reaches a minimum of -1 at 270°, and returns to 0 at 360°.
- Time period: Defined as the time taken by a periodic signal to complete one cycle; after 360°, the signal repeats. The time period is 2π.
- Frequency: Number of oscillations in one second, mathematically defined as the reciprocal of a period.
- Phase: Its horizontal position of a waveform in one oscillation.
Sinusoids for Analyzing Filters
- Behavior on different frequency ranges, sinusoids of known frequencies are passed through the filter and the output is observed.
- When a sinusoidal signal is convolved with a filter's impulse response, a sinusoid of the same frequency is produced with different magnitude and phase is produced.
- New magnitude A is the filter's gain or magnitude.
- The phase difference (Δ=ϕ -ϕ) is called the shift or phase.
- A large change in magnitude with a strong effect on the original sinusoid.
- Minimal change in magnitude means the filter allows the original sinusoid to pass almost unaffected.
- Many filters introduce a phase shift (horizontal displacement) to the output sinusoid, meaning it's delayed or advanced compared to the original.
Fourier Transform (FT) as a Filter Response Tool
- FT analyzes a filter's effect on each frequency.
- Its output (H(ω)) is a complex number representing the magnitude change (A) and phase shift (φ) caused by the filter at each frequency (ω).
- Magnitude (A) indicates how much a frequency is amplified/attenuated by the filter (F).
- Phase Shift (φ) indicates any delay or advancement in the timing caused by the filter (F).
Discrete Fourier Transform (DFT)
- Specifically for digital signals, or sampled data.
- DFT takes O(N²) operations.
- H(k) = DFT at frequency k.
- = Original Signal at time.
- = Total number of samples.
- = sinusoidal basis functions.
- Fast Fourier Transform (FFT)
- An efficient algorithm to compute the DFT quickly.
- FFT takes O(N logâ‚‚ N) operations.
- FT is applied to continuous signals, while DFT is used for discrete sampled signals.
Discrete Fourier Transform Filters
- Box-3 and Box-5 are smoothing filters (low-pass), blur the image by averaging neighboring pixel values.
- Linear Filter: smoothing filter that is similar to Box-3 but with slightly weights that emphasize center pixel more.
- Binomial Filter is a smoothing, and similar to the Gaussian filter and it is used for blurring while reduing noise.
- Sobel: an edge detection filter that emphasizes horizontal or vertical gradients in the image.
- Corner: detects corners in images, highlighting areas where intensity changes in multiple directions.
Two-Dimensional Fourier Transforms
- An image can be visualized as a 2D function of position (x and y coordinates).
- The 2D FT extends the concept to analyze its frequency content across both horizontal and vertical directions.
- Here, N and M are the width and the height of the image.
Applications of Fourier Transform
- Understanding Image Content: Provides insights into its characteristics.
- High frequencies correspond to rapid changes like sharp details and edges.
- Low frequencies represent slow, smooth variations and overall background.
- Image Enhancement:
- Sharpening: Amplify high-frequency components by enhancing edges and other details in the image's Fourier Transform.
- Blur Removal: The effect can undo blurring in the Fourier if the blur type is known.
- Noise Removal (Denoising): Noise has high frequencies, reduced by removing high frequencies while keeping the edges and textures.
Evaluation of Image Denoising Algorithms
- PSNR (Peak Signal-to-Noise Ratio): A common algorithm to compare denoised image to the original images.
- SSIM (Structural Similarity Index): an algorithm to compare denoised image to original images that reflects the human perception.
- FLIP (Flicker Perception): Evaluates the smoothness of a video or image sequence by focusing on flicker or temporal artifacts.
- Using Neural Networks.
- No-reference assessment: Effectiveness measured by image denoising by original image is unknown.
Image Resizing and Applications
- Image resizing adjusts the dimensions of an image, with different reasons for doing so.
- Reasons:
- Match output device resolution (printer, screen)
- Reduce file size for storage/transmission
- Optimize algorithm speed
- Find objects at different scales (e.g., face detection)
- Advanced image editing like seamless blending.
Image Resizing Techniques
- Upsampling: Using interpolation for enlarging images.
- Downsampling: Using decimation for shrinking images.
- Multi-Resolution Pyramids: For a structured set of resized images.
Interpolation for Upsampling
- To enlarge an image, an interpolation kernel convolved with the image that's filling in the blanks in a grid of pixels.
- Types of Kernels:
- Linear: Simple but can create jagged edges.
- Bicubic: Common choice, smoother results.
- Windowed Sinc: Highest quality, but can introduce ringing.
Decimation or Downsampling Images
- To reduce image resolution by decimation.
- Decimation Process:
- Convolution with a low-pass filter avoids aliasing caused by using high-frequency details.
- Convolution is evaluated at every sample to optimize computation.
Common Filters for Decimation
- Common downsampling filters ( r=2):
- Linear, Binomial, Cubic: From simple to complex filters.
- Tradeoffs with the use of algorithms:
- Binomial leaves some aliasing, but is better compared to the use of Linear.
- Cubic (a=-1) helps suppresses aliasing while causing ringing.
- Advanced Filters
- QMF-9, JPEG2000 filters for specific tasks
Multi-Resolution Representations
- Multi-Resolution Analysis
- The understanding of signals and images at different scales.
- Varying Scales: analysis of large-scale features (overall shapes) with fine-scale details (edges and textures).
- Applications:
- Image Compression: Store efficiently images with important details on across the scales.
- Feature Detection: Detects key points/regions which are also key points that are informative in the images sizes and how they can be resized.
Image Pyramids Overview
- Structure: A hierarchical series of images, where each level is a lower-resolution version of one before it.
- Progressive Resolution Reduction
- common, creating with ¼ pixels a level to another.
- Filtering: Crucial to avoid aliasing artifacts when downsampling.
- Types: -Gaussian Pyramids: Reapeating process of smoothing and downsampling. -Laplacian Pyramids: Stores detail differences using layers from smoothing, allows reconstruction. -Wavelet Pyramids: Captures directional and detail of image, using different resolutions.
Decimation, Interpolation and Pyramids
- Downsampling using a pyramid and upsampling during reconstruction.
- Crucial the processes of using the filters for both are of quality!
Gaussian Pyramid
- Construction:
- Repeated blurring of the image with a Gaussian filter.
- Downsample to create subsequent levels by a amount of 2.
- Binomial Filter:
- Good for balancing an approximation through efficiency and simplicity through good Gaussian blur.
- Applications:
- Foundation for feature detection and other feature pyramids
Laplacian Pyramid
- Storage:
- Storing each layer using the Gaussian levels
- Construction:
- Create a lower res Gaussian level
- Subtracting it from the higher resolution to get the Laplacian image.
- Perfect Reconstruction:
- Laplacian with the smallest Gaussian will reconstruct the original image.
Frequency Response
- Filters
- Image frequency is affected using certain filter to analyze.
- Sharp Cutoff vs Aliasing:
- Ideal filters can be cut off and are harder. Simplers can be cut shorter and harder.
- Applications
- Dictate the choices while the artifacts sensitivity in the image based on computations.
Image Pyramids
- Applications
- Coarse to Fire: Locate objects through refining the process
- Multi resolution: Use of various blending that can produce high resolutions
- Fractional scaling without blockiness (MIP)
- Medical Whole Slide Imaging
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.