Eye Pupil Measurement via Webcam: A Novel Method

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

Why is pupil diameter measurement considered vital in behavioral research?

  • It provides insights into physiological and psychological states, such as stress levels and cognitive load. (correct)
  • It is essential for diagnosing vision-related ailments.
  • It helps in determining the physical fitness of individuals.
  • It allows for accurate calculation of a person's age.

What challenge does using webcams pose in the accurate measurement of pupil diameter, as mentioned in the abstract?

  • Webcams require specialized software for operation.
  • Webcams do not provide color images, making analysis tricky.
  • Webcams can be expensive for widespread use.
  • Webcams typically have lower resolution compared to specialized pupillometry equipment. (correct)

In the context of pupil diameter estimation, what is the primary role of upscaling techniques?

  • To convert color images to grayscale for easier analysis
  • To enhance measurement precision by improving the resolution of eye images (correct)
  • To correct distortions caused by webcam lenses
  • To reduce the computational resources required for image processing

Which of the following is a key contribution of the thesis regarding accessibility and analysis options?

<p>Creating an application that allows estimation of pupil diameter through standard webcams with user-friendly visualizations (B)</p> Signup and view all the answers

According to the 'Declaration of Authorship,' which of the following principles was confirmed?

<p>The work was done solely while in candidature for a research degree at the University, and all sources are clearly attributed. (A)</p> Signup and view all the answers

Which equipment, traditionally used for accurate pupillometry, is mentioned as requiring specialized features and being traditionally expensive?

<p>Tobii eye-trackers and Pupillabs glasses (D)</p> Signup and view all the answers

In data collection for EyeDentify, what was the role of ChameleonView?

<p>To provide a user interface for webcam-based data recording (B)</p> Signup and view all the answers

Why was it important to change the background color of the computer screen during data collection?

<p>To collect diverse pupil diameter sizes (A)</p> Signup and view all the answers

To prepare the EyeDentify dataset, what steps were taken after aligning recordings?

<p>Cropping the eye regions to focus on relevant features (A)</p> Signup and view all the answers

What is Eye Aspect Ratio (EAR) used for in the data preprocessing pipeline?

<p>Detecting and excluding frames where participants were blinking (A)</p> Signup and view all the answers

How did the researchers address the challenge posed by lower-quality images from webcams?

<p>By applying image super-resolution (SR) techniques (C)</p> Signup and view all the answers

Among the SR models used, which were specifically designed for face restoration and enhancement?

<p>GFPGAN and CodeFormer (B)</p> Signup and view all the answers

What was the benefit of applying SR before blink detection, rather than after?

<p>Improved performance of blink detection due to enhanced image quality (A)</p> Signup and view all the answers

What type of data was collected, in addition to pupil diameter and webcam images?

<p>Depth maps using DepthAnythingV2 model (A)</p> Signup and view all the answers

Which data augmentation techniques were applied to prepare the datasets?

<p>Super-Resolution using bicubic interpolation and five SR models (D)</p> Signup and view all the answers

What is the primary goal of integrating image SR into the pupil diameter estimation workflow?

<p>To improve the quality of input data and enable more accurate analyses (A)</p> Signup and view all the answers

Aside from improving diagnosis of medical conditions, which use case is mentioned for gaze and pupil diameter estimation applications?

<p>Analyzing user behavior during web interactions (B)</p> Signup and view all the answers

Which condition is a limitation to prior work for measurement acquisition?

<p>The presence of two cameras. (C)</p> Signup and view all the answers

What is emphasized to improve ground truth values for future works?

<p>Use an eye-tracker that doesn't obscure the face. (A)</p> Signup and view all the answers

What data is presented for end users in the final interactive format?

<p>EAR and differences in Pupil Diameters. (D)</p> Signup and view all the answers

What do users rate for the overall experience of pupilSense?

<p>All are rated. (A)</p> Signup and view all the answers

What does Pupil Sense ensure, to protect the privacy of end users?

<p>Does not store user data. (A)</p> Signup and view all the answers

Which of the following statements best describes a limitation of the EyeDentify dataset?

<p>It mainly consists of participants without eyeglasses, impacting general applicability. (D)</p> Signup and view all the answers

Regarding enhancing the model precision, what can be implemented to create iris segments?

<p>Image Processing Segmentation Masks. (C)</p> Signup and view all the answers

For real-time capabilities in the web application, what models could integrated to be more efficient in those lower resources?

<p>Those from local devices after the models have been trained. (A)</p> Signup and view all the answers

Which areas are identified for future development within PupilSense, the web application?

<p>Real-time capabilities and integration with medical diagnostic tools. (D)</p> Signup and view all the answers

What technique does not mention to improve features on the webcam imagery?

<p>Increasing sensor size on camera. (A)</p> Signup and view all the answers

What technology was used in the app to create the data?

<p>Tobii eye tracker and Microsoft Surface Studio 1 (B)</p> Signup and view all the answers

What is one demographic details of the participants in the study?

<p>All were below 40 years in age (D)</p> Signup and view all the answers

What was used to optimize learning?

<p>AdamW optimizer (A)</p> Signup and view all the answers

To have a more diverse study, how can it be modified?

<p>Have different camera brands (D)</p> Signup and view all the answers

How does integrating personalized profiles with large language models affect results?

<p>Will generate detailed cognitive and behavioral reports. (A)</p> Signup and view all the answers

Which type of study was done to analyze the validity of the app?

<p>Did a Survey (D)</p> Signup and view all the answers

Where does the web app launch?

<p>Hugging Face Spaces. (B)</p> Signup and view all the answers

Does the creative commons liscense used prevent what activity?

<p>Commercial adaptation. (A)</p> Signup and view all the answers

Which of the following was not a recommendation from survey feedback?

<p>Ensure that the app required external hardware. (C)</p> Signup and view all the answers

What can researchers further accomplish in future studies?

<p>Transmitting fully cropped eye data to separate models. (C)</p> Signup and view all the answers

What type of environment would enhance prediction of pupil measurement and overall model performance?

<p>Account to the lighting. (A)</p> Signup and view all the answers

Flashcards

Pupil Diameter Estimation

Estimating pupil diameter from webcam images using super-resolution techniques, making pupillometry more accessible.

Super-Resolution (SR)

Techniques to enhance image resolution beyond the sensor's physical limitations.

GFPGAN

A type of AI model designed specifically to enhance facial features in images.

CodeFormer

A face-oriented VQ-VAE based model.

Signup and view all the flashcards

Leave One Participant Out Cross Validation (LOPOCV)

A technique where one participant is excluded during training to evaluate model performance on unseen data.

Signup and view all the flashcards

Eye Aspect Ratio (EAR)

Ratio of vertical to horizontal distances of eye landmarks, used for blink detection.

Signup and view all the flashcards

WebGazer

A model that provides real-time gaze-tracking capabilities for web environments without specialized hardware.

Signup and view all the flashcards

Pupil diameter

The diameter estimation of pupils is a psychophysiological indicator and a viable measure of appraisal levels for video game play.

Signup and view all the flashcards

Mean Absolute Error (MAE)

A metric that quantifies the average magnitude of error between predicted and actual values, without considering their direction.

Signup and view all the flashcards

SearchGazer

Tool for user behavior understanding in search tasks.

Signup and view all the flashcards

ChameleonView

This can be useful to capture timestamp and records videos to ensure a diversity of pupil diameters.

Signup and view all the flashcards

Data Preprocessing

The process of aligning data from different devices by synchronizing the video and Tobii eye-tracker data.

Signup and view all the flashcards

EyeDentify

This dataset has an open-source for webcam-based pupil diameter estimation. It can be found on Github.

Signup and view all the flashcards

EyeDentify++

Application use for training a model to accurately determine the measure of pupil diameter in everyday settings.

Signup and view all the flashcards

PupilSense

It is a novel and interactive web app that helps the user to analyze pupil diameters from webcams and phone cameras.

Signup and view all the flashcards

Study Notes

  • Estimation of eye pupil diameters aids insights into psychological states
  • Accurate pupil measurement is traditionally done via specialized eye trackers and glasses
  • A new method has been created using inexpensive webcam images and super-resolution
  • The method aims to make measuring eye pupils more accessible
  • Upscaling techniques range from bicubic interpolation to CodeFormer
  • Accuracy hinges on selection of upscaling methods and scaling factor
  • An application provides visualization via heatmaps, graphs, and blink ratios

Declaration of Authorship

  • States all work is the authors
  • Cites prior work of others, all sources are given.

Acknowledgements

  • Thanks go to Prof. Dr. Andreas Dengel
  • Thanks go to Brian Moser and Ko Watanabe

Measuring pupil diameter

  • Measures stress levels
  • Gauges cognitive load
  • Acts as a valuable tool in behavioral research
  • Related to short and long term memory processes of the human brain

Neurological conditions

  • Alzheimer's
  • Schizophrenia
  • Parkinson's
  • Opioid use
  • Mild cognitive impairment

Common eye monitoring methods

  • Drowsiness detection via blink detection
  • User interface optimization using gaze estimation

Key Takeaways

  • Diverse eye colors, user effort
  • Position constraints and technical skills needed limit broader use
  • Applications and devices with pupil size estimation can advance analysis

Challenges

  • Ground truth data collection
  • Sufficient diversity in the dataset
  • Pupil Diameter Prediction
  • Image Scale for Prediction
  • User-Friendly System Development

Contributions

  • ChameleonView, assists research of pupil diameters. During recordings screen alters to guarantee variety
  • EyeDentify: Ground truth measurements of pupil captured alongside face video, yielding data from 51 participants
  • EyeDentify++: Applying image SR to enhance images scaled 2x and 4x
  • Deep learning models were assessed, split and tested to estimate diameters of both eyes
  • Web application PupilSense created for public use

Gaze estimation datasets

  • Various gaze estimation, blink detection, as well as facial image data have been collected, these lack data on pupil diameters

Pupil diameter datasets

  • Data was collected from children with ADHD, this data set included only basic sensor values.
  • Videos were recorded under light variation, also only 16 subjects had their pupil diameter quantified equivalent to 5mm.

Pupil Diameter Estimation Methods

  • BINOMAP method uses slave cameras as a binocular geometric constraint for gaze images.
  • CNN can operate on mobile systems for real-time assessment.
  • algorithm PuReST, rapidly tracks pupils via head mounted eye trackers
  • Yolo algorithm is often implemented

Super-Resolution as Pre-Processing

  • Enhances medical imaging
  • Provides more detailed geography insights
  • Improves smartphones and TVs
  • Enhances Smart-phone resolution significantly
  • Pre-trained SR models is similar to image and object recognition

Dataset creation

  • Includes hardware, data collection procedure, along with SR dataset
  • Tobii eye-tracker is used to gather pupil diameter
  • Microsoft Surface Studio 1 used for face and video recordings

Data collection application

  • Specialized web application known as ChameleonView
  • User friendly, enables webcam-based data
  • Color changes capture different diameter sizes of the pupil

Data Preprocessing

  • Webcam videos are combined with eye-tracking data
  • Alignment
  • Cropping the eyes

Aligning the video and pupil Diameter data

  • CSV files are generated from the eye-tracker
  • Records Diameters from pupils as well as gaze positions
  • Used timestamp CSV file for webcam recording information
  • Frames extracted are aligned with data points

Cropping the eyes

  • Involves cropping the eyes from each extracted frame
  • Landmark localization is completed using mediapipe, identifies facial features such as the eyes, nose and mouth to map face
  • Eliminates the the effects of blinking

Super resolution

  • Super resolution enhances images by a factor of 2 or x4
  • A range of image SR approaches were employed to improve the quality of eye images
  • Creates HR versions of the Low Resolution webcam images
  • Eye regions are cropped before blink detection.

SR Techniques

  • SRResNet is an method inspired by ResNet
  • CodeFormer is a model based on VQ-VAE
  • Model testing involves models known as GFPGAN, HAT, and Real-ESRGAN

Dataset details

  • 51 participants
  • Includes mixed genders and various ages
  • Consists of a final count of 212073, removing those winking
  • Dataset is made fully available.

Dataset distribution

  • Includes multi display colors
  • A web application known as ChameleonView records synchronized webcam, with a Tobii eye-tracker aligning at 90hertz

Dataset variation

  • Allows for spontaneous movement by participants
  • Captures a variety of different lighting conditions
  • Captures a range of physical characteristics between different people

Training and evaluation

  • Deep learning was used to train and evaluate new models
  • Models were trained across the no-SR data, 2x data and 4x data

Models details

  • ResNet employed as a base model
  • With each ResNet architecture, custom linear layer was added

Dataset splits

  • Effective dataset splitting is required to ensure the reliability of such models
  • Implemented two methods of validation and one cross validation

Training details

  • Trained using both cross validation strategies from the ground up
  • With a batch of 128 images from separate left and right eyes
  • Utilized the AdamW Optimizer

Results overview

  • Upscaling improves accuracy for prediction
  • SR is also effective in boosting element detail

Results: Scale Sensitivity

  • A complex relationship can be seen between the upscaling factor
  • Upscaling factor and models to generalize that not all are straight cut with scale and SR methods

Results: Potential Overfitting

  • Overfitting is possible depending on the SR approaches used
  • Some test configurations did not perform as strongly when assessed, model becomes too similar to training visuals

Results: Best Models

  • Upsampling is the solution with optimal results, works with both left and right eye estimations for models
  • Quantitative data shown that pre-trained models as compared can assess pupil diameter and provide error scores based on the SR method employed

Quantitative results

  • LOPOCV tested again with a resnet model and left and right eyes recorded separately
  • Class Activation Maps( CAM) show a singular participant with distinct colors across datasets
  • CAM visualizations tested, including how lighting can effect accuracy

PupilSense Web-app details

  • Web app, which estimates pupil size from images
  • It presents pupil measurements
  • It gives comprehensive recording analysis as well as maps to show model influence in the output metrics
  • Creates and includes line charts illustrating pupil sizes

User Experience Demographics

  • The survey was split into sections with users and their experience in mind

Overall experience

  • Focused on application function, design, usability

Technical features and trustworthiness

  • Advanced capabilities, ability to integrate diagnostic data

User Demographics

  • Gathers quality input regarding users overall reaction

Survey Results

  • The survey was quite diverse in opinion, with various views of background knowledge on both eye tracking and Pupil metric study
  • Most found it user friendly
  • Processing times also are of note to users
  • Blinks, model action was well classified in surveys
  • App usage was given high scores overall
  • Some results are skewed or unnecessary

Overal Insights from feedback

  • Feedback suggest that PupilSense was well designed, its strengths reside with its upload media
  • However, feedback showed that most wanted faster times as well more precise explanation of data

Limitations and Shortcomings

  • Primarily recorded w/o eyeglasses or medical conditions
  • Prediction challenges of the study include data regarding diverse groups and lighting changes
  • The web app created has limited time ability while also being prediction based alone

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Use Quizgecko on...
Browser
Browser