Acoustic Processing of Speech Signals

Questions and Answers

What characteristic of vowels can be observed in a waveform?

Short duration and low amplitude
Long duration and relatively loud (correct)
Irregular pattern and low amplitude
Consistent pattern and high frequency

How can fricatives be identified in a waveform?

They produce an intense irregular pattern (correct)
They have a constant amplitude over time
They appear as smooth, consistent waves
They are characterized by high-frequency tones

Which of the following accurately describes spectral features?

They cannot represent phonetic features
They provide a less detailed classification than waveforms
They require only visual inspection for analysis
They interpret a complex wave as a sum of simpler waves (correct)

What does the repeated wave in the diagram represent in terms of frequency?

It has a frequency of about 250 Hz (D)

What can be inferred about the smaller repeated wave in relation to the larger wave?

It has a frequency approximately four times that of the larger wave (A)

Which type of software was mentioned for creating spectrograms?

Gram software (C)

In a waveform, what does high amplitude indicate?

A high volume sound (A)

Which phonetic feature would not be evident in a waveform without additional spectral analysis?

Tone quality (C)

What two characteristics are most important in analyzing a wave?

Frequency and amplitude (C)

If there are 28 repetitions of a wave captured in 0.11 seconds, what is the frequency in Hertz?

255 Hz (B)
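
The arithmetic behind this answer is frequency = number of cycles / duration in seconds. A minimal sketch, assuming a hypothetical helper named `frequency_hz` (not part of the lesson):

```python
def frequency_hz(cycles: float, duration_s: float) -> float:
    """Frequency in Hertz: how many cycles occur per second."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    return cycles / duration_s


# 28 repetitions captured in 0.11 seconds, as in the question above.
f = frequency_hz(28, 0.11)
print(f"{f:.1f} Hz")  # 254.5 Hz, which rounds to 255 Hz
```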

What does a high amplitude in a waveform indicate?

Higher than normal air pressure (A)

How is the perceptual correlate of frequency described?

Sound pitch (B)

What does the zero value on the vertical axis of a waveform represent?

Normal atmospheric pressure (D)

What are spectral features used to represent in acoustic processing of speech?

The distribution of frequencies in a waveform (A)

What relationship between amplitude and loudness is described?

Non-linear relationship (C)

What does LPC stand for in the context of speech signal processing?

Linear Predictive Coding (C)

Why is analyzing waveforms important in understanding speech?

They contain information to transcribe speech (A)

In acoustic processing, what is the primary role of feature extraction?

To summarize and represent time slices of a speech signal (C)

How are sound waves represented in signal analysis?

By a graph depicting air pressure changes over time (C)

If a sound has a lower frequency, how is its pitch perceived?

As lower (C)

What is the significance of analyzing a waveform in acoustic processing?

It enables the interpretation of sound frequencies (D)

What role do dialogue agents play in speech processing?

They manage conversation flow and context (C)

What do changes in air pressure represent in the context of speech recognition?

The sound waves generated by a speaker (D)

Which of the following tools is commonly used for parsing in speech recognition?

Prolog (D)

What is the frequency of the first formant (F1) for the vowel [iy]?

540 Hz (A)

What do dark bars on a spectrogram typically represent?

Spectral peaks of vowels (C)

Which frequency is associated with the second formant (F2) for the vowel [iy]?

2581 Hz (A)

What primarily causes the differences in formant frequencies across vowels?

Size of the oral cavity and tongue position (D)

Which of the following phones can be identified using formants?

Nasal phones, lateral phones, and rhotic sounds (A)

How do the formants differ between vowels such as [iy] and [ɒ]?

Both first and second formant frequencies are different (C)

What effect does moving the tongue have on vowel frequency production?

Creates resonant cavities that filter specific frequencies (B)

What role do formants play in vowel identification?

They are crucial for recognizing vowel identity (B)

What term describes the maximum frequency that can be measured based on the sampling rate?

Nyquist frequency (C)
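
The Nyquist frequency is half the sampling rate, because at least two samples per cycle are needed to measure a frequency. A minimal sketch, assuming a hypothetical helper named `nyquist_frequency`:

```python
def nyquist_frequency(sampling_rate_hz: float) -> float:
    """Highest frequency measurable at the given sampling rate."""
    if sampling_rate_hz <= 0:
        raise ValueError("sampling_rate_hz must be positive")
    return sampling_rate_hz / 2


# Two sampling rates that appear elsewhere in this quiz.
for rate in (8000, 16000):
    print(f"{rate} Hz sampling -> Nyquist frequency {nyquist_frequency(rate):.0f} Hz")
```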

Which of the following is NOT a step in the analogue-to-digital conversion process?

Transmission (B)

How many amplitude measurements are required per second for a sampling rate of 8,000 Hz?

8,000 (B)

What is the consequence of having less than two samples per cycle during the sampling process?

Complete loss of frequency information (D)
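
One way to see why the true frequency is lost: with fewer than two samples per cycle, the samples of a high-frequency wave can coincide exactly with those of a lower-frequency wave. The sketch below uses illustrative values chosen for this example (a 4000 Hz sampling rate, and cosines at 3000 Hz and 1000 Hz); they are assumptions, not figures from the lesson.

```python
import math


def sample_cosine(freq_hz: float, sampling_rate_hz: float, num_samples: int) -> list[float]:
    """Sample cos(2*pi*f*t) at times t = n / sampling_rate."""
    return [
        math.cos(2 * math.pi * freq_hz * n / sampling_rate_hz)
        for n in range(num_samples)
    ]


SAMPLING_RATE = 4000  # Hz, so the Nyquist frequency is 2000 Hz

too_high = sample_cosine(3000, SAMPLING_RATE, 16)  # about 1.33 samples per cycle
in_range = sample_cosine(1000, SAMPLING_RATE, 16)  # 4 samples per cycle

# The two sampled sequences match, so the 3000 Hz wave cannot be told
# apart from a 1000 Hz wave once it has been sampled.
indistinguishable = all(abs(a - b) < 1e-9 for a, b in zip(too_high, in_range))
print(indistinguishable)  # True
```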

Which of the following sampling rates would be sufficient to capture the majority of human speech frequencies below 10,000 Hz?

16,000 Hz (B)

What integer size is typically used for quantisation in digital audio?

8-bit integers (A)

What is the purpose of quantisation in the context of digitising a waveform?

To represent real-valued numbers as integers (D)
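
Quantisation can be sketched as scaling each real-valued sample and rounding it to the nearest integer. The version below assumes samples are normalised to the range [-1.0, 1.0] and uses a simple symmetric scaling; the function name `quantise` and these conventions are assumptions for illustration, not the lesson's stated method.

```python
def quantise(sample: float, bits: int = 8) -> int:
    """Map a real-valued sample in [-1.0, 1.0] to a signed integer of `bits` bits."""
    max_int = 2 ** (bits - 1) - 1      # 127 for 8 bits, 32767 for 16 bits
    clipped = max(-1.0, min(1.0, sample))  # out-of-range samples are clipped
    return round(clipped * max_int)


print(quantise(1.0, bits=8))    # 127
print(quantise(-1.0, bits=8))   # -127
print(quantise(1.0, bits=16))   # 32767
```

More bits give finer amplitude steps, at the cost of more storage per sample.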

To digitise a sound wave effectively, how many samples should be taken for each cycle of the wave?

Two samples (C)

What is the approximate frequency of the tiny wave observed on the 1000 Hz waves?

2000 Hz (D)

What does the y-axis of a spectrum represent?

Magnitude of each frequency component (D)

Why is an LPC spectrum utilized in speech applications?

To simplify the analysis of frequency peaks (C)

Which of the following statements accurately describes a spectrogram?

It visually represents how frequency components change over time (B)

What characteristic does a spectrum help identify in sound waves?

Spectral signatures of different sounds (A)

What is the main function of the cochlea in human audition?

To compute a spectrum of incoming waveforms (C)

In a spectrogram, what does the darkness of a point indicate?

The loudness of the sound (A)

How do scientists use spectral information to analyze chemical elements?

By detecting wavelengths of light emitted when elements burn (C)

Flashcards

Speech Signal Analysis

A process of analyzing speech signals to extract meaningful information for computer processing, like speech recognition.

Feature Extraction

The process of selecting key characteristics (features) from a speech signal that help distinguish between sounds or words.

Fourier Analysis

A mathematical method for decomposing a complex signal into simpler sine and cosine waves, showing its frequency components.
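
As a rough illustration of this decomposition, the sketch below builds a wave from two sine waves and recovers their frequencies with a naive discrete Fourier transform. The 10 Hz and 100 Hz components, the 1000 Hz sampling rate, and the function name `dft_magnitudes` are all assumptions chosen for the example; practical systems would normally use a fast Fourier transform instead.

```python
import cmath
import math


def dft_magnitudes(samples: list[float]) -> list[float]:
    """Magnitude of each frequency bin from 0 up to the Nyquist bin (naive O(n^2) DFT)."""
    n = len(samples)
    magnitudes = []
    for k in range(n // 2 + 1):
        total = sum(
            x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(samples)
        )
        magnitudes.append(abs(total) / n)
    return magnitudes


SAMPLING_RATE = 1000  # Hz (assumed)
NUM_SAMPLES = 200     # 0.2 seconds of signal

# A complex wave: a 10 Hz sine plus a quieter 100 Hz sine.
signal = [
    math.sin(2 * math.pi * 10 * t / SAMPLING_RATE)
    + 0.5 * math.sin(2 * math.pi * 100 * t / SAMPLING_RATE)
    for t in range(NUM_SAMPLES)
]

mags = dft_magnitudes(signal)
bin_width_hz = SAMPLING_RATE / NUM_SAMPLES  # each bin covers 5 Hz

# The two largest bins correspond to the two component sine waves.
top_bins = sorted(range(len(mags)), key=lambda k: mags[k], reverse=True)[:2]
peak_freqs = sorted(k * bin_width_hz for k in top_bins)
print(peak_freqs)  # [10.0, 100.0]
```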

Linear Predictive Coding (LPC)

A method of feature extraction that models a speech signal's short-term characteristics by predicting each sample as a linear combination of the preceding samples.
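
The idea of linear prediction can be sketched with the autocorrelation method and the Levinson-Durbin recursion, which estimate coefficients a[1..p] such that x[n] is approximated by a[1]*x[n-1] + ... + a[p]*x[n-p]. This is a toy illustration under stated assumptions, not the lesson's implementation: the function names are hypothetical, and the input is a synthetic decaying signal rather than a windowed frame of real speech.

```python
def autocorrelation(x: list[float], max_lag: int) -> list[float]:
    """Autocorrelation r[0..max_lag] of a finite signal."""
    n = len(x)
    return [
        sum(x[i] * x[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def lpc_coefficients(x: list[float], order: int) -> list[float]:
    """Predictor coefficients a[1..order] via the Levinson-Durbin recursion."""
    r = autocorrelation(x, order)
    a = [0.0] * (order + 1)  # a[0] is unused
    error = r[0]             # prediction error energy so far
    for i in range(1, order + 1):
        # Reflection coefficient for this model order.
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / error
        updated = a[:]
        updated[i] = k
        for j in range(1, i):
            updated[j] = a[j] - k * a[i - j]
        a = updated
        error *= 1 - k * k
    return a[1:]


# Toy signal in which every sample is exactly 0.9 times the previous one.
toy_signal = [0.9 ** n for n in range(200)]
coeffs = lpc_coefficients(toy_signal, order=2)
print([round(c, 3) for c in coeffs])  # close to [0.9, 0.0]
```

The first coefficient recovers the 0.9 relationship between neighbouring samples, and the second is close to zero because one previous sample already predicts this signal.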