| |
| |
| |
Introduction | |
| |
| |
Discrete-Time Speech Signal Processing | |
| |
| |
The Speech Communication Pathway | |
| |
| |
Analysis/Synthesis Based on Speech Production and Perception | |
| |
| |
Applications | |
| |
| |
Outline of Book | |
| |
| |
| |
A Discrete-Time Signal Processing Framework | |
| |
| |
Discrete-Time Signals | |
| |
| |
Discrete-Time Systems | |
| |
| |
Discrete-Time Fourier Transform | |
| |
| |
Uncertainty Principle | |
| |
| |
z-Transform | |
| |
| |
LTI Systems in the Frequency Domain | |
| |
| |
Properties of LTI Systems | |
| |
| |
Time-Varying Systems | |
| |
| |
Discrete-Fourier Transform | |
| |
| |
Conversion of Continuous Signals and Systems to Discrete Time | |
| |
| |
| |
Production and Classification of Speech Sounds | |
| |
| |
Anatomy and Physiology of Speech Production | |
| |
| |
Spectrographic Analysis of Speech | |
| |
| |
Categorization of Speech Sounds | |
| |
| |
Prosody: The Melody of Speech | |
| |
| |
Speech Perception | |
| |
| |
| |
Acoustics of Speech Production | |
| |
| |
Physics of Sound | |
| |
| |
Uniform Tube Model | |
| |
| |
A Discrete-Time Model Based on Tube Concatenation | |
| |
| |
Vocal Fold/Vocal Tract Interaction | |
| |
| |
| |
Analysis and Synthesis of Pole-Zero Speech Models | |
| |
| |
Time-Dependent Processing | |
| |
| |
All-Pole Modeling of Deterministic Signals | |
| |
| |
Linear Prediction Analysis of Stochastic Speech Sounds | |
| |
| |
Criterion of Goodness | |
| |
| |
Synthesis Based on All-Pole Modeling | |
| |
| |
Pole-Zero Estimation | |
| |
| |
Decomposition of the Glottal Flow Derivative | |
| |
| |
| |
Properties of Stochastic Processes | |
| |
| |
Random Processes | |
| |
| |
Ensemble Averages | |
| |
| |
Stationary Random Process | |
| |
| |
Time Averages | |
| |
| |
Power Density Spectrum | |
| |
| |
| |
Derivation of the Lattice Filter in Linear Prediction Analysis | |
| |
| |
| |
Homomorphic Signal Processing | |
| |
| |
Concept | |
| |
| |
Homomorphic Systems for Convolution | |
| |
| |
Complex Cepstrum of Speech-Like Sequences | |
| |
| |
Spectral Root Homomorphic Filtering | |
| |
| |
Short-Time Homomorphic Analysis of Periodic Sequences | |
| |
| |
Short-Time Speech Analysis | |
| |
| |
Analysis/Synthesis Structures | |
| |
| |
Contrasting Linear Prediction and Homomorphic Filtering | |
| |
| |
| |
Short-Time Fourier Transform Analysis and Synthesis | |
| |
| |
Short-Time Analysis | |
| |
| |
Short-Time Synthesis | |
| |
| |
Short-Time Fourier Transform Magnitude | |
| |
| |
Signal Estimation from the Modified STFT or STFTM | |
| |
| |
Time-Scale Modification and Enhancement of Speech | |
| |
| |
| |
FBS Method with Multiplicative Modification | |
| |
| |
| |
Filter-Bank Analysis/Synthesis | |
| |
| |
Revisiting the FBS Method | |
| |
| |
Phase Vocoder | |
| |
| |
Phase Coherence in the Phase Vocoder | |
| |
| |
Constant-Q Analysis/Synthesis | |
| |
| |
Auditory Modeling | |
| |
| |
| |
Sinusoidal Analysis/Synthesis | |
| |
| |
Sinusoidal Speech Model | |
| |
| |
Estimation of Sinewave Parameters | |
| |
| |
Synthesis | |
| |
| |
Source/Filter Phase Model | |
| |
| |
Additive Deterministic-Stochastic Model | |
| |
| |
| |
Derivation of the Sinewave Model | |
| |
| |
| |
Derivation of Optimal Cubic Phase Parameters | |
| |
| |
| |
Frequency-Domain Pitch Estimation | |
| |
| |
A Correlation-Based Pitch Estimator | |
| |
| |
Pitch Estimation Based on a Comb Filter<170 | |
| |
| |
Pitch Estimation Based on a Harmonic Sinewave Model | |
| |
| |
Glottal Pulse Onset Estimation | |
| |
| |
Multi-Band Pitch and Voicing Estimation | |
| |
| |
| |
Nonlinear Measurement and Modeling Techniques | |
| |
| |
The STFT and Wavelet Transform Revisited | |
| |
| |
Bilinear Time-Frequency Distributions | |
| |
| |
Aeroacoustic Flow in the Vocal Tract | |
| |
| |
Instantaneous Teager Energy Operator | |
| |
| |
| |
Speech Coding | |
| |
| |
Statistical Models of Speech | |
| |
| |
Scaler Quantization | |
| |
| |
Vector Quantization (VQ) | |
| |
| |
Frequency-Domain Coding | |
| |
| |
Model-Based Coding | |
| |
| |
LPC Residual Coding | |
| |
| |
| |
Speech Enhancement | |
| |
| |
Introduction | |
| |
| |
Preliminaries | |
| |
| |
Wiener Filtering | |
| |
| |
Model-Based Processing | |
| |
| |
Enhancement Based on Auditory Masking | |
| |
| |
| |
Stochastic-Theoretic parameter Estimation | |
| |
| |
| |
Speaker Recognition | |
| |
| |
Introduction | |
| |
| |
Spectral Features for Speaker Recognition | |
| |
| |
Speaker Recognition Algorithms | |
| |
| |
Non-Spectral Features in Speaker Recognition | |
| |
| |
Signal Enhancement for the Mismatched Condition | |
| |
| |
Speaker Recognition from Coded Speech | |