| |
| |
Preface | |
| |
| |
Foreword | |
| |
| |
| |
Reducing Discontinuities at Synthesis Time for Corpus-Based Speech Synthesis | |
| |
| |
| |
Introduction | |
| |
| |
Shift-Only F0 Smoothing | |
| |
| |
Improving Quality of MBROLA Synthesis | |
| |
| |
Evaluation | |
| |
| |
Discussions and Conclusion | |
| |
| |
Bibliography | |
| |
| |
| |
Voice Quality Variation in a Long-Term Recording of a Single Speaker Speech Corpus | |
| |
| |
| |
Introduction | |
| |
| |
Perceptual Experiment | |
| |
| |
Factors of Voice Quality Variation | |
| |
| |
Candidates of Acoustic Correlates | |
| |
| |
Prediction of Voice Quality Difference Scores | |
| |
| |
Summary | |
| |
| |
Bibliography | |
| |
| |
| |
Join Cost for Unit Selection Speech Synthesis | |
| |
| |
| |
Introduction | |
| |
| |
Previous Work | |
| |
| |
Spectral Distances | |
| |
| |
Perceptual Listening Tests | |
| |
| |
Results and Discussion | |
| |
| |
Conclusions | |
| |
| |
Bibliography | |
| |
| |
| |
Articulatory Modeling: A Role in Concatenative Text to Speech Synthesis | |
| |
| |
| |
Introduction | |
| |
| |
Articulatory Modeling | |
| |
| |
Rule-Based Control of the Parameters | |
| |
| |
Concatenative Articulatory Synthesis | |
| |
| |
Concluding Remarks | |
| |
| |
Bibliography | |
| |
| |
| |
Minimizing The Amount of Pitch Modification in Speech Synthesis | |
| |
| |
| |
Introduction | |
| |
| |
Speech Corpus Analysis | |
| |
| |
Text Corpus Analysis | |
| |
| |
Perceptual Experiment | |
| |
| |
Conclusion | |
| |
| |
Bibliography | |
| |
| |
| |
The Use of Speech Recognition Technology in Speech Synthesis | |
| |
| |
| |
Introduction | |
| |
| |
Speech Recognition | |
| |
| |
ASR in Synthesis | |
| |
| |
Limitations | |
| |
| |
Speculations | |
| |
| |
Bibliography | |
| |
| |
| |
An HMM-Based Approach to Multilingual Speech Synthesis | |
| |
| |
| |
Introduction | |
| |
| |
HMM-Based Speech Synthesis System | |
| |
| |
F0 Pattern Modeling by HMM | |
| |
| |
Speech-Parameter Generation from an HMM | |
| |
| |
Implementation on Festival Architecture | |
| |
| |
Discussion | |
| |
| |
Conclusion | |
| |
| |
Bibliography | |
| |
| |
| |
Prosody Control For HMM-Based Japanese TTS | |
| |
| |
| |
Introduction | |
| |
| |
Outline of HMM-Based TTS System | |
| |
| |
Prosody Generation Using the Quantification Theory (Type 1) | |
| |
| |
Speech-Rate-Variable Synthesis Method | |
| |
| |
Conclusions | |
| |
| |
Bibliography | |
| |
| |
| |
Synthesizing Expressive Speech Overview: Challenges, and Open Questions | |
| |
| |
| |
Introduction | |
| |
| |
Theories of Emotion | |
| |
| |
Dimensions of Emotional Space | |
| |
| |
Speech Synthesis Methods | |
| |
| |
Emotional Speech Data Collection | |
| |
| |
Experimental Evaluation of Expressive Speech | |
| |
| |
Presentation of Results From Case Studies | |
| |
| |
Conclusion | |
| |
| |
Open Questions and Future Directions | |
| |
| |
Bibliography | |
| |
| |
| |
Unit Selection Synthesis of Prosody: Evaluation Using Diphone Transplantation | |
| |
| |
Romain Prudon, Christophe D'Alessandro and Philippe Boula de Mare uil | |
| |
| |
Introduction | |
| |
| |
Co | |