000 03738nam a22005775i 4500
001 978-981-99-0827-1
003 DE-He213
005 20240423125517.0
007 cr nn 008mamaa
008 230529s2023 si | s |||| 0|eng d
020 _a9789819908271
_9978-981-99-0827-1
024 7 _a10.1007/978-981-99-0827-1
_2doi
050 4 _aQA76.9.N38
072 7 _aUYQL
_2bicssc
072 7 _aCOM073000
_2bisacsh
072 7 _aUYQL
_2thema
082 0 4 _a006.35
_223
100 1 _aTan, Xu.
_eauthor.
_4aut
_4http://id.loc.gov/vocabulary/relators/aut
245 1 0 _aNeural Text-to-Speech Synthesis
_h[electronic resource] /
_cby Xu Tan.
250 _a1st ed. 2023.
264 1 _aSingapore :
_bSpringer Nature Singapore :
_bImprint: Springer,
_c2023.
300 _aXXV, 201 p. 24 illus. in color.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
490 1 _aArtificial Intelligence: Foundations, Theory, and Algorithms,
_x2365-306X
505 0 _aChapter 1. Introduction -- Part 1. Preliminary -- Chapter 2. Basics of Spoken Language Processing -- Chapter 3. Basics of Deep Learning -- Part 2. Key Components in TTS -- Chapter 4. Text Analyses -- Chapter 5. Acoustic Models -- Chapter 6. Vocoders -- Chapter 7. Fully End-to-End TTS -- Part 3. Advanced Topics in TTS -- Chapter 8. Expressive and Controllable TTS -- Chapter 9. Robust TTS -- Chapter 10. Model-Efficient TTS -- Chapter 11. Data-Efficient TTS -- Chapter 12. Beyond Text-to-Speech Synthesis -- Part 4. Summary and Outlook -- Chapter 13. Summary and Outlook.
520 _aText-to-speech (TTS) aims to synthesize intelligible and natural speech based on the given text. It is a hot topic in language, speech, and machine learning research and has broad applications in industry. This book introduces neural network-based TTS in the era of deep learning, aiming to provide a good understanding of neural TTS, current research and applications, and the future research trend. This book first introduces the history of TTS technologies and overviews neural TTS, and provides preliminary knowledge on language and speech processing, neural networks and deep learning, and deep generative models. It then introduces neural TTS from the perspective of key components (text analyses, acoustic models, vocoders, and end-to-end models) and advanced topics (expressive and controllable, robust, model-efficient, and data-efficient TTS). It also points some future research directions and collects some resources related to TTS. This book is the first to introduceneural TTS in a comprehensive and easy-to-understand way and can serve both academic researchers and industry practitioners working on TTS.
650 0 _aNatural language processing (Computer science).
650 0 _aSpeech processing systems.
650 0 _aSignal processing.
650 0 _aMachine learning.
650 0 _aArtificial intelligence.
650 1 4 _aNatural Language Processing (NLP).
650 2 4 _aSpeech and Audio Processing.
650 2 4 _aMachine Learning.
650 2 4 _aArtificial Intelligence.
710 2 _aSpringerLink (Online service)
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9789819908264
776 0 8 _iPrinted edition:
_z9789819908288
776 0 8 _iPrinted edition:
_z9789819908295
830 0 _aArtificial Intelligence: Foundations, Theory, and Algorithms,
_x2365-306X
856 4 0 _uhttps://doi.org/10.1007/978-981-99-0827-1
912 _aZDB-2-SCS
912 _aZDB-2-SXCS
942 _cSPRINGER
999 _c178717
_d178717