Springer Handbook of Speech Processing
Book 1
Book 2
Book 3
Book 1
Book 2
Book 3
Book 1
Book 2
Book 3
Book 1
Book 2
Book 3
Home > Computing and Information Technology > Computer science > Audio processing > Springer Handbook of Speech Processing
Springer Handbook of Speech Processing

Springer Handbook of Speech Processing


     0     
5
4
3
2
1



Available


X
About the Book

This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.

Table of Contents:
Foreword by J. L. Flanagan Chap. 1 Introduction to Speech Processing Part A: Production, Perception, and Modeling of Speech (M. M. Sondhi) Part A describes the contemporary views on phonatory and articulatory mechanisms of humans to illustrate the physiological processes of speech production. It also describes the nonlinear cochlear speech processing in auditory masking, the perception of speech and sound by humans, and various methods for speech quality assessment with a focus on standardized methods. Chap. 2 Physiological Processes of Speech Production Chap. 3 Nonlinear Cochlear Signal Processing and Masking in Speech Perception Chap. 4 Perception of Speech and Sound Chap. 5 Speech Quality Estimation Part B: Signal Processing for Speech (Y. Huang, J. Benesty) Part B gives a large number of signal processing concepts and algorithms that are widely used in speech processing and in the applications of speech. Chap. 6 Wiener and Adaptive Filters Chap. 7 Linear Prediction Chap. 8 Kalman Filter Chap. 9 Homomorphic Systems and Cepstrum Analysis of Speech Chap. 10 Pitch and Voicing Determination of Speech with an Extension Toward Music Signals Chap. 11 Formant Estimation and Tracking Chap. 12 The STFT, Sinusoidal Models, and Speech Modification Chap. 13 Adaptive Blind Multichannel Identification Part C: Speech Coding (W. B. Kleijn) Part C discusses the attributes of speech coders as well as the underlying principles that determine their behavior and architecture. Coders for both traditional and packet networks are discussed, as well as low-bit-rate speech coding, various speech coding standards, and perceptual audio coders. Chap. 14 Principles of Speech Coding Chap. 15 Voice over IP: Speech Transmission over Packet Networks Chap. 16 Low-Bit-Rate Speech Coding Chap. 17 Analysis-by-Synthesis Speech Coding Chap. 18 Perceptual Audio Coding of Speech Signals Part D: Text-to-Speech Synthesis (S. Narayanan) Part D presents different techniques for speech synthesis, including rule-based, corpus-based, and a combination of both. Linguistic analysis and prosodic processing, which are important parts of a text-to-speech (TTS) system, are reviewed. Other aspects of interest for TTS such as voice transformation and synthesis of expressive speech are also discussed. Chap. 19 Basic Principles of Speech Synthesis Chap. 20 Rule-Based Speech Synthesis Chap. 21 Corpus-Based Speech Synthesis Chap. 22 Linguistic Processing for Speech Synthesis Chap. 23 Prosodic Processing Chap. 24 Voice Transformation Chap. 25 Expressive/Affective Speech Synthesis Part E: Speech Recognition (L. Rabiner, B.-H. Juang) Part E describes the most important speech recognition technologies. The approach based on the powerful hidden Markov models is generously presented and some other promising approaches are outlined. The robustness issues concerning the acoustical environment are studied. Finally, several fundamental applications are also discussed. Chap. 26 Historical Perspective of the Field of ASR/NLU Chap. 27 HMMs and Related Speech Technologies Chap. 28 Speech Recognition with Weighted Finite-State Transducers Chap. 29 A Machine Learning Framework for Spoken-Dialog Classification Chap. 30 Towards Superhuman Speech Recognition Chap. 31 Natural Language Understanding Chap. 32 Transcription and Distillation of Spontaneous Speech Chap. 33 Environmental Robustness Chap. 34 The Business of Speech Technologies Chap. 35 Spoken Dialog Systems Part F: Speaker Recognition (S. Parthasarathy) Part F develops the field of speaker recognition. It covers text-dependent and text-independent speaker recognition and their applications. Chap. 36 Overview of Speaker Recognition Chap. 37 Text-Dependent Speaker Recognition Chap. 38 Text-Independent Speaker Recognition Part G: Language Recognition (C.-H. Lee) Part G provides an overview on principles of state-of-the-art language recognition approaches. Language characterization, identification, and modeling are addressed. Vector space characterization approaches to converting speech utterances into spoken document vectors for modeling and classification are also presented. Chap. 39 Principle of Spoken Language Recognition Chap. 40 Spoken Language Characterization Chap. 41 Automatic Language Recognition via Spectral and Token Based Approaches Chap. 42 Vector Based Spoken Language Classification Part H: Speech Enhancement (J. Chen, S. Gannot, J. Benesty) Part H develops all classical aspects of speech enhancement: noise reduction, dereverberation, echo cancellation, feedback control, and active noise control. Chap. 43 Fundamentals of Noise Reduction Chap. 44 Spectral Enhancement methods Chap. 45 Echo Cancellation Chap. 46 Dereverberation Chap. 47 Adaptive Beamforming and Postfiltering Chap. 48 Feedback Control in Hearing Aids Chap. 49 Active Noise Control Part I: Multichannel Speech Processing (J. Benesty, I. Cohen, Y. Huang) Part I presents modern aspects of multichannel processing, for acoustic scene analysis, speech acquisition and presentation, when a large number of microphones and loudspeakers are available. Chap. 50 Microphone Arrays Chap. 51 Time Delay Estimation and Source Localization Chap. 52 Convolutive Blind Source Separation Methods Chap. 53 Sound Field Reproduction About the Authors Subject Index

About the Author :
Jacob Benesty Jacob Benesty received the Masters degree in microwaves from Pierre & Marie Curie University, France, in 1987, and the Ph.D. degree in control and signal processing from Orsay University, France, in 1991. From January 1994 to July 1995, he worked at Telecom Paris University on multichannel adaptive filters and acoustic echo cancellation. From October 1995 to May 2003, he was first a Consultant and then a Member of the Technical Staff at Bell Laboratories, Murray Hill, NJ, USA. In May 2003, he joined the University of Quebec, INRS-EMT, in Montreal, Quebec, Canada, as a professor. His research interests are in signal processing, acoustic signal processing, and multimedia communications. Dr. Benesty received the 2001 Best Paper Award from the IEEE Signal Processing Society. He co-authored the books "Acoustic MIMO Signal Processing" (Springer-Verlag, Berlin, 2006) and "Advances in Network and Acoustic Echo Cancellation" (Springer-Verlag, Berlin, 2001). He is also a co-editor/co-author of four other books. M. M. Sondhi M. Mohan Sondhi is a consultant at Avaya Research Labs, Basking Ridge, New Jersey. Prior to joining Avaya he spent 39 years at Bell Labs, from where he retired in 2001. He holds undergraduate degrees in Physics and Electrical Communication Engineering, and M.S and Ph.D. degrees in Electrical Engineering. At Bell Labs he conducted research in speech signal processing, echo cancellation, acoustical inverse problems, speech recognition, articulatory models for analysis and synthesis of speech and modeling of auditory and visual processing by humans. He has authored or co-authored several book chapters, over 120 journal articles, 10 patents, and the book Advances in Network and Acoustic Echo Cancellation. He has co-edited the book Advances in Speech Signal Processing. He has been a Distinguished Lecturer of the ASSP society, and an Associate Editor of Trans. ASSP, and has been a visiting scientist at laboratories in Sweden, France, and Japan. He has been on the editorial board of the Journal Speech Communication, and has co-edited a special issue of the Transactions of the IEEE on Speech and Audio Processing, He is a Bell Labs Fellow, a co-recipient of a best paper award of the IEEE ASSP society, and a co-recipient of IEEE's E.E. Sumner award for 1998. Y. Huang Yiteng (Arden) Huang received the B.S. degree from the Tsinghua University in 1994, the M.S. and Ph.D. degrees from the Georgia Institute of Technology (Georgia Tech) in 1998 and 2001, respectively, all in electrical and computer engineering. During his doctoral studies from 1998 to 2001, he was a research assistant with the Center of Signal and Image Processing, Georgia Tech, and was a teaching assistant with the School of Electrical and Computer Engineering, Georgia Tech. In the summers from 1998 to 2000, he worked with Bell Laboratories, Murray Hill, NJ and engaged in research on passive acoustic source localization with microphone arrays. Upon graduation, he joined Bell Laboratories as a Member of Technical Staff in March 2001. His current research interests are in acoustic signal processing and multimedia communications. Dr. Huang is currently an Associated Editor of the EURASIP Journal on Applied Signal Processing. He is a member of the Signal Processing Theory and Methods and the Audio and Electroacoustics Technical Committees of the IEEE Signal Processing Society.

Review :
From the reviews: "This massive volume contains 53 chapters and covers just about all aspects of the field of speech processing. ... The editors are commended for producing a valuable tool in the understanding of speech and speech synthesis/recognition. The book is a valuable addition to the bookshelf of researchers, speech scientists, and engineers." (Richard J. Peppin, International Journal of Acoustics and Vibration, Vol. 13 (1), 2008) "This book is a comprehensive overview of most of the major topics associated with speech processing written by the most renowned authors in each topic. The book is well structured with a clearly organized topics. It is intended for use by the researcher ... . book is organized in nine sections that cover all current speech applications. ... In conclusion, I would highly recommend that anyone interested in speech processing have a copy of this encyclopaedic work." (Eduardo Lopez Gonsalo, The Phonetician, Vol. I-II (97/98), 2008)


Best Sellers


Product Details
  • ISBN-13: 9783540491255
  • Publisher: Springer-Verlag Berlin and Heidelberg GmbH & Co. KG
  • Publisher Imprint: Springer-Verlag Berlin and Heidelberg GmbH & Co. K
  • Height: 242 mm
  • Returnable: N
  • ISBN-10: 3540491252
  • Publisher Date: 28 Nov 2007
  • Binding: SA
  • Language: English
  • Width: 193 mm


Similar Products

Add Photo
Add Photo

Customer Reviews

REVIEWS      0     
Click Here To Be The First to Review this Product
Springer Handbook of Speech Processing
Springer-Verlag Berlin and Heidelberg GmbH & Co. KG -
Springer Handbook of Speech Processing
Writing guidlines
We want to publish your review, so please:
  • keep your review on the product. Review's that defame author's character will be rejected.
  • Keep your review focused on the product.
  • Avoid writing about customer service. contact us instead if you have issue requiring immediate attention.
  • Refrain from mentioning competitors or the specific price you paid for the product.
  • Do not include any personally identifiable information, such as full names.

Springer Handbook of Speech Processing

Required fields are marked with *

Review Title*
Review
    Add Photo Add up to 6 photos
    Would you recommend this product to a friend?
    Tag this Book Read more
    Does your review contain spoilers?
    What type of reader best describes you?
    I agree to the terms & conditions
    You may receive emails regarding this submission. Any emails will include the ability to opt-out of future communications.

    CUSTOMER RATINGS AND REVIEWS AND QUESTIONS AND ANSWERS TERMS OF USE

    These Terms of Use govern your conduct associated with the Customer Ratings and Reviews and/or Questions and Answers service offered by Bookswagon (the "CRR Service").


    By submitting any content to Bookswagon, you guarantee that:
    • You are the sole author and owner of the intellectual property rights in the content;
    • All "moral rights" that you may have in such content have been voluntarily waived by you;
    • All content that you post is accurate;
    • You are at least 13 years old;
    • Use of the content you supply does not violate these Terms of Use and will not cause injury to any person or entity.
    You further agree that you may not submit any content:
    • That is known by you to be false, inaccurate or misleading;
    • That infringes any third party's copyright, patent, trademark, trade secret or other proprietary rights or rights of publicity or privacy;
    • That violates any law, statute, ordinance or regulation (including, but not limited to, those governing, consumer protection, unfair competition, anti-discrimination or false advertising);
    • That is, or may reasonably be considered to be, defamatory, libelous, hateful, racially or religiously biased or offensive, unlawfully threatening or unlawfully harassing to any individual, partnership or corporation;
    • For which you were compensated or granted any consideration by any unapproved third party;
    • That includes any information that references other websites, addresses, email addresses, contact information or phone numbers;
    • That contains any computer viruses, worms or other potentially damaging computer programs or files.
    You agree to indemnify and hold Bookswagon (and its officers, directors, agents, subsidiaries, joint ventures, employees and third-party service providers, including but not limited to Bazaarvoice, Inc.), harmless from all claims, demands, and damages (actual and consequential) of every kind and nature, known and unknown including reasonable attorneys' fees, arising out of a breach of your representations and warranties set forth above, or your violation of any law or the rights of a third party.


    For any content that you submit, you grant Bookswagon a perpetual, irrevocable, royalty-free, transferable right and license to use, copy, modify, delete in its entirety, adapt, publish, translate, create derivative works from and/or sell, transfer, and/or distribute such content and/or incorporate such content into any form, medium or technology throughout the world without compensation to you. Additionally,  Bookswagon may transfer or share any personal information that you submit with its third-party service providers, including but not limited to Bazaarvoice, Inc. in accordance with  Privacy Policy


    All content that you submit may be used at Bookswagon's sole discretion. Bookswagon reserves the right to change, condense, withhold publication, remove or delete any content on Bookswagon's website that Bookswagon deems, in its sole discretion, to violate the content guidelines or any other provision of these Terms of Use.  Bookswagon does not guarantee that you will have any recourse through Bookswagon to edit or delete any content you have submitted. Ratings and written comments are generally posted within two to four business days. However, Bookswagon reserves the right to remove or to refuse to post any submission to the extent authorized by law. You acknowledge that you, not Bookswagon, are responsible for the contents of your submission. None of the content that you submit shall be subject to any obligation of confidence on the part of Bookswagon, its agents, subsidiaries, affiliates, partners or third party service providers (including but not limited to Bazaarvoice, Inc.)and their respective directors, officers and employees.

    Accept

    Fresh on the Shelf


    Inspired by your browsing history


    Your review has been submitted!

    You've already reviewed this product!