Tech Quarto
Search Advanced SearchView Cart   Checkout   
 Location:  Home » Computer Science » Spoken Language Processing: A Guide to Theory, Algorithm and System Development  
Categories
Computer Science
The Internet
For Dummies
Web Browsers
Windows
Digital Culture
Multimedia
Mobile & Wireless
New Releases
Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations
Computing Meaning, Volume 3 (Studies in Linguistics and Philosophy)
Machine Learning for Multimodal Interaction: 5th International Workshop, MLMI 2008, Utrecht, The Netherlands, September 8-10, 2008, Proceedings (Lecture Notes in Computer Science)
Computational Forensics: Second International Workshop, IWCF 2008, Washington, DC, USA, August 7-8, 2008, Proceedings (Lecture Notes in Computer Science)
Finding Out About: A Cognitive Perspective on Search Engine Technology and the WWW
Visit Laptop Nirvana for the best Cheap Discount Laptops
Bestsellers
Dragon NaturallySpeaking for Dummies
Speech and Language Processing (2nd Edition) (Prentice Hall Series in Artificial Intelligence)
The Cathedral & the Bazaar: Musings on Linux and Open Source by an Accidental Revolutionary
Learning Perl, Fourth Edition
Perl Best Practices
Spoken Language Processing: A Guide to Theory, Algorithm and System Development
Natural Language Understanding (2nd Edition)
Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition)
Dragon Naturally Speaking for the Office Professional: Speech Recognition Series
Perl Testing: A Developer's Notebook (Developers Notebook)

Spoken Language Processing: A Guide to Theory, Algorithm and System Development

Spoken Language Processing: A Guide to Theory, Algorithm and System Development

zoom enlarge 
Authors: Xuedong Huang, Alex Acero, Hsiao-wuen Hon
Publisher: Prentice Hall PTR
Category: Book

List Price: $89.00
Buy New: $71.17
You Save: $17.83 (20%)



New (5) Used (6) from $65.20

Avg. Customer Rating: 5.0 out of 5 stars 10 reviews
Sales Rank: 63947

Media: Paperback
Number Of Items: 1
Pages: 1008
Shipping Weight (lbs): 3.2
Dimensions (in): 9.1 x 7 x 2

ISBN: 0130226165
Dewey Decimal Number: 006.35
UPC: 076092010418
EAN: 9780130226167
ASIN: 0130226165

Publication Date: May 5, 2001
Availability: Usually ships in 1-2 business days
Condition: BRAND NEW

Customer Reviews:   Read 5 more reviews...

5 out of 5 stars A classic and comprehensive resource   December 12, 2006
 3 out of 3 found this review helpful

This book is a comprehensive overview of most of the major topics associated with speech processing. Divided into five main sections, the book is well structured with a clear division of concerns. The title, "Spoken Language Processing", may be misleading to some as language processing topics only accounts for one section of the book.

The first two sections cover the fundamental theories that should be understood before embarking in-depth into a study of speech processing. This may seem an obvious approach but many texts do not follow this pattern making their use as reference tomes limited. Separating background theory from its use is also useful in that it allows a rigorous approach to its description. Too often texts give a hurried imprecise overview of theories used before launching into a long and complex use of the theory; losing the reader instantly in a quagmire of formulae.

The first two sections of the book deals with background material, material that the reader should at least understand the key concepts of. The first section concentrates on speech in general (including production and perception), probability and statistics, and pattern classification. These last two topics mentioned are both important parts of the book and are dealt with in their own chapters. Both are well written with the right amount of explanation and background. Much of the remainder of the book expects at least some familiarity with the material presented here. These chapters, like all chapters in the book finish with a section entitled, "Historical Perspective and Further Reading". The inclusion of recommended further reading, in addition to the vast number of references appearing in each chapter, make the book as a whole a very good starting point for any work in speech processing.

The second section concerns itself with the DSP topics which relate to speech processing. In this section the reader will find everything from FFTs to multi-rate signal processing and speech signal representations to speech coding. Again the section is well written and the reader is not forced to refer to other texts to understand what is written. If a topic is not expanded upon here then it is an indication that is not dealt further in any great depth in the remainder of the book.

The third section of the book covers speech recognition and is probably the section which will find most use with many readers. This section is very thorough in its treatment of the subject. It starts immediately with a discussion of Hidden Markov Models which is almost exclusively the method employed in the pattern matching stage of speech recognition. Any algorithms that are mentioned are also detailed which really make the book useful. In fact algorithms are presented throughout the book making it a practical reference as much as a theoretical one. This is important because there is a big jump from understanding theory to being able to implement an algorithm to exploit that theory. Other topics covered include an excellent chapter on environmental robustness with one of the best discussions of microphones I have seen. Language modelling and search algorithms are given a thorough treatment. I would like to have seen more detailed information on front-end processing and endpoint detection, as this remains a critical stage of the recognition process. Perhaps the level of detail reflects the fact that this is currently a hot research topic with potential for significant advancement.

Section four, on text-to-speech processing, is a good overview of the field and better than any book I've seen on the subject. It shows numerous block diagrams of what you need to build such a system and gives numerous algorithms in pseudocode. It also dedicates a subsection to each block of the text-to-speech system block diagram, discussing in detail what you would need to do to implement that particular block. Since much of the individual blocks have been discussed earlier in the book, it refers you back to specific earlier sections for details.

The fifth section is a short one on entire systems and shows some case studies, concentrating on what Microsoft was doing at the time this book was published, since that is where the authors' research came from. I would highly recommend that anyone anticipating getting into speech processing have a copy of this classic nearby.



4 out of 5 stars exhortation   July 31, 2006
 0 out of 2 found this review helpful

Beautifully written book covering almost all area of spoken language processing. However, despite of relative ease of reading, it is necessary to warn the beginning reader, that in some sections the deep enough acquaintance to their theme is veiled. It, for example, concerns questions of definition and application of delta function concept in chapter 5 (Digital Signal Processing).


5 out of 5 stars A tour de force   August 23, 2005
 2 out of 2 found this review helpful

What a wonderful book. Whether you are a computer scientist or mathematician with limited exposure to the discipline of speech processing, or alternatively you are a dedicated expert in this field, you will find everything you are looking for in this book. For two weeks, I couldn't put this thing down. And that's an extraordinary testimony to a book that's 800+ pages of technical detail. If you want a high level understanding of how speech processing works, or if you want to dig in and build your own speech engine, everything you need is right here.


5 out of 5 stars Microsoft's future cook book   June 29, 2003
 3 out of 4 found this review helpful

This is a great book if you want to know the future of what Microsoft's top researchers like XD Huang etc are thinking and working. Dr Huang is a super star of the field and it is equally worthwhile to read his excellent book.


5 out of 5 stars Useful and interesting   January 5, 2003
 2 out of 4 found this review helpful

A thourough and complete review about the subject, in which many disciplines (language, computer, probability, statistics, numerical analysis) converge. As a non-practitioner I have found it an enjoyable opportunity to refresh my knowledges in the field of signal processing, and a source of many hints I have been able to develop in other branches. In spite of notations and methodologies (e.g. bayesian) a bit far from I am used to, the near one thousand pages never seemed extreme related to the meaning compressed into them, spreading from base theory to advanced applications.

Powered by Associate-O-Matic