Provides an insightful and practical introduction to crowdsourcing as a means of rapidly processing speech data.
The book is intended both for newcomers who want to get started in the domain and learn how to set up a task, what interfaces are available, and how to assess the work, and for those who have already used crowdsourcing and want to create better tasks and obtain better assessments of the crowd's work. It includes screenshots showing examples of good and poor interfaces, along with case studies of speech processing tasks that walk through the task creation process, review the options in the interface and in the choice of medium (MTurk or other), and explain the choices made.
Key Features
- Addresses important aspects of this new technique that should be mastered before attempting a crowdsourcing application.
- Offers speech researchers the hope that they can spend much less time dealing with the data gathering/annotation bottleneck, leaving them to focus on the scientific issues.
- Readers will directly benefit from the book’s successful examples of how crowdsourcing was implemented for speech processing, discussions of interface and processing choices that worked and choices that didn’t, and guidelines on how to play and record speech over the internet, how to design tasks, and how to assess workers.
Essential reading for researchers and practitioners in speech research groups involved in speech processing.
Contents
Chapter 1 An Overview
- 1.1 Growing Needs for Speech Data
- 1.2 Some Issues
- 1.3 Some Terminology
- 1.4 Acknowledgements
- References
Chapter 2 The Basics
- 2.1 An Overview of the Literature on Crowdsourcing for Speech Processing
- 2.2 Alternate Solutions
- 2.3 Some Ready-Made Platforms for Crowdsourcing
- 2.4 Making Task Creation Easier
- 2.5 Getting Down to Brass Tacks
- 2.6 Quality Control
- 2.7 Judging the Quality of the Literature
- 2.8 Some Quick Tips
- References
Chapter 13 Collecting Speech from Crowds
- 13.1 A Short History of Speech Collection
- 13.2 Technology for Web-based Audio Collection
- 13.3 Example: WAMI Recorder
- 13.4 Example: The WAMI Server
- 13.5 Example: Speech Collection on Amazon Mechanical Turk
- 13.6 Using the Platform Purely for Payment
- 13.7 Advanced Methods of Crowdsourced Audio Collection
- 13.8 Summary
- 13.9 Acknowledgements
- References
Index
Book Details
- Hardcover: 356 pages
- Publisher: Wiley; 1st edition (May 6, 2013)
- Language: English
- ISBN-10: 1118358694
- ISBN-13: 978-1118358696
- Product Dimensions: 6.8 x 0.9 x 9.6 inches
- List Price: $125.00