Speech transformation solutions
-
Dimitri Kanevsky
Abstract
This paper outlines the background development of “intelligent” technologies such as speech recognition. Despite significant progress in the development of these technologies, they still fall short in many areas, and rapid advances in areas such as dictation are actually stalled. In this paper we have proposed semi-automatic solutions — smart integration of human and intelligent efforts. One such technique involves improvement to the speech recognition editing interface, thereby reducing the perception of errors to the viewer. Other techniques that are described in the paper are batch enrollment, which allows the user to reduce the amount of time required for enrollment, and content spotting, which can be used for applications that have repeated content flow, such as movies or museum tours. The paper also suggests a general concept of distributive training of speech recognition systems that is based on data collection across a network.
Abstract
This paper outlines the background development of “intelligent” technologies such as speech recognition. Despite significant progress in the development of these technologies, they still fall short in many areas, and rapid advances in areas such as dictation are actually stalled. In this paper we have proposed semi-automatic solutions — smart integration of human and intelligent efforts. One such technique involves improvement to the speech recognition editing interface, thereby reducing the perception of errors to the viewer. Other techniques that are described in the paper are batch enrollment, which allows the user to reduce the amount of time required for enrollment, and content spotting, which can be used for applications that have repeated content flow, such as movies or museum tours. The paper also suggests a general concept of distributive training of speech recognition systems that is based on data collection across a network.
Kapitel in diesem Buch
- Prelim pages i
- Table of contents v
- About the authors ix
-
Introduction
- Offloading cognition onto cognitive technology 1
-
Articles
- A framework for thinking about distributed cognition 25
- Distributed cognition: Domains and dimensions 45
- Distributed cognition: A methodological note 57
- Radical changes in cognitive process due to technology: A jaundiced view 71
- The grounding and sharing of symbols 83
- Collaborative tagging as distributed cognition 93
- Thinking in groups 99
- Distributed learning and mutual adaptation 117
- Distributed cognition, representation, and affordance 137
- Categorization and technology innovation 145
- Crime scene investigation as distributed cognition 159
- Web Search engines and distributed assessment systems 185
- Speech transformation solutions 207
- Computer-aided translation as a distributed cognitive task 237
- Index 257
Kapitel in diesem Buch
- Prelim pages i
- Table of contents v
- About the authors ix
-
Introduction
- Offloading cognition onto cognitive technology 1
-
Articles
- A framework for thinking about distributed cognition 25
- Distributed cognition: Domains and dimensions 45
- Distributed cognition: A methodological note 57
- Radical changes in cognitive process due to technology: A jaundiced view 71
- The grounding and sharing of symbols 83
- Collaborative tagging as distributed cognition 93
- Thinking in groups 99
- Distributed learning and mutual adaptation 117
- Distributed cognition, representation, and affordance 137
- Categorization and technology innovation 145
- Crime scene investigation as distributed cognition 159
- Web Search engines and distributed assessment systems 185
- Speech transformation solutions 207
- Computer-aided translation as a distributed cognitive task 237
- Index 257