
1. State-of-the-art speaker recognition methods applied to speakers with dysarthria

  • Mohammed Senoussaoui, Milton O. Saria-Paja, Patrick Cardinal, Tiago H. Falk and François Michaud

Abstract

Speech-based biometrics is one of the most effective approaches to identity management and is preferred by many users and companies for its flexibility, speed and low cost. Current state-of-the-art speaker recognition systems are known to depend strongly on the condition of the input speech and can be affected by unexpected variability at test time, such as environmental noise, changes in vocal effort, or pathological speech due to speech and/or voice disorders. In this chapter, we are particularly interested in understanding the effects of dysarthric speech on automatic speaker identification performance. We explore several state-of-the-art feature representations, including i-vectors, bottleneck neural-network-based features, and a covariance-based feature representation. High-level features, such as i-vectors and covariance-based features, are built on top of four different low-level representations of the dysarthric/controlled speech signal. When evaluated on the TORGO and NEMOURS databases, our best single system achieved an accuracy of 98.7%, outperforming results previously reported for these databases.
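To make the idea of a high-level feature built on top of low-level ones more concrete, the sketch below shows one common way a covariance-based representation can be formed: the sample covariance matrix of the frame-level feature vectors of an utterance is computed and its upper triangle is flattened into a single fixed-length vector. This is an illustrative assumption only; the abstract does not specify the chapter's exact construction, and the function name `covariance_features` and the toy numbers are hypothetical.

```python
from typing import List, Sequence


def covariance_features(frames: Sequence[Sequence[float]]) -> List[float]:
    """Map a variable-length sequence of frame vectors to one fixed-length vector.

    Hypothetical helper (not taken from the chapter): computes the sample
    covariance matrix of the frames and returns its upper triangle,
    including the diagonal, flattened row by row.
    """
    n = len(frames)
    if n < 2:
        raise ValueError("need at least two frames to estimate a covariance")
    d = len(frames[0])

    # Per-dimension mean over all frames.
    mean = [sum(f[j] for f in frames) / n for j in range(d)]

    # Centre every frame around the mean.
    centred = [[f[j] - mean[j] for j in range(d)] for f in frames]

    # Unbiased covariance estimate; keep only entries with j >= i,
    # since the matrix is symmetric.
    feats = []
    for i in range(d):
        for j in range(i, d):
            feats.append(sum(c[i] * c[j] for c in centred) / (n - 1))
    return feats


# Toy "utterance": 4 frames of 2-dimensional low-level features (made-up values).
utterance = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]]
features = covariance_features(utterance)
```

For d-dimensional low-level features the resulting vector has d(d+1)/2 entries regardless of utterance length, which is what allows utterances of different durations to be compared by a standard classifier.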

Chapters in this book

  1. Frontmatter
  2. Foreword
  3. Acknowledgments
  4. Contents
  5. List of contributors
  6. Introduction
  7. Part I: Comparative analysis of methods for speaker identification, speech recognition, and intelligibility modification in the dysarthric speaker population
  8. 1. State-of-the-art speaker recognition methods applied to speakers with dysarthria
  9. 2. Enhancement of continuous dysarthric speech
  10. 3. Assessment and intelligibility modification for dysarthric speech
  11. Part II: New approaches to speech reconstruction and enhancement via conversion of non-acoustic signals
  12. 4. Analysis and quality conversion of nonacoustic signals: the physiological microphone (PMIC)
  13. 5. Non-audible murmur to audible speech conversion
  14. Part III: Use of novel speech diagnostic and therapeutic intervention software for speech enhancement and rehabilitation
  15. 6. Application of speech signal processing for assessment and treatment of voice and speech disorders
  16. 7. A mobile phone-based platform for asynchronous speech therapy
Downloaded on 23.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/9781501501265-002/html