Skip to content

itmo-mbss-lab/sr_lectures_book

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ITMO Speaker Recognition Course

Authors: Novoselov S., Lavrentyeva G., Volokhov V., Matveev Y.

Description: the project is related to the development of Basics of Voice Biometrics lecture book for the ITMO Speaker Recognition Course.

Keywords: voice biometrics, speaker recognition, speaker verification, speaker identification, acoustic features, speech activity detector, machine learning, speaker embedding extractor, deep neural network, decision theory, domain adaptation and calibration, speaker diarization.

Content: the repository contains theoretical materials (now only in russian language) for self-study in the speaker recognition area. This book is a theoretical supplement to the lab work here. Overleaf project of the book is here. The titles of the book chapters are listed below.

  • Introduction (link).
  • Chapter 1. Introduction to voice biometrics (link).
  • Chapter 2. Preprocessing of speech signals (link).
  • Chapter 3. Classical methods for speaker model computing (link).
  • Chapter 4. State of the art methods for speaker model computing (link).
  • Chapter 5. Comparison of speaker models (link).
  • Chapter 6. Decision criteria (link).
  • Chapter 7. Quality assessment of biometric systems (link).
  • Chapter 8. Domain adaptation (link).
  • Chapter 9. Calibration of speaker recognition system (link).
  • Chapter 10. Speaker diarization (link).
  • Chapter 11. Prospective directions for the voice biometrics development (link).
  • Subject index (link).
  • Contents (link).

Releases

No releases published

Packages

 
 
 

Contributors

Languages