• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • А
  • А
  • А
  • А
  • А
Обычная версия сайта
01
Февраль

Речевые технологии

2024/2025
Учебный год
RUS
Обучение ведется на русском языке
6
Кредиты
Статус:
Курс по выбору
Когда читается:
2-й курс, 1, 2 модуль

Преподаватель


Гурков Иван Евгеньевич

Программа дисциплины

Аннотация

The course introduces students to the basic principles and methods of speech signal analysis and automatic synthesis, as well as automatic speech recognition. Students obtain an understanding of the acoustics of the speech signal, learn to apply various tools for its processing and markup. Students are also introduced to existing speech recognition and synthesis systems and learn to apply them in practice.
Цель освоения дисциплины

Цель освоения дисциплины

  • Familiarization with the methods of signal processing
  • Familiarization with the method of recognition and synthesis of speech
  • Recognition by the student of the system and the model of synthesis and recognition
Планируемые результаты обучения

Планируемые результаты обучения

  • has an idea of the acoustic theory of speech formation, operates basic acoustic concepts (frequency, period, amplitude, resonator, spectrum, harmonics, formants, basic tone)
  • possesses skills of signal processing: construction of instantaneous spectra and sonograms, calculation of formants, signal markup in Praat program, manipulation of signal properties (amplitude, basic tone)
  • is oriented in the basic methods of speech signal synthesis (compilative: subphonetic, allophonetic, diphonetic, syllabic, macrosynthesis, unit selection; parametric, articulatory)
  • possesses skills of sound base development for compilative synthesis
  • is fluent in the apparatus of automatic speech recognition system (ASR): acoustic model, language model, decoder
  • possesses skills of extracting acoustic features relevant for ASR from the signal using Kaldi or Python
  • understands the principles of creating pronunciation dictionaries, is oriented in methods and tools of their development
  • possesses the skills of applying ASR systems in practice and evaluating the quality of recognition.
Содержание учебной дисциплины

Содержание учебной дисциплины

  • Acoustic theory of speech formation
  • Acoustic analysis of speech signal
  • History of speech technologies
  • Directions of speech synthesis
  • Compilative synthesis of speech
  • Automatic transcription and text normalization
  • General information about ASR systems
  • Acoustic modeling in ASR systems
  • Language modeling and dictionaries in ASR systems
  • Finding the right solution
Элементы контроля

Элементы контроля

  • неблокирующий Exam
    The examination is conducted in verbal form by tickets. Each ticket contains two questions
  • неблокирующий Homework
    Homework: includes practical assignments
Промежуточная аттестация

Промежуточная аттестация

  • 2024/2025 2nd module
    0.3 * Exam + 0.7 * Homework
Список литературы

Список литературы

Рекомендуемая основная литература

  • Speech and language processing, Jurafsky, D., 2014

Рекомендуемая дополнительная литература

  • A history of communications : media and society from the evolution of speech to the Internet, Poe, M. T., 2011

Авторы

  • Кессель Ксения Витальевна
  • Корнева Анна Михайловна
  • Колмогорова Анастасия Владимировна