What is it about?

Allophones are various phonetic realizations of a language. It is difficult to classify them using acoustical representation, only. Therefore, we added data from the face motion capture system, where lip movements are clearly visible. We discuss results of results of allophone recognition using such a bi-modal approach.

Featured Image

Why is it important?

Speech recognition on the allophonic level is important for automatic translation of speech to IPA (International Phonetic Alphabet)

Perspectives

We still work at automatic speech translation to IPA. The audio-video corpus was prepared accessible at the web address: www.modality-corpus.org

Andrzej Czyzewski
Gdańsk University of Technology

Read the Original

This page is a summary of: Bimodal classification of English allophones employing acoustic speech signal and facial motion capture, The Journal of the Acoustical Society of America, September 2018, Acoustical Society of America (ASA),
DOI: 10.1121/1.5067951.
You can read the full text:

Read

Contributors

The following have contributed to this page