Romanovskyi, Oleh та Iosifov, Ievgen та Iosifova, Olena та Sokolov, Volodymyr та Skladannyi, Pavlo та Sukaylo, Igor та Tsarenok, Oleksii (2025) Accuracy Improvement of Spoken Language Identification System for Close-related Languages Advances in Computer Science for Engineering and Education VII. ICCSEEA 2024 (242). ISSN 2367-4512
![]() |
Текст
Romanovskyi_O_et_al_LNDECT_242.pdf Download (59kB) |
Анотація
Spoken Language Identification (SLI) systems have witnessed tremendous progress in adopting deep learning architectures. However, differentiating between acoustically and phonetically similar languages remains a significant challenge. This paper investigates the challenges of building SLI systems for similar languages and presents an accuracy improvement strategy. We provide experimental results on a set of closely related languages (Swedish-Norwegian, Czech-Slovak, Ukrainian-Russian, and Spanish-Catalan-Galician) and discuss the implications of our findings. This paper aims to research data impact on SLI models' accuracy, primarily focusing on comparing models trained on different datasets, data quality versus quantity comparison, and data balance demand to achieve the best possible SLI model. Experiments lead us to conclusions that the most impactful criteria on additional accuracy are region-specific SLI models (outperform heavy multilingual by 2.47% on average and much faster to train), adding new domain dataset (improvements for a new domain without significant risk to the original domain accuracy), balancing training data for the same amount of data per each language (provides stability in trained pairs/triples), and adding less but more quality data (30% high-quality data gives more accuracy than 100% of mid-quality data).
Тип елементу : | Стаття |
---|---|
Ключові слова: | Spoken Language Identification; SLI; Spoken Language Recognition; SLR; Language identification; LID; VoxLingua107; Common Voice |
Типологія: | Статті у базах даних > Scopus > У виданнях Q4 Scopus |
Підрозділи: | Факультет інформаційних технологій та математики > Кафедра інформаційної та кібернетичної безпеки ім. професора Володимира Бурячка |
Користувач, що депонує: | Павло Миколайович Складанний |
Дата внесення: | 24 Квіт 2025 09:19 |
Останні зміни: | 24 Квіт 2025 09:19 |
URI: | https://elibrary.kubg.edu.ua/id/eprint/51663 |
Actions (login required)
![]() |
Перегляд елементу |