“Improving access to a critically under-resourced language: AI-based approaches for producing and obtaining Livonian content”

Project No 5.2.1.1.i.0/2/24/I/CFLA/007 “Internal and External Consolidation of the University of Latvia” of the second round of the Consolidation and Governance Change Implementation Grants within Investment 5.2.1.1.i “Research, Development and Consolidation Grants” under Reform 5.2.1.r “Higher Education and Science Excellence and Governance Reform” of Reform and Investment Strand 5.2 of the Latvian Recovery and Resilience Mechanism Plan “Ensuring Change in the Governance Model of Higher Education Institutions”.

Project: “Improving access to a critically under-resourced language: AI-based approaches for producing and obtaining Livonian content”
No. LU-BA-PA-2024/1-0056

(2024–2026)

The aim of the project is to explore whether and how speech synthesis and recognition can be built for critically under-resourced languages with limited data and an extremely small number of contemporary speakers. This will be achieved through the application of experimental data processing methods – based on the hypothesis developed by the project team – together with (1) the AI applications, know-how, and guidance provided by the Artificial Intelligence Laboratory (AIL) at the University of Latvia (UL) Institute of Mathematics and Computer Science, and (2) the experience in Livonian phonetics, prosody, and development of speech corpora provided by the University of Tartu Institute of Estonian and General Linguistics (IEGL), both of which are partners of this project.

This project focuses on Latvia’s indigenous Livonian language. It builds on the applicant’s earlier work on digital resources and tools for the Livonian language as well as on the experience and excellence of the project partners. The project seeks not only to obtain new knowledge about the use of AI tools in the specific conditions in which critically under-resourced endangered languages exist – which is the project’s main novelty – but also to achieve practical results by improving access for the general public, researchers, and the Livonian community itself to written Livonian language collections and, in the longer term, to the content of audio collections.

The overarching goal of the applicant and partners is to create speech recognition for Livonian. The current project focuses on the first step toward that goal: the creation of speech synthesis for Livonian and the multiplication of data, which also serves as a precondition for future research and the development of speech recognition.

Project results: scientific publications – 2, conference materials – 3, scientific databases and data collections – 1, submitted project applications – 1, other results relevant to the research subject – a Livonian speech synthesizer.

RESULTS OBTAINED:

SCIENTIFIC PUBLICATIONS

Tuisk, Tuuli, Nicolai Pharao. Unveiling Tonal Contrasts in the Baltic Region: Exploring Stød in Livonian Spontaneous Speech. Linguistica Uralica LX 2024 4, pp. 241–270. DOI: https://doi.org/10.3176/lu.2024.4.01. Available here.

Ernštreits, Valts. Towards the speech recognition for Livonian. 9th International Workshop on Computational Linguistics for Uralic Languages, IWCLUL 2024, November 28–29, 2024: Proceedings of the Workshop. Helsinki: Association for Computational Linguistics, 2024, pp. 76–80. Available here.

SCIENTIFIC CONFERENCES AND SEMINARS, ABSTRACTS

Ernštreits, Valts. Towards the speech recognition for Livonian. 9th International Workshop on Computational Linguistics for Uralic Languages, IWCLUL 2024, November 28–29, 2024.

Ernštreits, Valts. Endangered Languages and Cultures in the Digital Era. Plenary speech at the student conference “Bridges in the Baltics”. Vilnius, 4.10.2024. Conference platform here.

Ernštreits, Valts. Digital Future of Indigenous Languages – a ground to prosper or a new battlefield? Presentation at the conference “LT4ALL – Language Technologies for All”. Paris, 24–26.02.2025. Conference page here and abstracts here.

Ernštreits, Valts. Roots of the Livonian written language. Plenary presentation at the conference “Koloniaalkeelest rahvuskeeleks”. Tallinn, 27–28.02.2025. Abstracts here.

Ernštreits, Valts. Transforming Livonian Folklore Archives into a Multifaceted Digital Resource. Presentation at the conference “Digital Humanities in the Nordic and Baltic Countries” (DHNB2025). Tartu, 4.03.2025. Abstracts here.

Ernštreits, Valts. The path towards literary Livonian. Presentation at the conference “500 years of Latvian and Estonian books and Literary Language”. Riga, 23–24.05.2025. More here.

Ernštreits, Valts. Bridging the Digital Divide: Latvia’s Indigenous Livonian Case. Presentation at the “IV International Conference on the Revitalization of Indigenous and Minoritized Languages”. Kyoto University, Kyoto, 12–14.09.2025.

Ernštreits, Valts. Building a Future for Livonian. Plenary lecture at the “XIV Congressus Internationalis Fenno-Ugristarum” (CIFU XIV). University of Tartu, Tartu, 18–23.08.2025.

PUBLICITY

39 posts about project events have been published on the Institute’s Facebook profile. They can be found using the hashtags #NextGenerationEU #AtveselosanasFonds.

Ernštreits, Valts. Overview of the UL Livonian Institute’s projects. Ventspils Livonian culture days, 13.10.2024.

Ernštreits, Valts. Latvijas Radio 1 programme “Zināmais nezināmajā” – talk on Livonian, the challenges of endangered languages in the digital age, and approaches to narrowing digital gaps. The broadcast can be listened to here.

The UL Livonian Institute participated in the European Researchers’ Night, providing participants with information on its projects, digital instruments, and tools. 27.09.2024. Information about the schedule here.

Ernštreits, Valts. Participation in the Radio SWH programme “Prāts izglābs pasauli”, talk on the challenges and solutions of the digital world. 21.09.2024. The broadcast can be listened to here.

Ernštreits, Valts, Kļava, Gunta. Workshop “Kui se līb #līvõkīelkõks?”. (FB post). 02.02.2025.

Ernštreits, Valts. Public lecture “Livonian and other Endangered languages in the digital age”. (FB post). 02.02.2025.

Ernštreits, Valts. Interview about Livonians and digital technologies in the newspaper “Talsu vēstis”, 7.03.2025. (here)

Ernštreits, Valts. Interview about Livonians and digital technologies on Latvian Public Broadcasting, 9.06.2025. (here)

23–27 June 2025. Lillestrøm, Norway – participation of the Director of the Livonian Institute, Valts Ernštreits, in the UN Internet Governance Forum.

On 26 June, he spoke in two panels — “Strengthening Multistakeholder Participation” and “Indigenous Peoples’ Languages in a Digital Age.” His talks focused on multilingualism, Indigenous inclusion in digital domains, and the newly launched UNESCO publication “Global Roadmap for Multilingualism in the Digital Era.”

8 August 2025. Valts Ernštreits participated in the World Summit on the Information Society organized by the UN Information and Communication Technology Agency.

In the panel on multilingualism in the digital age, he presented the Institute’s work in giving the Livonian language a voice in the digital world and discussed global initiatives aimed at promoting Indigenous languages as well as linguistic and cultural diversity.

30 September 2025. On International Translation Day (#InternationalTranslationDay2025), Valts Ernštreits presented one of today’s most influential international actions for the future of language technologies — the UNESCO Global Roadmap for Multilingualism in the Digital Era.

He also introduced the Livonian Speaker Engagement Programme, implemented by the University of Latvia’s Livonian Institute, which promotes digital use and revitalization of Latvia’s Indigenous Livonian language.

The event was organized by Translation Commons and the International Decade of Indigenous Languages (2022–2032).

4 August 2025. At the 4th Forum of Latvian Professionals Working in International Organizations, organized by the Ministry of Foreign Affairs and dedicated to security, defense, social cohesion, and resilience, Valts Ernštreits presented the Institute’s work within international Indigenous organizations.

NEW COOPERATIONS

The Livonian Institute of the University of Latvia has signed an agreement with META as a language technology partner within the initiative “Democratizing Language Technology for the International Decade of Indigenous Languages.”

Discussions have also begun on data collaboration with the Mozilla Foundation’s Data Collective.

Planned cooperation with Microsoft includes joining the initiative “LINGUA: Expanding Europe’s Voices in AI.”

A Technical Experts Meeting, “Languages at Every Stage: A Technical Dialogue on Inclusive Language Technology Development,” was organized by UNESCO and the Zenter fir d’Lëtzebuerger Sprooch (ZLS).

POLICY IMPACT

On 13.09.2024, members of the project’s scientific team began work in the ad hoc groups attached to the Global Task Force for the UN International Decade of Indigenous Languages (2022–2032). Gunta Kļava joined the group dedicated to language transmission, and Valts Ernštreits joined the group dedicated to digital equality and domains of Indigenous languages, where he was elected one of its two co-chairs. Following a proposal by Valts Ernštreits, the ad hoc group on digital equality prepared draft amendments, from the perspective of Indigenous languages, to the Recommendation on the Ethics of Neurotechnology (more here). The ad hoc groups have also prepared a global survey on Indigenous languages, including the presence of Indigenous languages in the digital space (here).

Valts Ernštreits participated as a member of the editorial group established by UNESCO to prepare one of the most significant contemporary global policy initiatives in language technology development — the UNESCO Global Roadmap for Multilingualism in the Digital Era, which will be presented on 6 November 2025 in Doha, Qatar, during the UN Second World Summit for Social Development.

April 4, 2025: Valts Ernštreits was elected as Latvia’s representative to the United Nations Permanent Forum on Indigenous Issues (2026–2028; here).

16 September 2025. The government approved the first report to the Saeima (Parliament) on achievements and planned actions in national language policy.

The report includes information prepared by the University of Latvia’s Livonian Institute on measures to ensure the vitality of Latvia’s Indigenous Livonian language, including activities implemented within the framework of projects dedicated to this goal.

February 3, 2025: At the Ministry of Foreign Affairs of the Republic of Latvia, Latvian representatives in UNESCO institutions were introduced to the work carried out in this project to ensure digital language equality (here).

March 23, 2025: A meeting took place in Kolka with the Minister of Culture of the Republic of Latvia, Agnese Lāce, during which the situation and needs of the Livonian community were discussed, including issues related to the presence of the Livonian language in the digital environment (here).

February 2025: An initiative was sent to the State Chancellery and the Saeima (Parliament) on removing obstacles to the use of the Livonian language, including in state data systems.