Speech Processing

Tilde has worked on spoken language processing since the late 1990s.

Tilde’s researchers have developed the world's first Latvian automatic speech recognition and text-to-speech systems. The research result, a Latvian automatic speech recognition service, is freely available as a web service. Both systems are also integrated into Tilde's products.

Speech Technology Research

Driving Force for Development of Speech Technology for Baltic Languages

Tilde continues its research on speech recognition by adapting the technologies it has developed to new languages (e.g. Lithuanian) and to specific domains (e.g. medicine and public debates). Special attention is paid to the data sparseness problem that is typical of morphologically rich languages and to novel methods for acquiring data from the web. Dictation, a new application domain, is also an active research topic. Finally, methods for speech recognition from low-quality data are actively researched.

In speech synthesis, Tilde’s experience covers several methods, including unit selection, parametric synthesis and, more recently, parametric synthesis with neural networks.



Ongoing projects

ODINE project

Open Data Incubator for Europe

ODINE aims to support the next generation of digital businesses and help them fast-track the development of their products.

QT 21 project

Quality Translation 21

The project aims to develop substantially improved statistical and machine-learning-based translation models for challenging languages and resource scenarios.

European Language Resource Coordination

The objective of the project is to identify and gather language and translation data relevant to public administration across all 30 European countries.


Georg Rehm, Stelios Piperidis, Kalina Bontcheva, Jan Hajic, Victoria Arranz, Andrejs Vasiļjevs (Tilde), Gerhard Backfried, Jose Manuel Gomez-Perez, Ulrich Germann, Rémi Calizzano, Nils Feldhus, Stefanie Hegele, Florian Kintzel, Katrin Marheinecke, Julian Moreno-Schneider, Dimitris Galanis, Penny Labropoulou, Miltos Deligiannis, Katerina Gkirtzou, Athanasia Kolovou, Dimitris Gkoumas, Leon Voukoutis, Ian Roberts, Jana Hamrlova, Dusan Varis, Lukas Kacena, Khalid Choukri, Valérie Mapelli, Mickaël Rigault, Julija Melnika (Tilde), Miro Janosik, Katja Prinz, Andres Garcia-Silva, Cristian Berrio, Ondrej Klejch and Steve Renals. 2021. European Language Grid: A Joint Platform for the European Language Technology Community. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 221–230.