Tekstiks – Speech recognition: speech to text

Suomeksi

A demo service where you can try out automatic speech transcription and edit the automatically generated transcript via a browser interface.

Note: For the time being, this service is intended for trial use with individual audio files only. The service is not designed to handle large amounts of data and should not be used to handle confidential speech recordings for data protection reasons.

Latest version:  

Tekstiks
icon-info-circle Metadata and license
icon-quote-right Attribution instructions

Access the service
Look for all versions of this tool in META-SHARE  

What is Tekstiks.ee?

Tekstiks.ee is a web browser-based speech recognition service for transcribing speech in Estonian or Finnish.

The Tekstiks service is part of the international CLARIN cooperation. The text editor for editing transcripts and the interface for running speech recognition tools have been developed at the Laboratory of Language Technology at the Tallinn University of Technology (TalTech). TalTech’s own speech recogniser for the Estonian language is connected to the service, as well as the speech recogniser for Finnish provided through the Language Bank of Finland, which uses speech recognition models developed at Aalto University.

The system can handle several files simultaneously. The average processing time is about half of the recording’s length (in November 2022). The language of the browser interface can be set to English or Finnish instead of Estonian.

How does Tekstiks work with Finnish speech?

First, you need to create a local username on a server managed by the Tallinn University of Technology in Estonia. To create an account, all you need is a working email address, a user name and a password. The audio files to be processed are uploaded to the Tekstiks server in Estonia. The logged-in user can manage and delete files uploaded to the Tekstiks server.

If Finnish speech recognition is selected and activated in the Tekstiks service, the speech recordings are transferred over the network to a CSC-hosted server in Finland, where they are processed. The recognised text is transferred from the CSC server back to the Tekstiks server in Estonia, where the user can further edit the text and, if they wish, download it. Currently the supported download format is .docx (MS Word document).

Please note that the level of security of this test service is currently not sufficient to handle confidential speech data.

Reference instructions

  • If you use the Tekstiks service in your research, please refer to the following article in your publications:
    Olev, Aivo; Alumäe, Tanel. (2022) Estonian Speech Recognition and Transcription Editing Service. Baltic J. Modern Computing, Vol. 10 (3), pp. 409–421. DOI: 10.22364/bjmc.2022.10.3.14
  • If you are using the Finnish speech recogniser developed at Aalto University, please check the reference guidelines for that as well.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2022112802

Search the Language Bank Portal:
Harri Uusitalo
Researcher of the Month: Harri Uusitalo

 

Upcoming events


Contact

The Language Bank's technical support:
kielipankki (at) csc.fi
tel. +358 9 4572001

Requests related to language resources:
fin-clarin (at) helsinki.fi
tel. +358 29 4129317

More contact information