Finnish TreeBank (FTB)

These treebanks and parsebanks for Finnish were created by the FinnTreeBank project. The data in FinnTreeBank 1 is based on model sentences in Iso suomen kielioppi (The Large Grammar of Finnish), manually annotated with dependency-syntactic descriptions (see the tagset and the annotation manual). FinnTreeBank 1 was built as a Grammar Definition Corpus and intended as a model for further automatic analysis of Finnish. FinnTreeBank 2 is a small extension to FinnTreeBank 1, and it was manually annotated in the same fashion as the first treebank. FinnTreeBank 3 is a large treebank that was only automatically annotated, using an experimental method. As a result, the annotations in the third treebank are of much lower quality in comparison to the manually annotated treebanks.

The UD version of FinnTreeBank 1 was derived from FinnTreeBank 1 2014 by a scripted mapping of labels and some restructuring in an attempt to conform approximately to the UD Finnish model.

More information on UD Finnish FTB

UD versions:  
UD Finnish-FTB: The UD version of FinnTreeBank 1
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Download the resource
Search for these versions in META-SHARE  
Latest versions/subcorpora:  
The Downloadable Version of the Finnish TreeBank 1
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Download the resource
The Helsinki Korp Version of the Finnish TreeBank 1
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Select the corpus in Korp (as part of FTB2)
The Downloadable Version of the Finnish TreeBank 2
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Download the resource
The Helsinki Korp Version of the Finnish TreeBank 2
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Select the corpus in Korp
The Downloadable Version of the Finnish TreeBank 3
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Download the resource
The Helsinki Korp Version of the Finnish TreeBank 3
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Select the corpus in Korp
Search for these versions in META-SHARE  

 

Several different versions of these resources are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. The links to the different versions can be found on the list above. Details on the content and license of each version are available via the metadata records.

Annotation details

Publications related to FinnTreeBank

 


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021031604

 

Search the Language Bank Portal:
Harri Uusitalo
Researcher of the Month: Harri Uusitalo

 

Upcoming events


Contact

The Language Bank's technical support:
kielipankki (at) csc.fi
tel. +358 9 4572001

Requests related to language resources:
fin-clarin (at) helsinki.fi
tel. +358 29 4129317

More contact information