Finnish News Corpus for Named Entity Recognition

finer-data

Persistent Identifier of this resource:

http://urn.fi/urn:nbn:fi:lb-2019050201

Access location:

The corpus consists of 953 articles (193,742 word tokens) with six named entity classes (organization, location, person, product, event,and date). The articles are extracted from the archives of Digitoday, a Finnish online technology news source.

The data sets are available at https://github.com/mpsilfve/finer-data and will be available in the download service korp.csc.fi/download in Kielipankki – the Language Bank of Finland.

The FiNER system and its technical documentation are available at http://urn.fi/urn:nbn:fi:lb-2018091301

You don’t have the permission to edit this resource.