The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), version 2

Last update: 2020-02-26

Helsingin puhekielen pitkittäiskorpus (1970, 1990, 2010), versio 2

helpuhe-v2

Persistent Identifier of this resource:

http://urn.fi/urn:nbn:fi:lb-2016041424

This corpus will be available in Kielipankki - the Language Bank of Finland (lat.csc.fi), under CLARIN RES+PLAN+NC+PRIV license (see http://urn.fi/urn:nbn:fi:lb-2015041303). Personal permission is required in order to access the corpus. The purpose of the resource use must be outlined in a research plan. Access rights are limited due to personal data protection issues.

The corpus contains interviews with people of different ages born in Helsinki. The data was collected in three periods: 1972-74, 1991-92 and 2013. The material consists of audio recordings of individual interviews, each about one hour long. Although the interviews do not contain exactly the same questions, they cover the same topics: the interviewees' school, work and hobbies, as well as their lives in Helsinki in general. In addition, the interviews contain questions about the interviewees' perception of the languages and language forms spoken in Helsinki.

This version of the corpus contains updated and new transcripts for a number of the original recordings. The audio files have not been updated since the first corpus version: version 2.0 incorporates the original audio files and all the unmodified transcripts alongside the updated and new ones. Work on the transcription, alignment and thematic coding of the corpus is planned to continue.

The corpus should be referred to in the following way:

The Longitudinal Corpus of Finnish Spoken in Helsinki, decade subcorpus, version, informant's code (if applicable). Examples:

- The Longitudinal Corpus of Finnish Spoken in Helsinki, 1970s subcorpus, version 2, F60
- The Longitudinal Corpus of Finnish Spoken in Helsinki, 1990s subcorpus, version 2

The informant's code should be included whenever concrete text examples from the corpus are quoted.
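The citation pattern above can be assembled mechanically. The following is a minimal sketch, not an official tool: the function name `cite_helpuhe`, its parameters and the decade check are hypothetical, and the output format is inferred only from the two examples given in this record.

```python
CORPUS_NAME = "The Longitudinal Corpus of Finnish Spoken in Helsinki"
# Subcorpus labels, assumed from the decades named in the corpus title.
DECADES = ("1970s", "1990s", "2010s")


def cite_helpuhe(decade, informant_code=None, version=2):
    """Build a reference string in the style of the examples in this record.

    Hypothetical helper: the informant code is appended only when given,
    i.e. when a concrete text example from the corpus is being quoted.
    """
    if decade not in DECADES:
        raise ValueError(f"unknown subcorpus decade: {decade!r}")
    parts = [CORPUS_NAME, f"{decade} subcorpus", f"version {version}"]
    if informant_code:
        parts.append(informant_code)
    return ", ".join(parts)


print(cite_helpuhe("1970s", "F60"))
print(cite_helpuhe("1990s"))
```

The two calls reproduce the two example references listed above.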

Important: due to the nature of the material, the resource should be handled with care in order to respect the privacy of the people concerned. If samples of the data are published, they must be anonymized according to best practices.

Distribution

Availability

Available - Restricted Use

Licence

CLARIN RES

Restrictions: Academic - Non Commercial Use, Attribution, No Derivatives, No Redistribution, Other

Distribution Access/Medium: Accessible Through Interface, Downloadable

Attribution Details: see Documentation

Licensors:

University of Helsinki

Kotimaisten kielten keskus, Institute for the Languages of Finland

Heikki Paunonen

Distribution rights holders:

University of Helsinki

Kotimaisten kielten keskus, Institute for the Languages of Finland

Heikki Paunonen

IPR Holder

University of Helsinki

Heikki Paunonen

Kotimaisten kielten keskus, Institute for the Languages of Finland

Contact Persons

User support FIN-CLARIN

Hanna Lappalainen

text
audio

Monolingual text corpus

Languages

Finnish

Linguality

Linguality type: Monolingual

Size

83 Files

Modalities

Spoken Language

Time Coverage

1970-2013

Geographic coverage

Helsinki

Creation

Creation mode details: Manual transcription or alignment of existing transcripts

Creation mode: Manual

Original Sources

The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s)

Creation Tools

Praat

Link to Other Media

Other media: Audio

Synchronized with audio: True

Synchronized with text: False

Monolingual audio corpus

Languages

Finnish

Linguality

Linguality type: Monolingual

Size

216 Hours

Effective speech duration

200 Hours

Audio duration

216 Hours

Modalities

Spoken Language, Voice

Annotation

Speech Annotation - Orthographic Transcription

StandOff: True

Segmentation level: Other, Topic, Utterance

Format: TextGrid (Praat), EAF (ELAN)

Annotation Mode: Manual

Annotation Tools:

Praat

Content

Speech items: Free Speech

Noise Level: Medium

Setting

Naturality: Natural

Conversational type: Multilogue

Audience: Few

Interactivity: Interactive

Audio Formats

audio/wav

Byte order: Little Endian

Compression: False

Quantization: 16

Number of tracks: 1

Sampling rate: 22050

Signal encoding: LinearPCM
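As a rough illustration of how the audio format fields above might be checked against a file, here is a minimal sketch using Python's standard `wave` module, which reads uncompressed PCM WAV data. The names `check_wav`, `synthetic_wav` and `EXPECTED` are hypothetical and not part of the corpus distribution; the synthetic file stands in for real corpus audio, which requires an access permission.

```python
import io
import wave

# Values copied from the "Audio Formats" fields of this record:
# 1 track, 16-bit quantization (2 bytes per sample), 22050 Hz sampling rate.
EXPECTED = {"channels": 1, "sample_width_bytes": 2, "sampling_rate_hz": 22050}


def check_wav(source):
    """Return (mismatches, duration_seconds) for a WAV path or binary file object.

    mismatches maps each differing field to a pair (expected, actual).
    """
    with wave.open(source, "rb") as wav:
        actual = {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sampling_rate_hz": wav.getframerate(),
        }
        duration = wav.getnframes() / wav.getframerate()
    mismatches = {
        key: (EXPECTED[key], actual[key])
        for key in EXPECTED
        if actual[key] != EXPECTED[key]
    }
    return mismatches, duration


def synthetic_wav(rate, seconds=1):
    """Build an in-memory silent mono 16-bit WAV for trying out the check."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * rate * seconds)
    buffer.seek(0)
    return buffer


mismatches, duration = check_wav(synthetic_wav(22050))
print(mismatches, duration)
```

An empty `mismatches` dictionary means the file agrees with the listed channel count, sample width and sampling rate; a file path can be passed in place of the in-memory buffer.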

Resource Creation

Resource Creator

Hanna Lappalainen

University of Helsinki

Heikki Paunonen

Metadata

Created: 02/03/2016

Last Updated: 02/26/2020

Metadata Creator

Mietta Lennes

Version

Version: 2

Last Updated: 02/25/2016

Relation

Related Resource: The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), Korp Version 2, http://urn.fi/urn:nb...

Relation Type: IsMetadataFor

Related Resource: The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), LAT Version 2, http://urn.fi/urn:nb...

Relation Type: IsMetadataFor

Documentation

Document Type: Manual

Helsingin puhekielen pitkittäiskorpuksen (1970, 1990, 2010) ohjeet, https://www.kielipan...

Editor: FIN-CLARIN

Document Language: Finnish

Document Type: Other

License (helpuhe), http://urn.fi/urn:nb...
