About corpora and other material 

The linguistic corpora are the Institute’s way of passing on our linguistic heritage to future generations.

Links to electronic corpora and other material 

Kotus has published parts of its language resources on electronic platforms.

The Tape Archive 

The old colloquial dialects were expected to disappear soon, so linguists felt the need to record them.

The Names Archive 

The Names Archive (Nimiarkisto) contains a corpus of material on Finnish and Saami names, serving as a tool for research and name planning.

Lexical corpora 

Lexical corpora on Finnish dialects, Old Literary Finnish, Karelian and Finnish slang.

Written language corpora 

Written Finnish ranging from the 16th to the 21st century.

The Institute has comprehensive collections compiled over the course of more than 100 years. They include corpora of names, words, and spoken and written language. In addition to printed matter, the collections contain audio material.

The Institute’s archives are an important part of the cultural heritage of Finland. Some of the material is available in electronic format, and the Institute’s digitisation efforts are designed to increase the availability of the corpora and other material. 
