Institute of Lithuanian Literature and Folklore

Datasheet

Title

Lietuvių literatūros ir tautosakos institutas, DISMARC metadata (2011)

Description

This dataset was created for data ingest to the DISMARC and the Europeana platform during the EuropeanaConnect (2009-2011) project. The Institute of Lithuanian Literature and Folklore, which was a partner in EuropeanaConnect, is carrying out research of contemporary Lithuanian literature and folklore as well as ancient literature of the Great Duchy of Lithuania. The folklore archives of the Institute is the oldest repository of folklore in Lithuania comprising over 1.9 million of folklore items. The earliest sound records stored in the archives are phonograph records from 1908–1949. This dataset contains a selected part of 7,346 metadata records.

Homepage

http://www.llti.lt/en/default_en.htm

Repository

https://www.dismarc.org/info/wp-content/uploads/2025/06/LLTI-1.zip

Publisher

Lietuvių literatūros ir tautosakos institutas

DISMARC Audio Aggregation Platform (https://www.dismarc.org/)

Point of Contact

Gerda Koch, https://orcid.org/0000-0002-1257-092X, kochg[@]ait.co.at, AIT Angewandte Informationstechnik Forschungsgesellschaft mbH, Metadata Coordinator for the DISMARC Audio Aggregation Platform

Supported Tasks and Shared Tasks

AI Category

Natural Language Processing – Text Mining

Type of Cultural Heritage Application

Collection search

(Cultural Heritage) Application Example

LLTI in DISMARC.org

Distribution

Dataset Curators

Gerda Koch, AIT Angewandte Informationstechnik Forschungsgesellschaft mbH, Metadata Coordinator DISMARC Audio Aggregation Platform

Auste Nakiene, Department of Folklore Archives, senior researcher

Licensing Information

http://creativecommons.org/publicdomain/mark/1.0/

Citation Information

@misc{LLTI dataset,
title = {Lietuvių literatūros ir tautosakos institutas, DISMARC metadata},
author = {DISMARC Audio Aggregation Platform},
howpublished = {\url{https://www.dismarc.org/info/wp-content/uploads/2025/06/LLTI-1.zip}},
year = {2011},
date = {2011-10-17},
note = {Data in XML format, provided by website https://www.dismarc.org/info/dismarc-data/}
}

Contributions

Auste Nakiene, Department of Folklore Archives, senior researcher
Ruta Zarskiene, Department of Folklore Archives, senior researcher

Composition

Data Category

Text

Object Type

Audio metadata, http://vocab.getty.edu/page/aat/300379475

Dataset Structure

Data Instances

<section name=”dmOAP”>
<dmOAP-Identifier>LTRF v 1 1</dmOAP-Identifier>
<dmOAP-Title>Plaukė žąselė per Nemunėlį</dmOAP-Title>
<dmOAP-Subject-genre>daina</dmOAP-Subject-genre>
<dmOAP-Contributor role=”Performer”>merginų choras</dmOAP-Contributor>
<dmOAP-Contributor role=”Collector”>E. Volteris Li W 20</dmOAP-Contributor>
<dmOAP-Date-dateRecorded>1908-09-09T00:00:00.000</dmOAP-Date-dateRecorded>
<dmOAP-Date-dateRecorded encoding=”dmEras”>dmEras:/XX amžius/XX a. pirmasis dešimtmetis
<thesaurus_link level=”2″ id=”dmEras:50040001″/><thesaurus_link level=”1″ id=”dmEras:50040000″/>
</dmOAP-Date-dateRecorded><dmOAP-Type>Sound</dmOAP-Type>
<dmOAP-Format encoding=”dmFormats”>dmFormats:/volelis<thesaurus_link level=”1″ id=”dmFormats:80006000″/>
</dmOAP-Format>
<dmOAP-Coverage-spatial>Kaunas</dmOAP-Coverage-spatial>
<dmOAP-Rights-accessRights>http://creativecommons.org/publicdomain/mark/1.0/</dmOAP-Rights-accessRights>
</section>

Compliance with Standard(s)

The dataset conforms to the Dublin Core Standard. A Dublin Core Application Format has been used. It is named: DISMARC Object Application Profile (dmOAP). The description of the individual data fields of the dmOAP can be found here: https://dismarc.ait.co.at/zebra/dismarclocal/DISMARCLOCALRegnetDescription.xml

Languages

English, en
Lithuanian, lt

Descriptive Statistics

7,346 metadata records

Data Collection Process

Curation Rationale

This dataset contains metadata descriptions of early sound recordings of Lithuanian folk music that were selected and integrated into an online accessible virtual catalogue in 2011 in order to make them online searchable for the first time (DISMARC Audio Aggregation Platform).

Source Data

Initial Data Collection and Normalisation

The data stems from the audio archive catalogues of the Institute of Lithuanian Literature and Folklore. This data has been processed by the Audio Aggregation Platform of DISMARC, and was then exposed in the virtual online catalogue of dismarc.org.

Source Data Producers

The source data has been produced by (human) researchers from the Institute of Lithuanian Literature and Folklore. Read more here: (zharskiene)

Annotations

Annotation Process

The data has been enriched with DISMARC vocabulary terms from the following vocabularies: dmEras, dmFormats (https://www.dismarc.org/vocabulary/dmFormats/)

Annotators

The enrichments with DISMARC vocabulary terms were produced machine generated. The DISMARC vocabularies have been created by the DISMARC consortium members.

Data Provenance

Use of Linked Open Data, Controlled Vocabularies, Multilingual Ontologies/Taxonomies

dmFormats (https://www.dismarc.org/vocabulary/dmFormats/)

DISMARC Formats is a vocabulary of audio formats, it was established by the DISMARC consortium to be used in the DISMARC Audio Aggregation Platform. The vocabulary is published with PURL.org

Version Information

Release Date

02.06.2025

Update Periodicity

No update foreseen.

Maintenance

b) Limited Maintenance – The data will not be updated, but any technical issues will be addressed.