Datasheet
Title
Lietuvių literatūros ir tautosakos institutas, DISMARC metadata (2011)
Description
This dataset was created for data ingest into the DISMARC and Europeana platforms during the EuropeanaConnect project (2009-2011). The Institute of Lithuanian Literature and Folklore, a partner in EuropeanaConnect, carries out research on contemporary Lithuanian literature and folklore as well as the ancient literature of the Grand Duchy of Lithuania. The Institute's folklore archive is the oldest folklore repository in Lithuania and comprises over 1.9 million folklore items. The earliest sound recordings stored in the archive are phonograph recordings from 1908–1949. This dataset contains a selection of 7,346 metadata records.
Homepage
http://www.llti.lt/en/default_en.htm
Repository
https://www.dismarc.org/info/wp-content/uploads/2025/06/LLTI-1.zip
Publisher
Lietuvių literatūros ir tautosakos institutas
DISMARC Audio Aggregation Platform (https://www.dismarc.org/)
Point of Contact
Gerda Koch, https://orcid.org/0000-0002-1257-092X, kochg[@]ait.co.at, AIT Angewandte Informationstechnik Forschungsgesellschaft mbH, Metadata Coordinator for the DISMARC Audio Aggregation Platform
Supported Tasks and Shared Tasks
AI Category
Natural Language Processing – Text Mining
Type of Cultural Heritage Application
Collection search
(Cultural Heritage) Application Example
Distribution
Dataset Curators
Gerda Koch, AIT Angewandte Informationstechnik Forschungsgesellschaft mbH, Metadata Coordinator DISMARC Audio Aggregation Platform
Auste Nakiene, Department of Folklore Archives, senior researcher
Licensing Information
http://creativecommons.org/publicdomain/mark/1.0/
Citation Information
@misc{LLTI_dataset,
title = {Lietuvių literatūros ir tautosakos institutas, DISMARC metadata},
author = {DISMARC Audio Aggregation Platform},
howpublished = {\url{https://www.dismarc.org/info/wp-content/uploads/2025/06/LLTI-1.zip}},
year = {2011},
date = {2011-10-17},
note = {Data in XML format, provided by website https://www.dismarc.org/info/dismarc-data/}
}
Contributions
Auste Nakiene, Department of Folklore Archives, senior researcher
Ruta Zarskiene, Department of Folklore Archives, senior researcher
Composition
Data Category
Text
Object Type
Audio metadata, http://vocab.getty.edu/page/aat/300379475
Dataset Structure
Data Instances
<section name="dmOAP">
<dmOAP-Identifier>LTRF v 1 1</dmOAP-Identifier>
<dmOAP-Title>Plaukė žąselė per Nemunėlį</dmOAP-Title>
<dmOAP-Subject-genre>daina</dmOAP-Subject-genre>
<dmOAP-Contributor role="Performer">merginų choras</dmOAP-Contributor>
<dmOAP-Contributor role="Collector">E. Volteris Li W 20</dmOAP-Contributor>
<dmOAP-Date-dateRecorded>1908-09-09T00:00:00.000</dmOAP-Date-dateRecorded>
<dmOAP-Date-dateRecorded encoding="dmEras">dmEras:/XX amžius/XX a. pirmasis dešimtmetis
<thesaurus_link level="2" id="dmEras:50040001"/><thesaurus_link level="1" id="dmEras:50040000"/>
</dmOAP-Date-dateRecorded>
<dmOAP-Type>Sound</dmOAP-Type>
<dmOAP-Format encoding="dmFormats">dmFormats:/volelis<thesaurus_link level="1" id="dmFormats:80006000"/>
</dmOAP-Format>
<dmOAP-Coverage-spatial>Kaunas</dmOAP-Coverage-spatial>
<dmOAP-Rights-accessRights>http://creativecommons.org/publicdomain/mark/1.0/</dmOAP-Rights-accessRights>
</section>
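As an illustration of how such a record might be read, the following Python sketch parses an abridged copy of the instance above with the standard library. It assumes that records in the distributed file use straight ASCII quotes and the same element names as the sample; the function name parse_record and the keys of the returned dictionary are hypothetical and are not part of the dmOAP.

```python
import xml.etree.ElementTree as ET

# Abridged copy of the sample record, typed with straight ASCII quotes.
# Assumption: records in the distributed XML follow this layout.
RECORD = """<section name="dmOAP">
<dmOAP-Identifier>LTRF v 1 1</dmOAP-Identifier>
<dmOAP-Title>Plaukė žąselė per Nemunėlį</dmOAP-Title>
<dmOAP-Subject-genre>daina</dmOAP-Subject-genre>
<dmOAP-Contributor role="Performer">merginų choras</dmOAP-Contributor>
<dmOAP-Contributor role="Collector">E. Volteris Li W 20</dmOAP-Contributor>
<dmOAP-Date-dateRecorded>1908-09-09T00:00:00.000</dmOAP-Date-dateRecorded>
<dmOAP-Type>Sound</dmOAP-Type>
<dmOAP-Coverage-spatial>Kaunas</dmOAP-Coverage-spatial>
</section>"""


def parse_record(xml_text):
    """Return a flat dictionary for one dmOAP section (hypothetical helper)."""
    section = ET.fromstring(xml_text)

    def text_of(tag):
        # Text of the first child element with this tag, or None if absent.
        element = section.find(tag)
        if element is None or element.text is None:
            return None
        return element.text.strip()

    # Contributors are distinguished by their "role" attribute.
    contributors = {
        c.get("role"): (c.text or "").strip()
        for c in section.findall("dmOAP-Contributor")
    }
    return {
        "identifier": text_of("dmOAP-Identifier"),
        "title": text_of("dmOAP-Title"),
        "genre": text_of("dmOAP-Subject-genre"),
        "date_recorded": text_of("dmOAP-Date-dateRecorded"),
        "type": text_of("dmOAP-Type"),
        "place": text_of("dmOAP-Coverage-spatial"),
        "contributors": contributors,
    }


record = parse_record(RECORD)
print(record["title"], "|", record["contributors"]["Performer"])
```

A dictionary keyed by role works for this sample because each role occurs once; a record with several performers would need a list per role instead.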
Compliance with Standard(s)
The dataset conforms to the Dublin Core standard. It uses a Dublin Core application profile named the DISMARC Object Application Profile (dmOAP). A description of the individual dmOAP data fields can be found here: https://dismarc.ait.co.at/zebra/dismarclocal/DISMARCLOCALRegnetDescription.xml
Languages
English, en
Lithuanian, lt
Descriptive Statistics
7,346 metadata records
Data Collection Process
Curation Rationale
This dataset contains metadata descriptions of early sound recordings of Lithuanian folk music. The records were selected and integrated into an online virtual catalogue (the DISMARC Audio Aggregation Platform) in 2011 to make them searchable online for the first time.
Source Data
Initial Data Collection and Normalisation
The data originates from the audio archive catalogues of the Institute of Lithuanian Literature and Folklore. It was processed by the DISMARC Audio Aggregation Platform and then exposed in the virtual online catalogue at dismarc.org.
Source Data Producers
The source data has been produced by (human) researchers from the Institute of Lithuanian Literature and Folklore. Read more here: (zharskiene)
Annotations
Annotation Process
The data has been enriched with DISMARC vocabulary terms from the following vocabularies: dmEras, dmFormats (https://www.dismarc.org/vocabulary/dmFormats/)
Annotators
The enrichments with DISMARC vocabulary terms were machine generated. The DISMARC vocabularies were created by the DISMARC consortium members.
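In the sample record under Data Instances, an enriched element carries an encoding attribute, a vocabulary path as its text, and thesaurus_link children with a level and an id. The Python sketch below shows one way to read such an element. It assumes that level n refers to the n-th segment of the path, which is inferred from the single sample and is not confirmed by the dmOAP description; the function name read_enrichment is hypothetical.

```python
import xml.etree.ElementTree as ET

# Enriched element copied from the sample record, with straight ASCII quotes.
ENRICHED = """<dmOAP-Date-dateRecorded encoding="dmEras">dmEras:/XX amžius/XX a. pirmasis dešimtmetis
<thesaurus_link level="2" id="dmEras:50040001"/><thesaurus_link level="1" id="dmEras:50040000"/>
</dmOAP-Date-dateRecorded>"""


def read_enrichment(xml_text):
    """Pair each path segment with its vocabulary id (hypothetical helper).

    Assumption: thesaurus_link level n describes the n-th path segment.
    """
    element = ET.fromstring(xml_text)
    vocabulary = element.get("encoding") or ""
    # element.text is the text that precedes the first child element.
    path = (element.text or "").strip()
    prefix = vocabulary + ":/"
    labels = path[len(prefix):].split("/") if path.startswith(prefix) else []
    ids_by_level = {
        int(link.get("level")): link.get("id")
        for link in element.findall("thesaurus_link")
    }
    return [
        (level, label, ids_by_level.get(level))
        for level, label in enumerate(labels, start=1)
    ]


for level, label, term_id in read_enrichment(ENRICHED):
    print(level, label, term_id)
```

Segments without a matching thesaurus_link are returned with None as their id, so incomplete enrichments remain visible instead of being dropped.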
Data Provenance
Use of Linked Open Data, Controlled Vocabularies, Multilingual Ontologies/Taxonomies
dmFormats (https://www.dismarc.org/vocabulary/dmFormats/)
DISMARC Formats is a vocabulary of audio formats. It was established by the DISMARC consortium for use in the DISMARC Audio Aggregation Platform. The vocabulary is published via PURL.org.
Version Information
Release Date
02.06.2025
Update Periodicity
No update foreseen.
Maintenance
b) Limited Maintenance – The data will not be updated, but any technical issues will be addressed.
