Suchergebnisse

Filtern nach

Letzte Suchanfragen

Ergebnisse für *

Es wurden 1 Ergebnisse gefunden.

Zeige Ergebnisse 1 bis 1 von 1.

Sortieren

Modelling large parallel corpora. The Zurich Parallel Corpus Collection

Autor*in: Graën, Johannes ; Kew, Tannon ; Shaitarova, Anastassia ; Volk, Martin

Erschienen: 2019

Verlag: Mannheim : Leibniz-Institut für Deutsche Sprache

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/9020 https://ids-pub.bsz-bw.de/files/9020/Graen_Kew_Shaitarova_Volk_Modelling_Large_Parallel_Corpora_2019.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-90207 https://doi.org/10.14618/ids-pub-9020

Text corpora come in many different shapes and sizes and carry heterogeneous annotations, depending on their purpose and design. The true benefit of corpora is rooted in their annotation and the method by which this data is encoded is an important factor in their interoperability. We have accumulated a large collection of multilingual and parallel corpora and encoded it in a unified format which is compatible with a broad range of NLP tools and corpus linguistic applications. In this paper, we present our corpus collection and describe a data model and the extensions to the popular CoNLL-U format that enable us to encode it.

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Konferenzveröffentlichung
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	Korpus
Lizenz:	creativecommons.org/licenses/by/4.0/deed.de ; info:eu-repo/semantics/openAccess

Filtern nach

Aktive Filter

Kategorien:

Bereich

Quelle

Format

Beteiligt

Medientyp

Sprache

Jahr

Letzte Suchanfragen

Ergebnisse für *

Modelling large parallel corpora. The Zurich Parallel Corpus Collection

Kontakt

Partner