Suchergebnisse

Multimedia Corpora (Media encoding and annotation) : Draft submitted to CLARIN WG 5.7. as input to CLARIN deliverable D5.C3 “Interoperability and Standards”

Autor*in: Schmidt, Thomas ; Elenius, Kjell ; Trilsbeek, Paul

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2234 https://ids-pub.bsz-bw.de/files/2234/Schmidt_Multimedia%20corpora_2010.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22341

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	gesprochene Sprache; Korpus; Notation; Standardisierung; Computerlinguistik; Multimedia
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

EXMARaLDA - ein Modellierungs- und Visualisierungsverfahren für die computergestützte Transkription gesprochener Sprache

Autor*in: Schmidt, Thomas

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1126125911/34 http://www1.uni-hamburg.de/exmaralda/files/Konvens_Paper.pdf https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2366
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-23660

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Lizenz:	kostenfrei

Korpus "Skandinavische Semikommunikation" - ein mehrsprachiges Diskurskorpus auf XML-Basis

Autor*in: Schmidt, Thomas ; Seewald-Heeg, Uta ; Gesellschaft für Linguistische Datenverarbeitung

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1126125954/34 http://www.jlcl.org/2003_Doppelheft/421-427_Schmidt.pdf https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2371
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-23718

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Lizenz:	kostenfrei

Erstellen und Analysieren von Gesprächskorpora mit EXMARaLDA

Autor*in: Schmidt, Thomas ; Wörner, Kai

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1126126918/34 http://www.gespraechsforschung-ozs.de/heft2005/heft2005.html https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2423
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-24233

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Schlagworte:	Software; Datenarchivierung
Lizenz:	kostenfrei

Das Kicktionary : Beziehungen im Wortschatz am Beispiel der Fußballsprache

Autor*in: Schmidt, Thomas

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1126124753/34 https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2302
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-23025

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Schlagworte:	Fußballsprache; Wörterbuch
Lizenz:	kostenfrei

Transkriptionskonventionen für die computergestützte gesprächsanalytische Transkription

Autor*in: Schmidt, Thomas

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1126123412/34 http://www.gespraechsforschung-ozs.de/heft2007/heft2007.html https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2236
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22366

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Lizenz:	kostenfrei

EXMARaLDA - ein System zur computergestützten Diskurstranskription

Autor*in: Schmidt, Thomas ; Mehler, Alexander

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1126124036/34 https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2255
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22557

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Lizenz:	kostenfrei

Handbuch für das computergestützte Transkribieren nach HIAT

Autor*in: Rehbein, Jochen ; Schmidt, Thomas ; Meyer, Bernd ; Watzke, Franziska ; Herkenrath, Annette

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1134956193/34 https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2368
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-23681

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Schlagworte:	automatische Annotation
Lizenz:	kostenfrei

POS für(s) FOLK – Part of Speech Tagging des Forschungs- und Lehrkorpus Gesprochenes Deutsch

Autor*in: Westpfahl, Swantje ; Schmidt, Thomas

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1135918627/34 http://www.jlcl.org/index.php?modus=ausgaben&language=en https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2223
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22233

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Schlagworte:	Forschungs- und Lehrkorpus Gesprochenes Deutsch = FOLK; Part-of-Speech-Tagging = POS; Datenbank für gesprochenes Deutsch = DGD; Korpuslinguistik
Lizenz:	kostenfrei

The research and teaching corpus of spoken German – FOLK

Autor*in: Schmidt, Thomas

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1135918678/34 http://www.lrec-conf.org/proceedings/lrec2014/summaries/290.html https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2443
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-24434

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Schlagworte:	Forschungs- und Lehrkorpus Gesprochenes Deutsch = FOLK
Lizenz:	kostenfrei

Mündliche Korpora am IDS: vom deutschen Spracharchiv zur Datenbank für gesprochenes Deutsch

Autor*in: Stift, Ulf-Michael ; Schmidt, Thomas

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/1135918708/34 https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2477
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-24779

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Schlagworte:	Institut für Deutsche Sprache <Mannheim>
Lizenz:	kostenfrei

Gesprächsdatenbanken als methodisches Instrument der Interaktionalen Linguistik - Eine exemplarische Untersuchung auf Basis des Korpus FOLK in der Datenbank für Gesprochenes Deutsch (DGD2)

Autor*in: Deppermann, Arnulf ; Schmidt, Thomas ; Gansel, Christa ; Domke, Christine

Erschienen: 2014

Bibliographische Angaben
Zugang

Volltext:	https://d-nb.info/113695872X/34 http://www.v-r.de/de/magazine_edition-1-1/mitteilungen_des_deutschen_germanistenverbandes_2014_61_1-1010298/ https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2222
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22229

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Unbestimmt
Format:	Online
DDC Klassifikation:	Germanische Sprachen; Deutsch (430)
Lizenz:	kostenfrei

A TEI-based approach to standardising spoken language transcription

Autor*in: Schmidt, Thomas

Erschienen: 2014

This paper formulates a proposal for standardising spoken language transcription, as practised in conversation analysis, sociolinguistics, dialectology and related fields, with the help of the TEI guidelines. Two areas relevant to standardisation are... mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2225 https://ids-pub.bsz-bw.de/files/2225/Schmidt-a-tei-based-approach-to-standardising-spoken-language-transcription_2011.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22256

This paper formulates a proposal for standardising spoken language transcription, as practised in conversation analysis, sociolinguistics, dialectology and related fields, with the help of the TEI guidelines. Two areas relevant to standardisation are identified and discussed: first, the macro structure of transcriptions, as embodied in the data models and file formats of transcription tools such as ELAN, Praat or EXMARaLDA; second, the micro structure of transcriptions as embodied in transcription conventions such as CA, HIAT or GAT. A two-step process is described in which first the macro structure is represented in a generic TEI format based on elements defined in the P5 version of the Guidelines. In the second step, character data in this representation is parsed according to the regularities of a transcription convention resulting in a more fine-grained TEI markup which is also based on P5. It is argued that this two step process can, on the one hand, map idiosyncratic differences in tool formats and transcription conventions onto a unified representation. On the other hand, differences motivated by different theoretical decisions can be retained in a manner which still allows a common processing of data from different sources. In order to make the standard usable in practice, a conversion tool—TEI Drop—is presented which uses XSL transformations to carry out the conversion between different tool formats (CHAT, ELAN, EXMARaLDA, FOLKER and Transcriber) and the TEI representation of transcription macro structure (and vice versa) and which also provides methods for parsing the micro structure of transcriptions according to two different transcription conventions (HIAT and cGAT). Using this tool, transcribers can continue to work with software they are familiar with while still producing TEI-conformant transcription files. The paper concludes with a discussion of the work needed in order to establish the proposed standard. It is argued that both tool formats and the TEI guidelines are in a sufficiently mature ...

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	gesprochene Sprache; Transkription; Standardisierung
Lizenz:	creativecommons.org/licenses/by-nd/3.0/de/ ; info:eu-repo/semantics/openAccess

New and future developments in EXMARaLDA

Autor*in: Schmidt, Thomas ; Wörner, Kai ; Hedeland, Hanna ; Lehmberg, Timm

Erschienen: 2014

Verlag: Hamburg : Universität

We present some recent and planned future developments in EXMARaLDA, a system for creating, managing, analysing and publishing spoken language corpora. The new functionality concerns the areas of transcription and annotation, corpus management, query... mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2228 https://ids-pub.bsz-bw.de/files/2228/Schmidt_New%20and%20future%20developments%20in%20EXMARaLDA_2011.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22288

We present some recent and planned future developments in EXMARaLDA, a system for creating, managing, analysing and publishing spoken language corpora. The new functionality concerns the areas of transcription and annotation, corpus management, query mechanisms, interoperability and corpus deployment. Future work is planned in the areas of automatic annotation, standardisation and workflow management.

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	gesprochene Sprache; Korpus
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

Multilingual Corpora at the Hamburg Centre for Language Corpora

Autor*in: Hedeland, Hanna ; Lehmberg, Timm ; Schmidt, Thomas ; Wörner, Kai

Erschienen: 2014

We give an overview of the content and the technical background of a number of corpora which were developed in various projects of the Research Centre on Multilingualism (SFB 538) between 1999 and 2011 and which are now made available to the... mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2230 https://ids-pub.bsz-bw.de/files/2230/Schmidt_Multilingual%20Corpora_2011.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22305

We give an overview of the content and the technical background of a number of corpora which were developed in various projects of the Research Centre on Multilingualism (SFB 538) between 1999 and 2011 and which are now made available to the scientific community via the Hamburg Centre for Language Corpora.

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	Mehrsprachigkeit; Korpus; gesprochene Sprache
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

Linguistic tool development between community practices and technology standards

Autor*in: Schmidt, Thomas

Erschienen: 2014

This contribution addresses the workshop topic of “standardising policies within eHumanities infrastructures”. It relates 10 years of experience with language resource standards, gained in the development of EXMARaLDA, a system for the construction... mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2231 https://ids-pub.bsz-bw.de/files/2231/Schmidt_Linguistic%20tool%20development_2010.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22314

This contribution addresses the workshop topic of “standardising policies within eHumanities infrastructures”. It relates 10 years of experience with language resource standards, gained in the development of EXMARaLDA, a system for the construction and exploitation of spoken language corpora. Section 2 gives an overview of the EXMARaLDA system focussing on its relationship with existing and evolving standards for language resources. Section 3 presents the HIAT system as an example of an established community practice. Section 4 then addresses several issues that where encountered when trying to bring together HIAT, EXMARaLDA and the wider standard world.

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	gesprochene Sprache; Korpus; Transkription; Computerlinguistik; Standardisierung
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

FOLKER : an annotation tool for efficient transcription of natural, multi-party interaction

Autor*in: Schmidt, Thomas ; Schütte, Wilfried

Erschienen: 2014

Verlag: Valletta, Malta : European Language Resources Association (ELRA)

This paper presents FOLKER, an annotation tool developed for the efficient transcription of natural, multi-party interaction in a conversation analysis framework. FOLKER is being developed at the Institute for German Language in and for the FOLK... mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2232 https://ids-pub.bsz-bw.de/files/2232/Schmidt_Schuette_FOLKER_2010_Paper.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22323

This paper presents FOLKER, an annotation tool developed for the efficient transcription of natural, multi-party interaction in a conversation analysis framework. FOLKER is being developed at the Institute for German Language in and for the FOLK project, whose aim is the construction of a large corpus of spoken present-day German, to be used for research and teaching purposes. FOLKER builds on the experience gained with multi-purpose annotation tools like ELAN and EXMARaLDA, but attempts to improve transcription efficiency by restricting and optimizing both data model and tool functionality to a single, well-defined purpose. This paper starts with a description of the GAT transcription conventions and the data model underlying the tool. It then gives an overview of the tool functionality and compares this functionality to that of other widely used tools.

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Konferenzveröffentlichung
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	gesprochene Sprache; Korpus; Transkription; Computerlinguistik
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

Korpora gesprochener Sprache im Netz – eine Umschau

Autor*in: Merkel, Silke ; Schmidt, Thomas

Erschienen: 2014

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2235 https://ids-pub.bsz-bw.de/files/2235/Merkel_Schmidt_Korpora%20gesprochener%20Sprache%20im%20Netz_2009.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22353

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Linguistik (410)
Schlagworte:	gesprochene Sprache; Korpus
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

Transkriptionskonventionen für die computergestützte gesprächsanalytische Transkription

Autor*in: Schmidt, Thomas

Erschienen: 2014

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2236 https://ids-pub.bsz-bw.de/files/2236/Schmidt_Transkriptionskonventionen_2007.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22366

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Standardsprache; Angewandte Linguistik (418)
Schlagworte:	gesprochene Sprache; Gesprächsanalyse; Transkription; Korpus; Computerlinguistik
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

EXMARaLDA : un système pour la constitution et l’exploitation de corpus oraux

Autor*in: Schmidt, Thomas

Erschienen: 2014

Verlag: Limoges : Lambert-Lucas

Bibliographische Angaben
Zugang

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2237 https://ids-pub.bsz-bw.de/files/2237/Schmidt_EXMARalDA_2010.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22378

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Französisch
Medientyp:	Konferenzveröffentlichung
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	gesprochene Sprache; Transkription; Computerlinguistik; Standardisierung; Korpus
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

Comparison of multimodal annotation tools

Autor*in: Rohlfing, Katharina ; Loehr, Daniel ; Duncan, Susan ; Brown, Amanda ; Franklin, Amy ; Kimbara, Irene ; Milde, Jan-Torsten ; Parrill, Fey ; Rose, Travis ; Schmidt, Thomas ; Sloetjes, Han ; Thies, Alexandra ; Wellinghoff, Sandra

Erschienen: 2014

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2245 https://ids-pub.bsz-bw.de/files/2245/Schmidt_Comparision_2006.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22450

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Linguistik (410)
Schlagworte:	Korpus; Gesprächsanalyse; Computerlinguistik
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

Creating and working with spoken language corpora in EXMARaLDA

Autor*in: Schmidt, Thomas

Erschienen: 2014

Verlag: Bozen : Europ. Akad.

Spoken language corpora— as used in conversation analytic research, language acquisition studies and dialectology— pose a number of challenges that are rarely addressed by corpus linguistic methodology and technology. This paper starts by giving an... mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2254 https://ids-pub.bsz-bw.de/files/2254/Schmidt_Creating_and_Working_2009.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22548

Spoken language corpora— as used in conversation analytic research, language acquisition studies and dialectology— pose a number of challenges that are rarely addressed by corpus linguistic methodology and technology. This paper starts by giving an overview of the most important methodological issues distinguishing spoken language corpus workfrom the work with written data. It then shows what technological challenges these methodological issues entail and demonstrates how they are dealt with in the architecture and tools of the EXMARaLDA system.

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Konferenzveröffentlichung
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	gesprochene Sprache; Korpus; Computerlinguistik; geschriebene Sprache
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

EXMARaLDA - ein System zur computergestützten Diskurstranskription

Autor*in: Schmidt, Thomas

Erschienen: 2014

Verlag: Wiesbaden : VS, Verlag für Sozialwissenschaften

Der Aufsatz beschreibt EXMARaLDA, ein XML-basiertes System zur computergestutzten Diskurstranskription, das am Sonderforschungsbereich „Mehrsprachigkeit“ an der Universität Hamburg entwickelt wurde. mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2255 https://ids-pub.bsz-bw.de/files/2255/Schmidt_EXMARaLDA_%202004.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22557

Der Aufsatz beschreibt EXMARaLDA, ein XML-basiertes System zur computergestutzten Diskurstranskription, das am Sonderforschungsbereich „Mehrsprachigkeit“ an der Universität Hamburg entwickelt wurde.

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Deutsch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	gesprochene Sprache; Computerlinguistik; Transkription; Korpus; Mehrsprachigkeit
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

EXMARaLDA - Creating, Analysing and Sharing Spoken Language Corpora for Pragmatic Research

Autor*in: Wörner, Kai ; Schmidt, Thomas

Erschienen: 2014

This paper presents EXMARaLDA, a system for the computer-assisted creation and analysis of spoken language corpora. The first part contains some general observations about technological and methodological requirements for doing corpus-based... mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2256 https://ids-pub.bsz-bw.de/files/2256/Schmidt_EXMARaLDA%20-%20creating_2009.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22568

This paper presents EXMARaLDA, a system for the computer-assisted creation and analysis of spoken language corpora. The first part contains some general observations about technological and methodological requirements for doing corpus-based pragmatics. The second part explains the systems architecture and gives an overview of its most important software components a transcription editor, a corpus management tool and a corpus query tool. The last part presents some corpora which have been or are currently being compiled with the help of EXMARaLDA.

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Linguistik (410)
Schlagworte:	gesprochene Sprache; Computerlinguistik; Korpus; Transkription; Gesprächsanalyse
Lizenz:	rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

Refining and Exploiting the Structural Markup of the eWDG

Autor*in: Schmidt, Thomas ; Geyken, Alexander ; Storrer, Angelika

Erschienen: 2014

Verlag: Barcelona : Institut Universitari de Linguistica Aplicada, Universitat Pompeu Fabra:

In this paper, the authors describe a semi-automated approach to refine the dictionary-entry structure of the digital version of the Wörterbuch der deutschen Gegenwartssprache (WDG, en.: Dictionary of Present-day German), a dictionary compiled and... mehr

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2258 https://ids-pub.bsz-bw.de/files/2258/Schmidt_Geyken_Storrer_Refining_and_Exploiting_the_Structural_Markup_2008.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-22582

In this paper, the authors describe a semi-automated approach to refine the dictionary-entry structure of the digital version of the Wörterbuch der deutschen Gegenwartssprache (WDG, en.: Dictionary of Present-day German), a dictionary compiled and published between 1952 and 1977 by the Deutsche Akademie der Wissenschaften that comprises six volumes with over 4,500 pages containing more than 120,000 headwords. We discuss the benefits of such a refinement in the context of the dictionary project Digitales Wörterbuch der deutschen Sprache (DWDS, en: Digital Dictionary of the German language). In the current phase of the DWDS project, we aim to integrate multiple dictionary and corpus resources in German language into a digital lexical system (DLS). In this context, we plan to expand the current DWDS interface with several special purpose components, which are adaptive in the sense that they offer specialized data views and search mechanisms for different dictionary functions-e.g. text comprehension, text production-and different user groups-e.g. journalists, translators, linguistic researchers, computational linguists. One prerequisite for generating such data views is the selective access to the lexical items in the article structure of the dictionaries which are the object of study. For this purpose, the representation of the eWDG has to be refined. The focus of this paper is on the semiautomated approach used to transform eWDG into a refined version in which the main structural units can be explicitly accessed. We will show how this refinement opens new and flexible ways of visualizing and querying the lexicographic content of the refined version in the context of the DLS project.

Export in Literaturverwaltung

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Konferenzveröffentlichung
Format:	Online
DDC Klassifikation:	Wörterbücher (413)
Schlagworte:	Computerunterstützte Lexikographie
Lizenz:	creativecommons.org/licenses/by-nc-sa/3.0/ ; info:eu-repo/semantics/openAccess

Filtern nach

Aktive Filter

Kategorien:

Bereich

Quelle

Format

Beteiligt

Medientyp

Sprache

Jahr

Letzte Suchanfragen

Ergebnisse für *

Multimedia Corpora (Media encoding and annotation) : Draft submitted to CLARIN WG 5.7. as input to CLARIN deliverable D5.C3 “Interoperability and Standards”

EXMARaLDA - ein Modellierungs- und Visualisierungsverfahren für die computergestützte Transkription gesprochener Sprache

Korpus "Skandinavische Semikommunikation" - ein mehrsprachiges Diskurskorpus auf XML-Basis

Erstellen und Analysieren von Gesprächskorpora mit EXMARaLDA

Das Kicktionary : Beziehungen im Wortschatz am Beispiel der Fußballsprache

Transkriptionskonventionen für die computergestützte gesprächsanalytische Transkription

EXMARaLDA - ein System zur computergestützten Diskurstranskription

Handbuch für das computergestützte Transkribieren nach HIAT

POS für(s) FOLK – Part of Speech Tagging des Forschungs- und Lehrkorpus Gesprochenes Deutsch

The research and teaching corpus of spoken German – FOLK

Mündliche Korpora am IDS: vom deutschen Spracharchiv zur Datenbank für gesprochenes Deutsch

Gesprächsdatenbanken als methodisches Instrument der Interaktionalen Linguistik - Eine exemplarische Untersuchung auf Basis des Korpus FOLK in der Datenbank für Gesprochenes Deutsch (DGD2)

A TEI-based approach to standardising spoken language transcription

New and future developments in EXMARaLDA

Multilingual Corpora at the Hamburg Centre for Language Corpora

Linguistic tool development between community practices and technology standards

FOLKER : an annotation tool for efficient transcription of natural, multi-party interaction

Korpora gesprochener Sprache im Netz – eine Umschau

Transkriptionskonventionen für die computergestützte gesprächsanalytische Transkription

EXMARaLDA : un système pour la constitution et l’exploitation de corpus oraux

Comparison of multimodal annotation tools

Creating and working with spoken language corpora in EXMARaLDA

EXMARaLDA - ein System zur computergestützten Diskurstranskription

EXMARaLDA - Creating, Analysing and Sharing Spoken Language Corpora for Pragmatic Research

Refining and Exploiting the Structural Markup of the eWDG

Kontakt

Partner