Search results

Exploring newspaper language

using the web to create and investigate a large corpus of modern Norwegian

Author:

Published: 2012

Publisher: Benjamins, Amsterdam [etc.]

Berlin: Humboldt-Universität zu Berlin, Universitätsbibliothek, Jacob-und-Wilhelm-Grimm-Zentrum

Location:

Humboldt-Universität zu Berlin, Universitätsbibliothek, Jacob-und-Wilhelm-Grimm-Zentrum

Interlibrary loan:

unrestricted interlibrary loan, copy and lending

Link to union catalog:

Kooperativer Bibliotheksverbund Berlin-Brandenburg (KOBV)

Berlin: Staatsbibliothek zu Berlin - Preußischer Kulturbesitz, Haus Unter den Linden

Location:

Staatsbibliothek zu Berlin - Preußischer Kulturbesitz, Haus Unter den Linden

Interlibrary loan:

unrestricted interlibrary loan, copy and lending

Link to union catalog:

Gemeinsamer Bibliotheksverbund (GBV)

Source:	Union catalogs
Contributors:	Andersen, Gisle (editor)
Language:	English
Media type:	Book (monograph)
ISBN:	9789027203540
RVK classification:	GW 2288
Series:	Studies in corpus linguistics ; 49
Subject headings:	Norwegian; Corpus <linguistics>; Newspaper language
Extent:	VI, 356 pp., ill., diagrams
Note(s):	Includes bibliographical references

Exploring newspaper language

using the web to create and investigate a large corpus of modern Norwegian

Author:

Published: 2012

Publisher: Benjamins, Amsterdam [etc.]

Lüneburg: Leuphana Universität Lüneburg, Medien- und Informationszentrum, Universitätsbibliothek

Location:

Leuphana Universität Lüneburg, Medien- und Informationszentrum, Universitätsbibliothek

Interlibrary loan:

no interlibrary loan

Link to union catalog:

Gemeinsamer Bibliotheksverbund (GBV)

Möckern: Hochschulbibliothek Friedensau

Location:

Hochschulbibliothek Friedensau

Call number:

Online resource

Interlibrary loan:

no interlibrary loan

Link to union catalog:

Gemeinsamer Bibliotheksverbund (GBV)

This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus. This is a large monitor corpus of contemporary Norwegian language, compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools used for tasks such as text harvesting, annotation, topic classification and extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system whic

Source:	Union catalogs
Contributors:	Andersen, Gisle
Language:	English
Media type:	E-book
Format:	Online
ISBN:	1280497661; 9789027203540
Series:	Studies in corpus linguistics ; 49
Extent:	Online resource (VI, 356 pp.)
Note(s):	Description based upon print version of record
Exploring Newspaper Language; Editorial page; Title page; LCC data; Table of contents
Building a large corpus based on newspapers from the web; 1. Introduction; 2. An overview of the Norwegian Newspaper Corpus and its system architecture; 2.1 Text harvesting; 2.2 Boilerplate and duplicate removal; 2.3 Language classification; 2.4 Text annotation; 2.4.1 Annotation of source, date and author information; 2.4.2 Topic classification; 2.4.3 Part-of-speech tagging; 2.5 Search system and user interface; 2.5.1 Corpus WorkBench; 2.5.2 Corpuscle; 2.6 Extraction of new words; 2.7 Classification of new words; 2.7.1 Anglicism detection; 2.8 Frequency profiling and lexical database entry; 2.9 Identification of multiword expressions; 3. The content of the research contributions to this book; 4. Concluding remarks; References
Part II. Exploiting the web as a corpus - Methods and tools
Corpuscle - a new corpus management platform for annotated corpora; 1. Introduction; 2. Design principles; 3. Querying the corpus; 4. API and Web interface; 4.1 The API; 4.2 The Web interface; 5. Editing and manual annotation; 6. Evaluation and concluding remarks; References
OBT+stat; 1. Introduction; 2. Background; 2.1 The history of the Oslo-Bergen Tagger; 2.2 State of the art for Norwegian POS taggers; 3. The architecture of the Oslo-Bergen Constraint Grammar Tagger; 4. Methodology of improvements to the Oslo-Bergen Tagger; 5. Dealing with left-over ambiguities in the Oslo-Bergen Tagger; 5.1 Morphological ambiguities; 5.2 Lemma ambiguities; 6. Statistical disambiguation; 7. Modelling challenges and engineering concerns; 8. Evaluation of the statistical module; 8.1 How to evaluate; 8.2 Evaluation results; 9. Conclusion; References
Exploring corpora through syntactic annotation; 1. Introduction; 2. Treebanking; 3. INESS - the Norwegian treebanking infrastructure; 4. Searching for complex syntactic constructions in a treebank; 4.1 Passive constructions; 4.2 Relative clauses; 5. Conclusion; References
Collocations and statistical analysis of n-grams; 1. Introduction; 2. Background; 2.1 Multiword Expressions (MWEs); 2.2 Collocations; 3. Methodology; 3.1 Data and n-gram extraction; 3.2 Post-processing of n-gram lists; 3.3 Contingency tables; 3.3.1 Bigram Contingency Tables; 3.3.2 Trigram Contingency Tables; 3.4 Bigram Association Measures; 3.5 Trigram Association Measures; 4. Results; 4.1 Bigrams; 4.2 Trigrams; 5. Conclusion and Future Work; References
Automatic topic classification of a large newspaper corpus; 1. Introduction; 2. Background and related work; 2.1 The rule-based approach; 2.2 The pattern-matching approach; 2.3 Promising results; 3. Material; 3.1 Manual annotation; 3.2 Feature extraction; 3.3 Cleaning the text; 3.4 The gold standard; 4. Overview of our final approach; 5. Our approach in detail; 5.1 Hypothesis; 5.2 Defining categories; 5.3 Tools; 5.4 Programming and experimenting; 6. Data and experimental evaluation
Electronic reproduction; Available via World Wide Web

no interlibrary loan

Link to union catalog:

Südwestdeutscher Bibliotheksverbund (SWB)

Source:	Leibniz-Institut für Deutsche Sprache, Bibliothek
Contributors:	Andersen, Gisle (ed.)
Language:	English
Media type:	Book (monograph)
Format:	Print
ISBN:	9789027203540
RVK classification:	GW 2288
Series:	Studies in corpus linguistics ; 49
Subject headings:	Norwegian language (Nynorsk); Newspapers; Mass media; Information technology
Extent:	VI, 356 pp., diagrams, 25 cm
Note(s):	Includes bibliographical references and index

Exploring newspaper language

using the web to create and investigate a large corpus of modern Norwegian

Author:

Published: 2012

Publisher: Benjamins, Amsterdam [etc.]

Frankfurt/Main: Hessisches BibliotheksInformationsSystem HeBIS

Location:

Hessisches BibliotheksInformationsSystem HeBIS

Interlibrary loan:

no interlibrary loan

Link to union catalog:

Hessisches BibliotheksInformationsSystem (HeBIS)

Frankfurt/Main: Universitätsbibliothek J. C. Senckenberg, Zentralbibliothek (ZB)

Location:

Universitätsbibliothek J. C. Senckenberg, Zentralbibliothek (ZB)

Call number:

89.773.89

Interlibrary loan:

unrestricted interlibrary loan, copy and lending

Link to union catalog:

Hessisches BibliotheksInformationsSystem (HeBIS)

Marburg/Lahn: Universität Marburg, Universitätsbibliothek

Location:

Universität Marburg, Universitätsbibliothek

Call number:

001 GW 2288 A544

Interlibrary loan:

unrestricted interlibrary loan, copy and lending

Link to union catalog:

Hessisches BibliotheksInformationsSystem (HeBIS)

Source:	Union catalogs
Contributors:	Andersen, Gisle (ed.); Hofland, Knut; Meurer, Paul; Nøklestad, Anders; Rosén, Victoria; Lyse Samdal, Gunn Inger; Hagen, Thomas M.; Smørdal Losnegaard, Gyri; Dyvik, Helge; Fjeld, Ruth Vatvedt; Nygaard, Lars; De Smedt, Koenraad; Kristiansen, Marita; Halverson, Sandra; Breivik, Leiv Egil; Swan, Toril; Andersen, Øivin
Language:	English
Media type:	Book (monograph)
Format:	Print
ISBN:	9789027203540; 9027203547
RVK classification:	GW 2288
Series:	Studies in corpus linguistics ; 49
Subject headings:	Norwegian; Newspaper language; Corpus <linguistics>
Extent:	VI, 356 pp., ill., diagrams, 25x16 cm
Note(s):	Includes bibliographical references

Exploring newspaper language

Berlin: Staatsbibliothek zu Berlin - Preußischer Kulturbesitz, Haus Potsdamer Straße

Freiburg/Breisgau: Universitätsbibliothek Freiburg

Hamburg: Staats- und Universitätsbibliothek Hamburg Carl von Ossietzky

Hannover: Technische Informationsbibliothek (TIB) / Leibniz-Informationszentrum Technik und Naturwissenschaften und Universitätsbibliothek

Kiel: Universitätsbibliothek Kiel, Zentralbibliothek

Mannheim: Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek
