Syntactic Wordclass Tagging

Buch | Softcover

334 Seiten

2010 | Softcover reprint of hardcover 1st ed. 1999
Springer (Verlag)
978-90-481-5296-4 (ISBN)

Artikel merken

In both the linguistic and the language engineering community, the creation and use of annotated text collections (or annotated corpora) is currently a hot topic. Unfortunately, the annotation of text material, especially more interesting linguistic annotation, is as yet a difficult task and can entail a substan tial amount of human involvement.

In both the linguistic and the language engineering community, the creation and use of annotated text collections (or annotated corpora) is currently a hot topic. Annotated texts are of interest for research as well as for the development of natural language pro cessing (NLP) applications. Unfortunately, the annotation of text material, especially more interesting linguistic annotation, is as yet a difficult task and can entail a substan tial amount of human involvement. Allover the world, work is being done to replace as much as possible of this human effort by computer processing. At the frontier of what can already be done (mostly) automatically we find syntactic wordclass tagging, the annotation of the individual words in a text with an indication of their morpho syntactic classification. This book describes the state of the art in syntactic wordclass tagging. As an attempt to give an overall view of the field, this book is of interest to (at least) two, possibly very different, types of reader. The first type consists of those people who are using, or are planning to use, tagged material and taggers. They will want to know what the possibilities and impossibilities of tagging are, but are not necessarily interested in the internal working of automatic taggers. This, on the other hand, is the main interest of our second type of reader, the builders of automatic taggers and other natural language processing software.

I The User’s View.- 1 Orientation.- 2 A Short History of Tagging.- 3 The Use of Tagging.- 4 Tagsets.- 5 Standards for Tagsets.- 6 Performance of Taggers.- 7 Selection and Operation of Taggers.- II The Implementer’s View.- 8 Automatic Taggers: An Introduction.- 9 Tokenization.- 10 Lexicons for Tagging.- 11 Standardization in the Lexicon.- 12 Morphological Analysis.- 13 Tagging Unknown Words.- 14 Hand-Crafted Rules.- 15 Corpus-Based Rules.- 16 Hidden Markov Models.- 17 Machine Learning Approaches.- Appendix A: Example tagsets.- A.1 The Brown Corpus tagset.- A.2 The Penn Treebanktagset.- A.3 The EngCG tagset.- References.

Reihe/Serie	Text, Speech and Language Technology ; 9
Zusatzinfo	XVII, 334 p.
Verlagsort	Dordrecht
Sprache	englisch
Maße	160 x 240 mm
Themenwelt	Geisteswissenschaften ► Sprach- / Literaturwissenschaft ► Sprachwissenschaft
	Informatik ► Datenbanken ► Data Warehouse / Data Mining
	Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik
	Mathematik / Informatik ► Mathematik
ISBN-10	90-481-5296-8 / 9048152968
ISBN-13	978-90-481-5296-4 / 9789048152964
Zustand	Neuware