Language Corpora Annotation and Processing -  Niladri Sekhar Dash

Language Corpora Annotation and Processing (eBook)

eBook Download: PDF
2021 | 1st ed. 2021
XXX, 272 Seiten
Springer Singapore (Verlag)
978-981-16-2960-0 (ISBN)
Systemvoraussetzungen
160,49 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen

This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.




Dr. Niladri Sekhar Dash is Professor and Head, Linguistic Research Unit, Indian Statistical Institute, Kolkata (The Institute of National Importance, Govt. of India). For the last 28 years, he is working in corpus linguistics, language technology, computational lexicography, computer-assisted language teaching, language documentation, translation, clinical linguistics, and digital ethnography. To his credit, he has published 18 research monographs and more than 285 research papers in indexed and peer-reviewed research journals, anthologies, and conference proceedings. As an invited speaker, he has delivered lectures at more than 50 universities and institutes in India and abroad. He acts as a Research Advisor for several multinational organizations that work on language technology, artificial intelligence, lexicography, digital humanities, and language resource development. He acts as Principal Investigator for several LangTech projects funded by the Govt. of India and corporate houses. He is the Chief Editor of the Journal of Advanced Linguistic Studies-a reviewed international journal of linguistics. He is an Editorial Board Member for several international journals. He is also a member of several linguistic associations across the world. He is a British Academy International Visiting Fellow (2018), Visiting Research Fellow of School of Psychology & Clinical Language Sciences, University of Reading, UK (2018-2021), and Visiting Scholar of Language and Brain Laboratory, University of Oxford, UK (2019). At present, he is heading 5 projects: (a) 'Upgradation of Bengali WordNet' funded by the Ministry of Statistics and Programme Implementation (MoSPI), Govt. of India; (b) 'Sound Imitative Words in Bengali' in collaboration with the Dept. of British and American Studies, Faculty of Arts, P.J. Šafárik University, Slovakia; (c) 'Bilingual Dementia of Patients with Broca's Aphasia' in collaboration with the School of Psychology and Clinical Language Sciences, University of Reading, UK; (d) 'Public Announcement System at Airports and Railway Stations in Indian Sign Language with Animation' in a consortium-mode project headed by the Dept. of Computer Science, Punjabi University, Patiala, India, and (e) 'Dictionary for Sabar Speech Community' - an endangered tribe of West Bengal, India.


This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.
Erscheint lt. Verlag 7.7.2021
Zusatzinfo XXX, 272 p. 47 illus., 2 illus. in color.
Sprache englisch
Themenwelt Geisteswissenschaften Sprach- / Literaturwissenschaft Sprachwissenschaft
Schlagworte Computational Linguistics • Corpora Annotation • Language Processing • lexical collocation • Morphological processing • Sentence Parsing
ISBN-10 981-16-2960-9 / 9811629609
ISBN-13 978-981-16-2960-0 / 9789811629600
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 10,3 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich