Automatic Language Identification in Texts (eBook)

eBook Download: PDF
2024 | 1st ed. 2024
XIV, 148 Seiten
Springer International Publishing (Verlag)
978-3-031-45822-4 (ISBN)

Lese- und Medienproben

Automatic Language Identification in Texts - Tommi Jauhiainen, Marcos Zampieri, Timothy Baldwin, Krister Lindén
Systemvoraussetzungen
42,79 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
This book provides readers with a brief account of the history of Language Identification (LI) research and a survey of the features and methods most used in LI literature. LI is the problem of determining the language in which a document is written and is a crucial part of many text processing pipelines. The authors use a unified notation to clarify the relationships between common LI methods. The book introduces LI performance evaluation methods and takes a detailed look at LI-related shared tasks. The authors identify open issues and discuss the applications of LI and related tasks and proposes future directions for research in LI.

Tommi Jauhiainen, Ph.D., is a Post-doctoral Researcher at The University of Helsinki. He wrote his master's thesis on automatic language identification and continued his research on the same subject as a doctoral student. Dr. Jauhiainen organized the first shared task in Cuneiform Language Identification (CLI) in 2019 as well as the Uralic Language Identification (ULI) shared tasks in 2020 and 2021. He is the first author of approximately 20 peer-reviewed publications on language identification.

Marcos Zampieri, Ph.D., is an Assistant Professor at George Mason University. He received his PhD from Saarland University with a thesis on computational modelling of language variation. He has published over 100 peer-reviewed papers on various topics in computational linguistics and NLP such as language and dialect identification, native language identification, machine translation, lexical complexity prediction, and social media mining. 

Timothy Baldwin, Ph.D., is the Acting Provost and Chair of the Department of Natural Language Processing at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) in addition to being a Melbourne Laureate Professor in the School of Computing and Information Systems at The University of Melbourne. Prior to joining The University of Melbourne, he was a Senior Research Engineer at the Center for the Study of Language and Information at Stanford University. He is the author of over 450 peer-reviewed publications across diverse topics in natural language processing and AI, in addition to being an ARC Future Fellow, and the recipient of a number of prestigious awards at top conferences. 

Krister Lindén, Ph.D., is the Research Director of Language Technology at the University of Helsinki in addition to the National Coordinator of FIN-CLARIN, the Finnish Node of CLARIN ERIC, which is a European research infrastructure for Social Sciences and the Humanities. He is the Chair of the CLARIN National Coordinators Forum and a member of CLIC (Committee for Legal and Ethical Issues in CLARIN). He holds a doctoral degree in Language Technology from the University of Helsinki. He is the co-author of more than 160 publications related to language technology and its utilization in digital humanities and language resource processing. He is currently also a deputy team leader in the Centre of Excellence of Ancient Near Eastern Empires.
Erscheint lt. Verlag 2.2.2024
Reihe/Serie Synthesis Lectures on Human Language Technologies
Synthesis Lectures on Human Language Technologies
Zusatzinfo XIV, 148 p. 10 illus., 8 illus. in color.
Sprache englisch
Themenwelt Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Mathematik / Informatik Mathematik Statistik
Schlagworte Dialect Identification • Document Pre-processing • Language Identification • Language Modeling • Language Similarity • Natural Language Processing
ISBN-10 3-031-45822-2 / 3031458222
ISBN-13 978-3-031-45822-4 / 9783031458224
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 6,4 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
der Praxis-Guide für Künstliche Intelligenz in Unternehmen - Chancen …

von Thomas R. Köhler; Julia Finkeissen

eBook Download (2024)
Campus Verlag
38,99
Wie du KI richtig nutzt - schreiben, recherchieren, Bilder erstellen, …

von Rainer Hattenhauer

eBook Download (2023)
Rheinwerk Computing (Verlag)
24,90