Programming for Corpus Linguistics with Python and Dataframes - Daniel Keller

Programming for Corpus Linguistics with Python and Dataframes

(Autor)

Buch | Hardcover
75 Seiten
2024
Cambridge University Press (Verlag)
978-1-009-48678-1 (ISBN)
62,30 inkl. MwSt
This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes and provides core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.
This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

1. Data frame corpora; 2. Python basics for corpus linguistics; 3. Working with data frames; 4. Algorithms for common corpus linguistic tasks; 5. Creating data frame corpora; 6. Conclusion; References.

Erscheint lt. Verlag 31.7.2024
Reihe/Serie Elements in Corpus Linguistics
Zusatzinfo Worked examples or Exercises
Verlagsort Cambridge
Sprache englisch
Themenwelt Geisteswissenschaften Sprach- / Literaturwissenschaft Sprachwissenschaft
Mathematik / Informatik Informatik Programmiersprachen / -werkzeuge
ISBN-10 1-009-48678-0 / 1009486780
ISBN-13 978-1-009-48678-1 / 9781009486781
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich