The Handbook of NLP with Gensim - Chris Kuo

The Handbook of NLP with Gensim

Leverage topic modeling to uncover hidden patterns, themes, and valuable insights within textual data

(Autor)

Buch | Softcover
310 Seiten
2023
Packt Publishing Limited (Verlag)
978-1-80324-494-5 (ISBN)
47,35 inkl. MwSt
Elevate your natural language processing skills with Gensim and become proficient in handling a wide range of NLP tasks and projects

Key Features

Advance your NLP skills with this comprehensive guide covering detailed explanations and code practices
Build real-world topical modeling pipelines and fine-tune hyperparameters to deliver optimal results
Adhere to the real-world industrial applications of topic modeling in medical, legal, and other fields
Purchase of the print or Kindle book includes a free PDF eBook

Book DescriptionNavigating the terrain of NLP research and applying it practically can be a formidable task made easy with The Handbook of NLP with Gensim. This book demystifies NLP and equips you with hands-on strategies spanning healthcare, e-commerce, finance, and more to enable you to leverage Gensim in real-world scenarios.
You’ll begin by exploring motives and techniques for extracting text information like bag-of-words, TF-IDF, and word embeddings. This book will then guide you on topic modeling using methods such as Latent Semantic Analysis (LSA) for dimensionality reduction and discovering latent semantic relationships in text data, Latent Dirichlet Allocation (LDA) for probabilistic topic modeling, and Ensemble LDA to enhance topic modeling stability and accuracy.
Next, you’ll learn text summarization techniques with Word2Vec and Doc2Vec to build the modeling pipeline and optimize models using hyperparameters. As you get acquainted with practical applications in various industries, this book will inspire you to design innovative projects. Alongside topic modeling, you’ll also explore named entity handling and NER tools, modeling procedures, and tools for effective topic modeling applications.
By the end of this book, you’ll have mastered the techniques essential to create applications with Gensim and integrate NLP into your business processes.What you will learn

Convert text into numerical values such as bag-of-word, TF-IDF, and word embedding
Use various NLP techniques with Gensim, including Word2Vec, Doc2Vec, LSA, FastText, LDA, and Ensemble LDA
Build topical modeling pipelines and visualize the results of topic models
Implement text summarization for legal, clinical, or other documents
Apply core NLP techniques in healthcare, finance, and e-commerce
Create efficient chatbots by harnessing Gensim's NLP capabilities

Who this book is forThis book is for data scientists and professionals who want to become proficient in topic modeling with Gensim. NLP practitioners can use this book as a code reference, while students or those considering a career transition will find this a valuable resource for advancing in the field of NLP. This book contains real-world applications for biomedical, healthcare, legal, and operations, making it a helpful guide for project managers designing their own topic modeling applications.

Chris Kuo is a data scientist with over 23 years of experience. He led various data science solutions including customer analytics, health analytics, fraud detection, and litigation. He is also an inventor of a U.S. patent. He has worked at several Fortune 500 companies in the insurance and retail industries. Chris teaches at Columbia University and has taught at Boston University and other universities. He has published articles in economic and management journals and served as a journal reviewer. He is the author of the eXplainable A.I., Modern Time Series Anomaly Detection, Transfer Learning for Image Classification, and The Handbook of Anomaly Detection. He received his undergraduate degree in Nuclear Engineering and Ph.D. in Economics.

Table of Contents

Introduction to NLP
Word Embedding
Text Wrangling and Preprocessing
Latent Semantic Analysis with scikit-learn
Cosine Similarity
Latent Semantic Indexing with Gensim
Using Word2Vec
Doc2Vec with Gensim
Understanding Discrete Distributions
Latent Dirichlet Allocation
LDA Modeling
LDA Visualization
The Ensemble LDA for Model Stability
LDA and BERTopic
Real-World Use Cases

Erscheinungsdatum
Verlagsort Birmingham
Sprache englisch
Maße 191 x 235 mm
Themenwelt Informatik Theorie / Studium Künstliche Intelligenz / Robotik
ISBN-10 1-80324-494-1 / 1803244941
ISBN-13 978-1-80324-494-5 / 9781803244945
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
von absurd bis tödlich: Die Tücken der künstlichen Intelligenz

von Katharina Zweig

Buch | Softcover (2023)
Heyne (Verlag)
20,00
dem Menschen überlegen – wie KI uns rettet und bedroht

von Manfred Spitzer

Buch | Hardcover (2023)
Droemer (Verlag)
24,00