Text Analytics with Python - Dipanjan Sarkar

Text Analytics with Python (eBook)

A Practical Real-World Approach to Gaining Actionable Insights from your Data
eBook Download: PDF
2016 | 1st ed.
XXI, 385 pages
Apress (publisher)
978-1-4842-2388-8 (ISBN)
46.99 incl. VAT
  • Download available immediately

Derive useful insights from your data using Python. Learn the techniques related to natural language processing and text analytics, and gain the skills to know which technique is best suited to solve a particular problem.

Text Analytics with Python teaches you both basic and advanced concepts, including text and language syntax, structure, and semantics. You will focus on algorithms and techniques, such as text classification, clustering, topic modeling, and text summarization.

A structured and comprehensive approach is followed in this book so that readers with little or no experience do not find themselves overwhelmed. You will start with the basics of natural language and Python and move on to advanced analytical and machine learning concepts. You will look at each technique and algorithm with both a bird's eye view to understand how it can be used as well as with a microscopic view to understand the mathematical concepts and to implement them to solve your own problems.

This book:

  • Provides complete coverage of the major concepts and techniques of natural language processing (NLP) and text analytics
  • Includes practical real-world examples of techniques for implementation, such as building a text classification system to categorize news articles, analyzing app or game reviews using topic modeling and text summarization, and clustering popular movie synopses and analyzing the sentiment of movie reviews
  • Shows implementations based on Python and several popular open source libraries in NLP and text analytics, such as the natural language toolkit (nltk), gensim, scikit-learn, spaCy and Pattern


What you will learn:

  • Natural language concepts
  • Analyzing text syntax and structure
  • Text classification
  • Text clustering and similarity analysis
  • Text summarization
  • Semantic and sentiment analysis

Readership:
IT professionals, analysts, developers, linguistic experts, data scientists, and anyone with a keen interest in linguistics, analytics, and generating insights from textual data.



Dipanjan Sarkar is an IT Engineer at Intel, the world's largest silicon company, where he works on data science, business intelligence, and application development. He received his master's degree in Information Technology from the International Institute of Information Technology, Bangalore, with a focus on data science and software engineering. He has been an analytics practitioner for over four years, specializing in predictive and text analytics. He has also authored a book on Machine Learning with R and occasionally reviews technical books. Dipanjan's interests include learning about new technology, disruptive start-ups, and data science. In his spare time he loves reading, gaming, and watching popular sitcoms.
Derive useful insights from your data using Python. You will learn both basic and advanced concepts, including text and language syntax, structure, and semantics. You will focus on algorithms and techniques, such as text classification, clustering, topic modeling, and text summarization.

Text Analytics with Python teaches you the techniques related to natural language processing and text analytics, and you will gain the skills to know which technique is best suited to solve a particular problem. You will look at each technique and algorithm with both a bird's eye view to understand how it can be used as well as with a microscopic view to understand the mathematical concepts and to implement them to solve your own problems.

What You Will Learn:

  • Understand the major concepts and techniques of natural language processing (NLP) and text analytics, including syntax and structure
  • Build a text classification system to categorize news articles, analyze app or game reviews using topic modeling and text summarization, and cluster popular movie synopses and analyze the sentiment of movie reviews
  • Implement Python and popular open source libraries in NLP and text analytics, such as the natural language toolkit (nltk), gensim, scikit-learn, spaCy and Pattern

Who This Book Is For:

IT professionals, analysts, developers, linguistic experts, data scientists, and anyone with a keen interest in linguistics, analytics, and generating insights from textual data.

Dipanjan Sarkar is a Data Scientist at Intel, the world's largest silicon company, which is on a mission to make the world more connected and productive. He primarily works on analytics, business intelligence, application development, and building large-scale intelligent systems. He received his master's degree in Information Technology from the International Institute of Information Technology, Bangalore, with a focus on data science and software engineering. He is also an avid supporter of self-learning, especially Massive Open Online Courses, and holds a Data Science Specialization from Johns Hopkins University on Coursera. He has been an analytics practitioner for over four years, specializing in statistical, predictive, and text analytics. He has also authored a couple of books on R and machine learning, occasionally reviews technical books, and acts as a course beta tester for Coursera. Dipanjan's interests include learning about new technology, financial markets, disruptive start-ups, data science and, more recently, artificial intelligence and deep learning. In his spare time he loves reading, gaming, and watching popular sitcoms and football.

Chapter 1: Natural Language Basics
Chapter Goal: Introduces the readers to the basics of NLP and text processing
No. of pages: 40-50
Sub-Topics:
1. Language syntax and structure
2. Text formats and grammars
3. Lexical and text corpora resources
4. Deep dive into the WordNet corpus
5. Parts of speech, stemming and lemmatization

Chapter 2: Python Refresher for Text Analytics
Chapter Goal: A useful chapter for people who do not know Python as well as for experienced people who can use it as a quick reference for useful commands and techniques for text processing using Python
No. of pages: 30-35
Sub-Topics:
1. Python data structures and constructs
2. Functions, conditionals and code flow
3. Handling strings with Python
4. Regular expressions with Python
5. Quick glance into nltk, gensim and pattern

Chapter 3: Text Processing
Chapter Goal: This chapter covers all the techniques and capabilities needed for processing and parsing text into easy to understand formats. We also look at how to segment and normalize text.
No. of pages: 35-40
Sub-Topics:
1. Sentence and word tokenization
2. Text tagging and chunking
3. Text parse trees
4. Text normalization
5. Text spell checks and removal of redundant characters
6. Synonyms and synsets

Chapter 4: Text Classification
Chapter Goal: Introduces readers to the concept of classification as a supervised machine learning problem and looks at a real world example for classifying text documents
No. of pages: 40-45
Sub-Topics:
1. Classification basics
2. Types of classifiers
3. Feature generation of text documents
4. Types of feature generators
5. Building a text classifier on real world data
6. Evaluating classifiers
7. Binary and multi-class classification models

Chapter 5: Text Summarization and Topic Modeling
Chapter Goal: Introduces the concepts of text summarization, n-gram tagging analysis and topic models to the readers and looks at some real world datasets and hands-on implementations on the same
No. of pages: 40-45
Sub-Topics:
1. Text summarization concepts
2. Dimensionality reduction
3. N-gram tagging models
4. Topic modeling using LDA and LSA
5. Generate topics from real world data
6. N-gram analysis to generate patterns from app reviews

Chapter 6: Text Clustering and Similarity Analysis
Chapter Goal: We look at unsupervised machine learning concepts here like text clustering and similarity measures
No. of pages: 35-40
Sub-Topics:
1. Clustering concepts
2. Analyzing text similarity
3. Implementing text similarity with cosine, Jaccard measures
4. Text clustering algorithms
5. Hands-on text clustering on real world data

Chapter 7: Sentiment Analysis
Chapter Goal: We look at solving a popular problem of analyzing sentiment from text using a combination of methods learnt earlier including classification and also lexical analysis
No. of pages: 35-40
Sub-Topics:
1. What is sentiment analysis
2. Looking at lexical corpora for sentiment
3. Analyzing sentiment using lexical analysis (hands-on)
4. Building a sentiment analysis classifier (hands-on)

Publication date (per publisher): 30.11.2016
Additional info: XXI, 385 p. 54 illus., 33 illus. in color.
Place of publication: Berkeley
Language: English
Subject area: Computer Science > Databases > Data Warehouse / Data Mining
Mathematics / Computer Science > Computer Science > Networks
Mathematics / Computer Science > Computer Science > Programming Languages / Tools
Keywords: Deep Learning in Text Analysis • Natural Language Basics • Python • sentiment analysis • text classification • Text Clustering • Text Mining
ISBN-10: 1-4842-2388-8 / 1484223888
ISBN-13: 978-1-4842-2388-8 / 9781484223888
PDF (watermark)
Size: 6.8 MB

DRM: Digital watermark
This eBook contains a digital watermark and is therefore personalized for you. If the eBook is improperly passed on to third parties, it can be traced back to the source.

File format: PDF (Portable Document Format)
With its fixed page layout, PDF is particularly suitable for technical books with columns, tables, and figures. A PDF can be displayed on almost all devices but is only of limited suitability for small displays (smartphone, eReader).

System requirements:
PC/Mac: You can read this eBook on a PC or Mac. You need a PDF viewer, e.g. Adobe Reader or Adobe Digital Editions.
eReader: This eBook can be read on (almost) all eBook readers. However, it is not compatible with the Amazon Kindle.
Smartphone/Tablet: Whether Apple or Android, you can read this eBook. You need a PDF viewer, e.g. the free Adobe Digital Editions app.

Buying eBooks from abroad
For tax law reasons, we can sell eBooks only within Germany and Switzerland. Regrettably, we cannot fulfill eBook orders from other countries.
