Für diesen Artikel ist leider kein Bild verfügbar.

Data Wrangling with R

Learn how to gather, visualize, clean, and transform data one step at a time
Buch | Softcover
345 Seiten
2019
Packt Publishing Limited (Verlag)
978-1-83855-979-3 (ISBN)
33,65 inkl. MwSt
  • Titel ist leider vergriffen;
    keine Neuauflage
  • Artikel merken
Leverage the flexibility of rvest, tidyr, stringr, and other data wrangling libraries of R to extract more out of data and take clever business decisions.
About This Book
* This course provides you with a hands-on experience on the full data wrangling cycle, starting with gathering and exploring, going through visualizing, cleaning, and finally transforming the data to make it ready for analysis, further processing, or reporting. This equips you with the essential skills required for a data engineer or data analyst role.
* Each chapter will be self-contained so that you can focus on specific topics or come back later to review them.
* The course features a learning-by-doing paradigm. Each concept will be explained with code that every student will be able to reproduce in their environments.
* Each chapter will feature a practical case/exercise in which they will make use of the concepts learned in class.
Who This Book Is For
This book is catered to data analysts who need to understand how to use R to perform Data Wrangling activities.
R developers who want to learn a methodology and practical examples for performing Data Wrangling can also benefit from this book.
What You Will Learn
* Gather and store data from different sources and various file formats
* Apply basic concepts of statistics to better describe data
* Use ggplot to create scatter plots, histograms, box plots, and facets
* Use regular expressions to detect, extract, fix, and replace data patterns
* Apply shuffling, sampling, and partitioning techniques to transform data
* Use packages like plyr, dplyr, map, and mutate for transforming data
In Detail
With this book, you will apply the popular language R to perform the data wrangling tasks, and step by step, chapter by chapter, will become proficient in data wrangling.
You will learn how to explore and characterize a dataset using different R functions to perform descriptive statistics. Then, you will be shown all the different ways data can be gathered and stored. You will understand how visualizations can be used to enhance understanding of the data, and for this you will use ggplot2 to generate different plot types.
Moving on, the book will cover the data cleaning stage of data wrangling, showing how in practice there's a high probability that data is either incomplete or dirty, and explaining diverse techniques to take care of those issues. Also you will understand the different issues that arise while working with different data types especially when importing or exporting data. You'll learn the tools that can be used to solve them (formatting strings/numbers/dates, converting from string to numeric or date data types and the other way around, string manipulation functions, and so on). Then, the book will cover regular expressions, a powerful tool for matching and replacing complex string patterns.
This book then deals with data transformation. It explains why once data is cleaned, we would need to transform it to a different format or structure.
The goal of this book is to guide you through the whole data wrangling journey, from gathering to making it ready for reporting or processing.

Romeo Cabrera is a graduate in computer science and has a master's degree in computer science from Georgia Tech, with a specialization in machine learning, big data, artificial intelligence, and its applications to different domains. Romeo has over 15 years of experience in leading the analysis, design, and development of in-house mission-critical software solutions, data pipelines, and systems integration for large scale companies with millions of customers. He is also highly experienced in data modeling, processing, and analytics ("making sense of data") using many different technologies. Prasad Vaidya is a computer science graduate with an experience of more than four years, leading the analysis, design, and development of in-house mission-critical software solutions and data pipelines for large-scale companies with millions of customers. He has currently ventured into the data science field and is a market researcher with Big Data analysis as the skill set.

Erscheint lt. Verlag 30.4.2019
Verlagsort Birmingham
Sprache englisch
Maße 191 x 235 mm
Themenwelt Mathematik / Informatik Informatik Programmiersprachen / -werkzeuge
Mathematik / Informatik Mathematik Computerprogramme / Computeralgebra
ISBN-10 1-83855-979-5 / 1838559795
ISBN-13 978-1-83855-979-3 / 9781838559793
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Das Handbuch für Webentwickler

von Philip Ackermann

Buch | Hardcover (2023)
Rheinwerk (Verlag)
49,90
das große Praxisbuch – Grundlagen, fortgeschrittene Themen und Best …

von Ferdinand Malcher; Danny Koppenhagen; Johannes Hoppe

Buch | Hardcover (2023)
dpunkt (Verlag)
42,90
Programmiersprache, grafische Benutzeroberflächen, Anwendungen

von Ulrich Stein

Buch | Hardcover (2023)
Hanser (Verlag)
39,99