The Essentials of Data Science: Knowledge Discovery Using R - Graham J. Williams

The Essentials of Data Science: Knowledge Discovery Using R

Buch | Hardcover
344 Seiten
2017
Chapman & Hall/CRC (Verlag)
978-1-4987-4000-5 (ISBN)
186,95 inkl. MwSt
This book presents data science material useful to data scientists. As a practitioner, the author brings a practical view, with a hands-on presentation useful to other practitioners. He concentrates on the current generation of R packages, including Hadley Wickam's suite of packages, such as tidyr, dplyr, lubridate, stringr, and ggplot2.
The Essentials of Data Science: Knowledge Discovery Using R presents the concepts of data science through a hands-on approach using free and open source software. It systematically drives an accessible journey through data analysis and machine learning to discover and share knowledge from data.

Building on over thirty years’ experience in teaching and practising data science, the author encourages a programming-by-example approach to ensure students and practitioners attune to the practise of data science while building their data skills. Proven frameworks are provided as reusable templates. Real world case studies then provide insight for the data scientist to swiftly adapt the templates to new tasks and datasets.

The book begins by introducing data science. It then reviews R’s capabilities for analysing data by writing computer programs. These programs are developed and explained step by step. From analysing and visualising data, the framework moves on to tried and tested machine learning techniques for predictive modelling and knowledge discovery. Literate programming and a consistent style are a focus throughout the book.

Graham J. Williams is Director of Data Science with Microsoft and Honorary Associate Professor with the Australian National University. He is also Adjunct Professor with the University of Canberra. He was previously Senior Director of Analytics with the Australian Taxation Office, Lead Data Scientist with the Australian Government's Centre of Excellence in Data Analytics, and International Visiting Professor of the Chinese Academy of Sciences. Over three decades , Graham has been an active machine learning researcher and author of many publications and software including Rattle. As a practitioner of data science he has deployed solutions in areas including finance, banking, insurance, health, education and government. He is also chair and steering committee member of international conferences in knowledge discovery, artificial intelligence, machine learning, and data mining.

Part I - An Overview for the Data Scientist. Data Science, Analytics, and Data Mining. From Rattle to R for the Data Scientist. Preparing Data. Building Models. Case Studies. R Basics. Part II - Data Foundations. Reading Data into R. Exploring and Summarising Data. Transforming Data. Presenting Data. Part III - Analytics. Descriptive Analytics. Predictive Analytics. Prescriptive Analytics. Text Analytics. Social Network Analytics. Part IV - Advanced Data Science in R. Dealing with Big Data. Parallel Processing for High Performance Analytics. Ensembles for Big Data.

Erscheinungsdatum
Reihe/Serie Chapman & Hall/CRC The R Series
Sprache englisch
Maße 156 x 234 mm
Gewicht 703 g
Themenwelt Mathematik / Informatik Informatik Datenbanken
Mathematik / Informatik Informatik Theorie / Studium
Mathematik / Informatik Mathematik Computerprogramme / Computeralgebra
ISBN-10 1-4987-4000-6 / 1498740006
ISBN-13 978-1-4987-4000-5 / 9781498740005
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich