Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
Seiten
2019
Chapman & Hall/CRC (Verlag)
978-0-367-40987-6 (ISBN)
Chapman & Hall/CRC (Verlag)
978-0-367-40987-6 (ISBN)
Zu diesem Artikel existiert eine Nachauflage
This is a modern textbook in statistical inference, using the principles of data science through R and the Tidyverse. It assumes minimal background knowledge of the reader: there is no algebra, no calculus, and no prior programming/coding experience.
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse provides a pathway for learning about statistical inference using data science tools widely used in industry, academia, and government. It introduces the tidyverse suite of R packages, including the ggplot2 package for data visualization, and the dplyr package for data wrangling. After equipping readers with just enough of these data science tools to perform effective exploratory data analyses, the book covers traditional introductory statistics topics like confidence intervals, hypothesis testing, and multiple regression modeling, while focusing on visualization throughout.
Features:
● Assumes minimal prerequisites, notably, no prior calculus nor coding experience
● Motivates theory using real-world data, including all domestic flights leaving New York City in 2013, the Gapminder project, and the data journalism website, FiveThirtyEight.com
● Centers on simulation-based approaches to statistical inference rather than mathematical formulas
● Uses the infer package for "tidy" and transparent statistical inference to construct confidence intervals and conduct hypothesis tests via the bootstrap and permutation methods
● Provides all code and output embedded directly in the text; also available in the online version at moderndive.com
This book is intended for individuals who would like to simultaneously start developing their data science toolbox and start learning about the inferential and modeling tools used in much of modern-day research. The book can be used in methods and data science courses and first courses in statistics, at both the undergraduate and graduate levels.
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse provides a pathway for learning about statistical inference using data science tools widely used in industry, academia, and government. It introduces the tidyverse suite of R packages, including the ggplot2 package for data visualization, and the dplyr package for data wrangling. After equipping readers with just enough of these data science tools to perform effective exploratory data analyses, the book covers traditional introductory statistics topics like confidence intervals, hypothesis testing, and multiple regression modeling, while focusing on visualization throughout.
Features:
● Assumes minimal prerequisites, notably, no prior calculus nor coding experience
● Motivates theory using real-world data, including all domestic flights leaving New York City in 2013, the Gapminder project, and the data journalism website, FiveThirtyEight.com
● Centers on simulation-based approaches to statistical inference rather than mathematical formulas
● Uses the infer package for "tidy" and transparent statistical inference to construct confidence intervals and conduct hypothesis tests via the bootstrap and permutation methods
● Provides all code and output embedded directly in the text; also available in the online version at moderndive.com
This book is intended for individuals who would like to simultaneously start developing their data science toolbox and start learning about the inferential and modeling tools used in much of modern-day research. The book can be used in methods and data science courses and first courses in statistics, at both the undergraduate and graduate levels.
• Chester Ismay is a Data Science Evangelist for DataRobot and is based in Portland, Oregon, USA. •Albert Y. Kim is an Assistant Professor of Statistical and Data Sciences at Smith College in Northampton, Massachusetts, USA.
Preface
1 Getting Started with Data in R
I Data Science via the tidyverse
2 Data Visualization
3 Data Wrangling
4 Data Importing & “Tidy” Data
II Data Modeling via moderndive
5 Basic Regression
6 Multiple Regression
III Statistical Inference via infer
7 Sampling
8 Bootstrapping & Confidence Intervals
9 Hypothesis Testing
10 Inference for Regression
11 Tell the Story with Data
Appendix
A Statistical Background
B Information about R packages Used
Bibliography
Index
Erscheinungsdatum | 16.12.2019 |
---|---|
Reihe/Serie | Chapman & Hall/CRC The R Series |
Zusatzinfo | 228 Illustrations, black and white |
Sprache | englisch |
Maße | 178 x 254 mm |
Gewicht | 957 g |
Themenwelt | Mathematik / Informatik ► Mathematik ► Statistik |
Technik ► Elektrotechnik / Energietechnik | |
Technik ► Umwelttechnik / Biotechnologie | |
ISBN-10 | 0-367-40987-9 / 0367409879 |
ISBN-13 | 978-0-367-40987-6 / 9780367409876 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
Eine Einführung für Wirtschafts- und Sozialwissenschaftler
Buch | Softcover (2022)
De Gruyter Oldenbourg (Verlag)
29,95 €