CRAN Recipes - William Yarberry

CRAN Recipes

DPLYR, Stringr, Lubridate, and RegEx in R
Buch | Softcover
344 Seiten
2021 | 1st ed.
Apress (Verlag)
978-1-4842-6875-9 (ISBN)
64,19 inkl. MwSt
Want to use the power of R sooner rather than later? Don’t have time to plow through wordy texts and online manuals? Use this book for quick, simple code to get your projects up and running. It includes code and examples applicable to many disciplines. Written in everyday language with a minimum of complexity, each chapter provides the building blocks you need to fit R’s astounding capabilities to your analytics, reporting, and visualization needs.  



CRAN Recipes recognizes how needless jargon and complexity get in your way. Busy professionals need simple examples and intuitive descriptions; side trips and meandering philosophical discussions are left for other books.  



Here R scripts are condensed, to the extent possible, to copy-paste-run format. Chapters and examples are structured to purpose rather than particular functions (e.g., “dirty data cleanup” rather than the R package name “janitor”). Everyday language eliminatesthe need to know functions/packages in advance. 



What You Will Learn





Carry out input/output; visualizations; data munging; manipulations at the group level; and quick data exploration
Handle forecasting (multivariate, time series, logistic regression, Facebook’s Prophet, and others)
Use text analytics; sampling; financial analysis; and advanced pattern matching (regex)
Manipulate data using DPLYR: filter, sort, summarize, add new fields to datasets, and apply powerful IF functions
Create combinations or subsets of files using joins
Write efficient code using pipes to eliminate intermediate steps (MAGRITTR)
Work with string/character manipulation of all types (STRINGR)
Discover counts, patterns, and how to locate whole words
Do wild-card matching, extraction, and invert-match
Work with dates using LUBRIDATE
Fix dirty data; attractive formatting; bad habits to avoid




Who This Book Is For 



Programmers/data scientists with at least some prior exposure to R.

William A. Yarberry, Jr., CPA, CISA, is principal consultant, ICCM Consulting LLC, based in Houston, Texas. His practice is focused on IT governance, Sarbanes-Oxley compliance, security consulting, and business analytics for cost management. He was previously a senior manager with PricewaterhouseCoopers, responsible for telecom and network services in the Southwest region. Yarberry has more than 30 years’ experience in a variety of IT-related services, including application development, internal audit management, outsourcing administration, and Sarbanes-Oxley consulting. His books include The Effective CIO (co-authored), Computer Telephony Integration, $250K Consulting, DPLYR, 50,000 Random Numbers, Telecommunications Cost Management, and GDPR: A Short Primer. In addition, he has written over 20 professional articles on topics ranging from wireless security to change management. One of his articles, "Audit Rights in anOutsource Environment," received the Institute of Internal Auditors Outstanding Contributor Award. Prior to joining PricewaterhouseCoopers, Yarberry was director of telephony services for Enron Corporation. He was responsible for operations, planning, and architectural design for voice communications servers and related systems for more than 7,000 employees. Yarberry graduated Phi Beta Kappa in chemistry from the University of Tennessee and earned an MBA at the University of Memphis. He enjoys reading history, swimming, hiking, and spending time with family.

1: DPLYR.- 2: STRINGR.- 3: Lubridate.- 4: Regular Expressions: Introduction.- 5: Typical Uses.- 6: Some Simple Patterns.- 7: Character Classes.- 8: Elements of Regular Expressions.- 9: The Magnificent Seven.- 10: Regular Expressions in Stringr.- 11: Unicode.- 12: Tools for Development and Resources.- 13: Regex Summary.- 14: Recipes for Common R Tasks.- 15: Data Structures.- 16: Visualization.- 17: Simple Prediction Methods.- 18: Smorgasbord of Simple Statistical Tests.- 19: Validation of Data.- 20: Shortcuts and Miscellaneous.- 21: Conclusion.- Appendix A: Suggested Websites.- Appendix B: Cheat Sheet for Regex in R.- Appendix C: General R Comments by John D. Cook, Consultant.- Appendix D: Understanding a Long Regular Expression.- Appendix E: Regular Expression-enabled Languages.- Appendix F: Sample Data Analysis Questions.- Appendix G: Formats Recognized by Lubridate.

Erscheinungsdatum
Zusatzinfo 35 Illustrations, color; 18 Illustrations, black and white; XXI, 344 p. 53 illus., 35 illus. in color.
Verlagsort Berkley
Sprache englisch
Maße 178 x 254 mm
Themenwelt Mathematik / Informatik Informatik Programmiersprachen / -werkzeuge
Mathematik / Informatik Informatik Theorie / Studium
Mathematik / Informatik Mathematik Computerprogramme / Computeralgebra
Schlagworte Big Data • CRAN • Data Mining • data munging • Data Science • dplyr • lubridate • programming • R • regex • Regular Expressions • Statistical • Statistics • stringr
ISBN-10 1-4842-6875-X / 148426875X
ISBN-13 978-1-4842-6875-9 / 9781484268759
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Das Handbuch für Webentwickler

von Philip Ackermann

Buch | Hardcover (2023)
Rheinwerk (Verlag)
49,90
das große Praxisbuch – Grundlagen, fortgeschrittene Themen und Best …

von Ferdinand Malcher; Danny Koppenhagen; Johannes Hoppe

Buch | Hardcover (2023)
dpunkt (Verlag)
42,90
Programmiersprache, grafische Benutzeroberflächen, Anwendungen

von Ulrich Stein

Buch | Hardcover (2023)
Hanser (Verlag)
39,99