Statistics for Data Science - James D. Miller

Statistics for Data Science

(Autor)

Buch | Softcover
286 Seiten
2017
Packt Publishing Limited (Verlag)
978-1-78829-067-8 (ISBN)
38,65 inkl. MwSt
Get your statistics basics right before diving into the world of data science

About This Book

• No need to take a degree in statistics, read this book and get a strong statistics base for data science and real-world programs;
• Implement statistics in data science tasks such as data cleaning, mining, and analysis
• Learn all about probability, statistics, numerical computations, and more with the help of R programs

Who This Book Is For

This book is intended for those developers who are willing to enter the field of data science and are looking for concise information of statistics with the help of insightful programs and simple explanation. Some basic hands on R will be useful.

What You Will Learn

• Analyze the transition from a data developer to a data scientist mindset
• Get acquainted with the R programs and the logic used for statistical computations
• Understand mathematical concepts such as variance, standard deviation, probability, matrix calculations, and more
• Learn to implement statistics in data science tasks such as data cleaning, mining, and analysis
• Learn the statistical techniques required to perform tasks such as linear regression, regularization, model assessment, boosting, SVMs, and working with neural networks
• Get comfortable with performing various statistical computations for data science programmatically

In Detail

Data science is an ever-evolving field, which is growing in popularity at an exponential rate. Data science includes techniques and theories extracted from the fields of statistics; computer science, and, most importantly, machine learning, databases, data visualization, and so on.
This book takes you through an entire journey of statistics, from knowing very little to becoming comfortable in using various statistical methods for data science tasks. It starts off with simple statistics and then move on to statistical methods that are used in data science algorithms. The R programs for statistical computation are clearly explained along with logic. You will come across various mathematical concepts, such as variance, standard deviation, probability, matrix calculations, and more. You will learn only what is required to implement statistics in data science tasks such as data cleaning, mining, and analysis. You will learn the statistical techniques required to perform tasks such as linear regression, regularization, model assessment, boosting, SVMs, and working with neural networks.
By the end of the book, you will be comfortable with performing various statistical computations for data science programmatically.

Style and approach

Step by step comprehensive guide with real world examples

James D. Miller An IBM certified expert, creative innovator and accomplished Director, Sr. Project Leader & Application/System Architect with +35 years of extensive applications and system design & development experience across multiple platforms and technologies. Experiences include introducing customers to new and sometimes disruptive technologies and platforms, integrating with IBM Watson Analytics, Cognos BI, TM1 and Web architecture design, systems analysis, GUI design and testing, Database modelling and systems analysis, design, and development of OLAP, Client/Server, Web and Mainframe applications and systems utilizing: IBM Watson Analytics, IBM Cognos BI & TM1 (TM1 rules, TI, TM1Web and Planning Manager), Cognos Framework Manager, dynaSight - ArcPlan, ASP, DHTML, XML, IIS, MS Visual Basic and VBA, Visual Studio, PERL, SPLUNK, WebSuite, MS SQL Server, ORACLE, SYBASE Server, etc. Responsibilities have also included all aspects of Windows and SQL solution development & design including: analysis; GUI (and Web site) design; data modelling; table, screen/form & script development; SQL (and remote stored procedures and triggers) development/testing; test preparation and management and training of programming staff. Other experience includes development of ETL infrastructure such as data transfer automation between mainframe (DB2, Lawson, Great Plains, etc.) systems and client/server SQL server & Web based applications & integration of enterprise applications & data sources. Mr. Miller has acted as Internet Applications Development Mgr. responsible for the design, development, QA and delivery of multiple Web Sites including online trading applications, warehouse process control & scheduling systems, administrative and control applications. Mr. Miller also was responsible for the design, development and administration of a Web based financial reporting system for a 450-million-dollar organization, reporting directly to the CFO and his executive team. Mr. Miller has also been responsible for managing and directing multiple resources in various management roles including project and team leader, lead developer and applications development director. Jim has authored the following: Mastering Predictive Analytics with R - Second Edition, Big Data Visualization, Learning IBM Watson Analytics, Implementing Splunk - Second Edition, Mastering Splunk, IBM Cognos TM1 Developer's Certification Guide In addition to the above, Jim has written a number of whitepapers on best practices such as “Establishing a Center of Excellence” and continues to post blogs on a number of relevant topics based upon personal experiences and industry best practices. Jim is a perpetual learner continuing to pursue experiences and certifications, currently holding the following current technical certifications: IBM Certified Developer Cognos TM1, IBM Certified Analyst Cognos TM1, IBM Certified Administrator Cognos TM1, IBM Cognos TM1 Master 385 Certification, IBM Certified Advanced Solution Expert Cognos TM1, IBM OpenPages Developer Fundamentals C2020-001-ENU, IBM Cognos 10 BI Administrator C2020-622, IBM Cognos 10 BI Author C2090-620-ENU, IBM Cognos BI Professional C2090-180-ENU, IBM Cognos 10 BI Metadata Model Developer C2090-632, IBM Certified Solution Expert - Cognos BI Specialties: The evaluation and introduction of innovative and disruptive technologies, Cloud migration, IBM Watson Analytics, Big Data, Data Visualizations, Cognos BI and TM1 application Design and Development, OLAP, Visual Basic, SQL Server, Forecasting and Planning; International Application Development, Business Intelligence, Project Development & Delivery and process improvement.

Erscheinungsdatum
Verlagsort Birmingham
Sprache englisch
Maße 75 x 93 mm
Themenwelt Mathematik / Informatik Mathematik Computerprogramme / Computeralgebra
Mathematik / Informatik Mathematik Statistik
ISBN-10 1-78829-067-4 / 1788290674
ISBN-13 978-1-78829-067-8 / 9781788290678
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich