Data Mining Using SAS Enterprise Miner (eBook)
584 pages
John Wiley & Sons (publisher)
978-0-470-17142-4 (ISBN)
The Sample, Explore, Modify, Model, and Assess (SEMMA) methodology of SAS Enterprise Miner is an extremely valuable analytical tool for making critical business and marketing decisions. Until now, there has been no single, authoritative book that explores every node relationship and pattern that is a part of the Enterprise Miner software with regard to SEMMA design and data mining analysis.
Data Mining Using SAS Enterprise Miner introduces readers to a wide variety of data mining techniques and explains the purpose of, and reasoning behind, every node that is a part of the Enterprise Miner software. Each chapter begins with a short introduction to the assortment of statistics generated from the various nodes in SAS Enterprise Miner v4.3, followed by detailed explanations of the configuration settings located within each node. Features of the book include:
* The exploration of node relationships and patterns using data from an assortment of computations, charts, and graphs commonly used in SAS procedures
* A step-by-step approach to each node discussion, along with an assortment of illustrations that acquaint the reader with the SAS Enterprise Miner working environment
* Descriptive detail of the powerful Score node and associated SAS code, which showcases the importance of managing, editing, executing, and creating custom-designed Score code for the benefit of fair and comprehensive business decision-making
* Complete coverage of the wide variety of statistical techniques that can be performed using the SEMMA nodes
* An accompanying Web site that provides downloadable Score code, training code, and data sets for further implementation, manipulation, and interpretation as well as SAS/IML software programming code
This book is a well-crafted study guide on the various methods employed to randomly sample, partition, graph, transform, filter, impute, replace, cluster, and process data as well as interactively group and iteratively process data while performing a wide variety of modeling techniques within the process flow of the SAS Enterprise Miner software. Data Mining Using SAS Enterprise Miner is suitable as a supplemental text for advanced undergraduate and graduate students of statistics and computer science and is also an invaluable, all-encompassing guide to data mining for novice statisticians and experts alike.
Randall Matignon, MS, is Senior Clinical SAS / Microsoft Office VBA Programmer for Amgen, Inc. in San Francisco, California. He has over twenty years of experience as a statistical programmer and applications developer in the pharmaceutical, healthcare, and biotechnology industries, and he has a broad knowledge of several programming languages, including SAS, S-Plus, and PL/SQL.
Introduction
Chapter 1: Sample Nodes
1.1 Input Data Source Node
1.2 Sampling Node
1.3 Data Partition Node
Chapter 2: Explore Nodes
2.1 Distribution Explorer Node
2.2 Multiplot Node
2.3 Insight Node
2.4 Association Node
2.5 Variable Selection Node
2.6 Link Analysis Node
Chapter 3: Modify Nodes
3.1 Data Set Attributes Node
3.2 Transform Variables Node
3.3 Filter Outliers Node
3.4 Replacement Node
3.5 Clustering Node
3.6 SOM/Kohonen Node
3.7 Time Series Node
3.8 Interactive Grouping Node
Chapter 4: Model Nodes
4.1 Regression Node
4.2 Model Manager
4.3 Tree Node
4.4 Neural Network Node
4.5 Princomp/Dmneural Node
4.6 User Defined Node
4.7 Ensemble Node
4.8 Memory-Based Reasoning Node
4.9 Two Stage Node
Chapter 5: Assess Nodes
5.1 Assessment Node
5.2 Reporter Node
Chapter 6: Scoring Nodes
6.1 Score Node
Chapter 7: Utility Nodes
7.1 Group Processing Node
7.2 Data Mining Database Node
7.3 SAS Code Node
7.4 Control Point Node
7.5 Subdiagram Node
References
Index
"The book provides a good account of the numerical and computational approaches used within the various nodes and explains necessary background concepts." (The American Statistician, May 2009)
"... a very detailed user guide." (MAA Reviews, December 26, 2007)
Publication date (per publisher) | June 28, 2008 |
---|---|
Series | Wiley Series in Computational Statistics |
Language | English |
Subject area | Computer Science ► Databases ► Data Warehouse / Data Mining |
Mathematics / Computer Science ► Mathematics ► Statistics | |
Mathematics / Computer Science ► Mathematics ► Probability / Combinatorics | |
Engineering | |
Keywords | Computational & Graphical Statistics • Computer Science • Database & Data Warehousing Technologies • Data Mining • Data Mining Statistics • SAS • Statistics |
ISBN-10 | 0-470-17142-1 / 0470171421 |
ISBN-13 | 978-0-470-17142-4 / 9780470171424 |
Size: 61.7 MB
Copy protection: Adobe DRM
Adobe DRM is a copy-protection scheme intended to protect the eBook from misuse. The eBook is authorized to your personal Adobe ID at the time of download. You can then read the eBook only on devices that are also registered to your Adobe ID.
File format: PDF (Portable Document Format)
With its fixed page layout, PDF is particularly well suited to technical books with columns, tables, and figures. A PDF can be displayed on almost any device, but it is of limited use on small displays (smartphone, eReader).
System requirements:
PC/Mac: You can read this eBook on a PC or Mac. You need a
eReader: This eBook can be read on (almost) all eBook readers. However, it is not compatible with the Amazon Kindle.
Smartphone/tablet: Whether Apple or Android, you can read this eBook. You need a
Buying eBooks from abroad
For tax reasons, we can sell eBooks only within Germany and Switzerland. Unfortunately, we cannot fulfill eBook orders from other countries.