Data Mining Using SAS Enterprise Miner

Randall Matignon (Autor)

Buch | Softcover

584 Seiten

2007
Wiley-Interscience (Verlag)
978-0-470-14901-0 (ISBN)

Lieferbar (Termin unbekannt)
Versandkostenfrei
innerhalb Deutschlands
Auch auf Rechnung
Verfügbarkeit in der
Filiale vor Ort prüfen

Artikel merken

Data Mining Using SAS(r) Enterprise Miner introduces the reader to a wide variety of data mining techniques in SAS(r) Enterprise Miner. This first-of-a-kind book explains the purpose of -- and reasoning behind -- every node that is a part of Enterprise Miner with regard to SEMMA design and data mining analysis.

The most thorough and up-to-date introduction to data mining techniques using SAS Enterprise Miner.

The Sample, Explore, Modify, Model, and Assess (SEMMA) methodology of SAS Enterprise Miner is an extremely valuable analytical tool for making critical business and marketing decisions. Until now, there has been no single, authoritative book that explores every node relationship and pattern that is a part of the Enterprise Miner software with regard to SEMMA design and data mining analysis.

Data Mining Using SAS Enterprise Miner introduces readers to a wide variety of data mining techniques and explains the purpose of-and reasoning behind-every node that is a part of the Enterprise Miner software. Each chapter begins with a short introduction to the assortment of statistics that is generated from the various nodes in SAS Enterprise Miner v4.3, followed by detailed explanations of configuration settings that are located within each node. Features of the book include:

The exploration of node relationships and patterns using data from an assortment of computations, charts, and graphs commonly used in SAS procedures
A step-by-step approach to each node discussion, along with an assortment of illustrations that acquaint the reader with the SAS Enterprise Miner working environment
Descriptive detail of the powerful Score node and associated SAS code, which showcases the important of managing, editing, executing, and creating custom-designed Score code for the benefit of fair and comprehensive business decision-making
Complete coverage of the wide variety of statistical techniques that can be performed using the SEMMA nodes
An accompanying Web site that provides downloadable Score code, training code, and data sets for further implementation, manipulation, and interpretation as well as SAS/IML software programming code

This book is a well-crafted study guide on the various methods employed to randomly sample, partition, graph, transform, filter, impute, replace, cluster, and process data as well as interactively group and iteratively process data while performing a wide variety of modeling techniques within the process flow of the SAS Enterprise Miner software. Data Mining Using SAS Enterprise Miner is suitable as a supplemental text for advanced undergraduate and graduate students of statistics and computer science and is also an invaluable, all-encompassing guide to data mining for novice statisticians and experts alike.

Randall Matignon, MS, is Senior Clinical SAS / Microsoft Office VBA Programmer for Amgen, Inc. in San Francisco, California. He has over twenty years of experience as a statistical programmer and applications developer in the pharmaceutical, healthcare, and biotechnology industries, and he has a broad knowledge of several programming languages, including SAS, S-Plus, and PL-SQL.

Introduction

Chapter 1: Sample Nodes 1

1.1 Input Data Source Node 3

1.2 Sampling Node 32

1.3 Data Partition Node 45

Chapter 2: Explore Nodes 55

2.1 Distribution Explorer Node 57

2.2 Multiplot Node 64

2.3 Insight Node 74

2.4 Association Node 75

2.5 Variable Selection Node 99

2.6 Link Analysis Node 120

Chapter 3: Modify Nodes 153

3.1 Data Set Attributes Node 155

3.2 Transform Variables Node 160

3.3 Filter Outliers Node 169

3.4 Replacement Node 178

3.5 Clustering Node 192

3.6 SOMiKohonen Node 227

3.7 Time Series Node 248

3.8 Interactive Grouping Node 261

Chapter 4: Model Nodes 277

4.1 Regression Node 279

4.2 Model Manager 320

4.3 Tree Node 324

4.4 Neural Network Node 355

4.5 PrincompiDmneural Node 420

4.6 User Defined Node 443

4.7 Ensemble Node 450

4.8 Memory-Based Reasoning Node 460

4.9 Two Stage Node 474

Chapter 5: Assess Nodes 489

5.1 Assessment Node 491

5.2 Reporter Node 511

Chapter 6: Scoring Nodes 515

6.1 Score Node 517

Chapter 7: Utility Nodes 525

7.1 Group Processing Node 527

7.2 Data Mining Database Node 537

7.3 SAS Code Node 541

7.4 Control point Node 552

7.5 Subdiagram Node 553

References 557

Index 560

Erscheint lt. Verlag	24.8.2007
Reihe/Serie	Wiley-Interscience
Zusatzinfo	Drawings: 19 B&W, 0 Color; Screen captures: 453 B&W, 0 Color; Tables: 0 B&W, 0 Color; Graphs: 18 B&W, 0 Color
Sprache	englisch
Maße	218 x 279 mm
Gewicht	1388 g
Themenwelt	Informatik ► Datenbanken ► Data Warehouse / Data Mining
Themenwelt	Mathematik / Informatik ► Mathematik ► Wahrscheinlichkeit / Kombinatorik
ISBN-10	0-470-14901-9 / 0470149019
ISBN-13	978-0-470-14901-0 / 9780470149010
Zustand	Neuware