Data Lakes For Dummies (eBook)

(Autor)

eBook Download: PDF
2021 | 1. Auflage
384 Seiten
John Wiley & Sons (Verlag)
978-1-119-78617-7 (ISBN)

Lese- und Medienproben

Data Lakes For Dummies - Alan R. Simon
Systemvoraussetzungen
22,99 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
Take a dive into data lakes

"Data lakes" is the latest buzz word in the world of data storage, management, and analysis. Data Lakes For Dummies decodes and demystifies the concept and helps you get a straightforward answer the question: "What exactly is a data lake and do I need one for my business?" Written for an audience of technology decision makers tasked with keeping up with the latest and greatest data options, this book provides the perfect introductory survey of these novel and growing features of the information landscape. It explains how they can help your business, what they can (and can't) achieve, and what you need to do to create the lake that best suits your particular needs.

With a minimum of jargon, prolific tech author and business intelligence consultant Alan Simon explains how data lakes differ from other data storage paradigms. Once you've got the background picture, he maps out ways you can add a data lake to your business systems; migrate existing information and switch on the fresh data supply; clean up the product; and open channels to the best intelligence software for to interpreting what you've stored.

* Understand and build data lake architecture

* Store, clean, and synchronize new and existing data

* Compare the best data lake vendors

* Structure raw data and produce usable analytics

Whatever your business, data lakes are going to form ever more prominent parts of the information universe every business should have access to. Dive into this book to start exploring the deep competitive advantage they make possible--and make sure your business isn't left standing on the shore.

Alan Simon is the managing principal of Thinking Helmet, Inc., the author of 32 books on business technology, and a consultant who's worked with enterprise and government organizations. His professional focus is business intelligence, analytics, and data warehousing. He also teaches university courses in his specialty areas.

Introduction 1

Part 1: Getting Started with Data Lakes 5

Chapter 1: Jumping into the Data Lake 7

Chapter 2: Planning Your Day (and the Next Decade) at the Data Lake 25

Chapter 3: Break Out the Life Vests: Tackling Data Lake Challenges 49

Part 2: Building the Docks, Avoiding the Rocks 65

Chapter 4: Imprinting Your Data Lake on a Reference Architecture 67

Chapter 5: Anybody Hungry? Ingesting and Storing Raw Data in Your Bronze Zone 97

Chapter 6: Your Data Lake's Water Treatment Plant: The Silver Zone 121

Chapter 7: Bottling Your Data Lake Water in the Gold Zone 139

Chapter 8: Playing in the Sandbox 151

Chapter 9: Fishing in the Data Lake 159

Chapter 10: Rowing End-to-End across the Data Lake 169

Part 3: Evaporating the Data Lake into the Cloud 187

Chapter 11: A Cloudy Day at the Data Lake 189

Chapter 12: Building Data Lakes in Amazon Web Services 199

Chapter 13: Building Data Lakes in Microsoft Azure 217

Part 4: Cleaning Up the Polluted Data Lake 243

Chapter 14: Figuring Out If You Have a Data Swamp Instead of a Data Lake 245

Chapter 15: Defining Your Data Lake Remediation Strategy 259

Chapter 16: Refilling Your Data Lake 283

Part 5: Making Trips to the Data Lake a Tradition 297

Chapter 17: Checking Your GPS: The Data Lake Road Map 299

Chapter 18: Booking Future Trips to the Data Lake 325

Part 6: The Part of Tens 333

Chapter 19: Top Ten Reasons to Invest in Building a Data Lake 335

Chapter 20: Ten Places to Get Help for Your Data Lake 341

Chapter 21: Ten Differences between a Data Warehouse and a Data Lake 345

Index 351

Erscheint lt. Verlag 11.6.2021
Sprache englisch
Themenwelt Mathematik / Informatik Mathematik Statistik
Mathematik / Informatik Mathematik Wahrscheinlichkeit / Kombinatorik
Schlagworte Data Analysis • Data Lake • Datenanalyse • Statistics • Statistik
ISBN-10 1-119-78617-7 / 1119786177
ISBN-13 978-1-119-78617-7 / 9781119786177
Haben Sie eine Frage zum Produkt?
PDFPDF (Adobe DRM)
Größe: 14,9 MB

Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine Adobe-ID und die Software Adobe Digital Editions (kostenlos). Von der Benutzung der OverDrive Media Console raten wir Ihnen ab. Erfahrungsgemäß treten hier gehäuft Probleme mit dem Adobe DRM auf.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine Adobe-ID sowie eine kostenlose App.
Geräteliste und zusätzliche Hinweise

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich