Modern Data Architectures with Python (eBook)

A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python

(Autor)

eBook Download: EPUB
2023
318 Seiten
Packt Publishing (Verlag)
978-1-80107-641-8 (ISBN)

Lese- und Medienproben

Modern Data Architectures with Python -  Brian Lipp
Systemvoraussetzungen
35,99 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen

Modern Data Architectures with Python will teach you how to seamlessly incorporate your machine learning and data science work streams into your open data platforms. You'll learn how to take your data and create open lakehouses that work with any technology using tried-and-true techniques, including the medallion architecture and Delta Lake.
Starting with the fundamentals, this book will help you build pipelines on Databricks, an open data platform, using SQL and Python. You'll gain an understanding of notebooks and applications written in Python using standard software engineering tools such as git, pre-commit, Jenkins, and Github. Next, you'll delve into streaming and batch-based data processing using Apache Spark and Confluent Kafka. As you advance, you'll learn how to deploy your resources using infrastructure as code and how to automate your workflows and code development. Since any data platform's ability to handle and work with AI and ML is a vital component, you'll also explore the basics of ML and how to work with modern MLOps tooling. Finally, you'll get hands-on experience with Apache Spark, one of the key data technologies in today's market.
By the end of this book, you'll have amassed a wealth of practical and theoretical knowledge to build, manage, orchestrate, and architect your data ecosystems.


Build scalable and reliable data ecosystems using Data Mesh, Databricks Spark, and KafkaKey FeaturesDevelop modern data skills used in emerging technologiesLearn pragmatic design methodologies such as Data Mesh and data lakehousesGain a deeper understanding of data governancePurchase of the print or Kindle book includes a free PDF eBookBook DescriptionModern Data Architectures with Python will teach you how to seamlessly incorporate your machine learning and data science work streams into your open data platforms. You'll learn how to take your data and create open lakehouses that work with any technology using tried-and-true techniques, including the medallion architecture and Delta Lake. Starting with the fundamentals, this book will help you build pipelines on Databricks, an open data platform, using SQL and Python. You ll gain an understanding of notebooks and applications written in Python using standard software engineering tools such as git, pre-commit, Jenkins, and Github. Next, you ll delve into streaming and batch-based data processing using Apache Spark and Confluent Kafka. As you advance, you ll learn how to deploy your resources using infrastructure as code and how to automate your workflows and code development. Since any data platform's ability to handle and work with AI and ML is a vital component, you ll also explore the basics of ML and how to work with modern MLOps tooling. Finally, you ll get hands-on experience with Apache Spark, one of the key data technologies in today s market. By the end of this book, you ll have amassed a wealth of practical and theoretical knowledge to build, manage, orchestrate, and architect your data ecosystems.What you will learnUnderstand data patterns including delta architectureDiscover how to increase performance with Spark internalsFind out how to design critical data diagramsExplore MLOps with tools such as AutoML and MLflowGet to grips with building data products in a data meshDiscover data governance and build confidence in your dataIntroduce data visualizations and dashboards into your data practiceWho this book is forThis book is for developers, analytics engineers, and managers looking to further develop a data ecosystem within their organization. While they re not prerequisites, basic knowledge of Python and prior experience with data will help you to read and follow along with the examples.]]>
Erscheint lt. Verlag 29.9.2023
Sprache englisch
Themenwelt Sachbuch/Ratgeber Freizeit / Hobby Sammeln / Sammlerkataloge
Informatik Datenbanken Data Warehouse / Data Mining
Mathematik / Informatik Informatik Theorie / Studium
ISBN-10 1-80107-641-3 / 1801076413
ISBN-13 978-1-80107-641-8 / 9781801076418
Haben Sie eine Frage zum Produkt?
EPUBEPUB (Adobe DRM)
Größe: 9,1 MB

Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM

Dateiformat: EPUB (Electronic Publication)
EPUB ist ein offener Standard für eBooks und eignet sich besonders zur Darstellung von Belle­tristik und Sach­büchern. Der Fließ­text wird dynamisch an die Display- und Schrift­größe ange­passt. Auch für mobile Lese­geräte ist EPUB daher gut geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine Adobe-ID und die Software Adobe Digital Editions (kostenlos). Von der Benutzung der OverDrive Media Console raten wir Ihnen ab. Erfahrungsgemäß treten hier gehäuft Probleme mit dem Adobe DRM auf.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine Adobe-ID sowie eine kostenlose App.
Geräteliste und zusätzliche Hinweise

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich