Learning and Operating Presto - Angelica Lo Duca, Tim Meehan, Vivek Bharathan, Ying Su

Learning and Operating Presto

Fast, Reliable SQL for Data Analytics and Lakehouses
Buch | Softcover
175 Seiten
2023
O'Reilly Media (Verlag)
978-1-0981-4185-1 (ISBN)
65,95 inkl. MwSt
With this practical book, data engineers and architects, platform engineers, cloud engineers, and software engineers will learn how to use Presto operations at your organization to derive insights on datasets wherever they reside.
The Presto community has mushroomed since its origins at Facebook in 2012. But ramping up this distributed SQL query engine can be challenging even for the most experienced engineers. This practical book shows you how to begin Presto operations at your organization to derive insights on datasets wherever they reside.

Authors Angelica Lo Duca, Vivek Bharathan, and George Wang explain what Presto is, where it came from, and how it differs from other data warehousing solutions. You'll discover why Facebook, Uber, Twitter, and cloud providers, including AWS, Google Cloud, and Alibaba, use Presto and how you can quickly deploy Presto in production.

You'll learn about:



Presto security and administration
Syntax and connectors
Clusters and tuning
Troubleshooting: logs, error messages, and more
Extending Presto for real-time business insight
Extending PrestoDB

Angelica Lo Duca is a researcher with a PhD in Computer Science. She currently works in Research and Technology at the Institute of Informatics and Telematics of the Italian National Research Council. Her research areas include Data Science, Machine Learning, Text Analytics, Data Visualization, Data Journalism, and Web Applications. She has also worked with Network Security, Semantic Web, Linked Data, and Blockchain. Additionally, she serves as a professor at the University of Pisa, where she teaches Data Journalism. Tim has been fascinated by data problems for much of his career. He's been working on the Presto project since 2018. He's currently works at IBM and heads the Presto Technical Steering Committee. Before IBM, he worked at Meta, Bloomberg, Goldman Sachs, among others. Vivek is the Cofounder and Principal Software Engineer at Ahana. Previously, Vivek was a Software Engineer at Uber where he managed Presto clusters with more than 2,500 nodes, processing 35PB of data per day, and worked on extending Presto to support Uber's interactive analytics needs. Prior to Uber, Vivek was an early member of the query-optimizer team at Vertica Systems and made several contributions to the core database engine and the Vertica ecosystem. Earlier in his career at the Laboratory for Artificial Intelligence Research, he developed emerging technologies in decision-support systems and reasoning systems. His Presto contributions include the pushdown of partial aggregations. Vivek holds a M.S. in Computer Science and Engineering from The Ohio State University. Ying is the performance architect at Ahana, where she works on building more efficient and better price-performant data lake services on Presto and Velox. She has worked for Microsoft SQLServer and Meta Presto in the past and is a Presto committer and TSC board member.

Erscheinungsdatum
Verlagsort Sebastopol
Sprache englisch
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Mathematik / Informatik Informatik Programmiersprachen / -werkzeuge
Mathematik / Informatik Informatik Software Entwicklung
Wirtschaft Betriebswirtschaft / Management
ISBN-10 1-0981-4185-7 / 1098141857
ISBN-13 978-1-0981-4185-1 / 9781098141851
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Datenanalyse für Künstliche Intelligenz

von Jürgen Cleve; Uwe Lämmel

Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
74,95
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
44,90