SQL on Big Data - Sumit Pal

SQL on Big Data (eBook)

Technology, Architecture, and Innovation

(Autor)

eBook Download: PDF
2016 | 1st ed.
XVII, 157 Seiten
Apress (Verlag)
978-1-4842-2247-8 (ISBN)
Systemvoraussetzungen
34,99 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen

Learn various commercial and open source products that perform SQL on Big Data platforms. You will understand the architectures of the various SQL engines being used and how the tools work internally in terms of execution, data movement, latency, scalability, performance, and system requirements.

This book consolidates in one place solutions to the challenges associated with the requirements of speed, scalability, and the variety of operations needed for data integration and SQL operations. After discussing the history of the how and why of SQL on Big Data, the book provides in-depth insight into the products, architectures, and innovations happening in this rapidly evolving space.

SQL on Big Data discusses in detail the innovations happening, the capabilities on the horizon, and how they solve the issues of performance and scalability and the ability to handle different data types. The book covers how SQL on Big Data engines are permeating the OLTP, OLAP, and Operational analytics space and the rapidly evolving HTAP systems.

You will learn the details of:

    Batch Architectures-an understanding of the internals and how the existing Hive engine is built and how it is evolving continually to support new features and provide lower latency on queries
  • Interactive Architectures-an understanding of how SQL engines are architected to support low latency on large data sets
  • Streaming Architectures-an understanding of how SQL engines are architected to support queries on data in motion using in-memory and lock-free data structures
  • Operational Architectures-an understanding of how SQL engines are architected for transactional and operational systems to support transactions on Big Data platforms
  • Innovative Architectures-an exploration of the rapidly evolving newer SQL engines on Big Data with innovative ideas and concepts


Sumit Pal is a big data and data science consultant working with multiple clients and advising them on their data architectures and big data solutions as well as providing hands on coding with Spark, Scala, Java and Python. He is a big data, visualization and data science consultant, and a software architect and big data enthusiast and builds end-to-end data-driven analytic systems. He has more than 22 years of experience in the software industry in various roles spanning companies from startups to enterprises.

Sumit has worked for Microsoft (SQL server development team), Oracle (OLAP development team) and Verizon (big data analytics team) in a career spanning 22 years. He has extensive experience in building scalable systems across the stack from middle-tier, data tier to visualization for analytics applications, using big data, and NoSQL DB. Sumit has deep expertise in database Internals, data warehouses, dimensional modeling, data science with Java and Python, and SQL.

Sumit has MS and BS in Computer Science.


Learn various commercial and open source products that perform SQL on Big Data platforms. You will understand the architectures of the various SQL engines being used and how the tools work internally in terms of execution, data movement, latency, scalability, performance, and system requirements.This book consolidates in one place solutions to the challenges associated with the requirements of speed, scalability, and the variety of operations needed for data integration and SQL operations. After discussing the history of the how and why of SQL on Big Data, the book provides in-depth insight into the products, architectures, and innovations happening in this rapidly evolving space.SQL on Big Data discusses in detail the innovations happening, the capabilities on the horizon, and how they solve the issues of performance and scalability and the ability to handle different data types. The book covers how SQL on Big Data engines are permeating the OLTP, OLAP, and Operational analytics space and the rapidly evolving HTAP systems.You will learn the details of:Batch Architectures-Understand the internals and how the existing Hive engine is built and how it is evolving continually to support new features and provide lower latency on queriesInteractive Architectures-Understanding how SQL engines are architected to support low latency on large data setsStreaming Architectures-Understanding how SQL engines are architected to support queries on data in motion using in-memory and lock-free data structuresOperational Architectures-Understanding how SQL engines are architected for transactional and operational systems to support transactions on Big Data platformsInnovative Architectures-Explore the rapidly evolving newer SQL engines on Big Data with innovative ideas and conceptsWho This Book Is For:Business analysts, BI engineers, developers, data scientists and architects, and quality assurance professionals

Sumit Pal is a big data and data science consultant working with multiple clients and advising them on their data architectures and big data solutions as well as providing hands on coding with Spark, Scala, Java and Python. He is a big data, visualization and data science consultant, and a software architect and big data enthusiast and builds end-to-end data-driven analytic systems. He has more than 22 years of experience in the software industry in various roles spanning companies from startups to enterprises. Sumit has worked for Microsoft (SQL server development team), Oracle (OLAP development team) and Verizon (big data analytics team) in a career spanning 22 years. He has extensive experience in building scalable systems across the stack from middle-tier, data tier to visualization for analytics applications, using big data, and NoSQL DB. Sumit has deep expertise in database Internals, data warehouses, dimensional modeling, data science with Java and Python, and SQL. Sumit has MS and BS in Computer Science.

Erscheint lt. Verlag 17.11.2016
Zusatzinfo XVII, 157 p. 80 illus., 52 illus. in color.
Verlagsort Berkeley
Sprache englisch
Themenwelt Informatik Datenbanken SQL Server
Mathematik / Informatik Informatik Netzwerke
Schlagworte Batch architectures • BI • Big Data • Business Intelligence • data integration • Data movement • data structures • Data Types • Hadoop • Interactive architectures • OLAP • OLTP • Scalability • SQL • SQL on big data • SQL on Hadoop • Streaming architectures
ISBN-10 1-4842-2247-4 / 1484222474
ISBN-13 978-1-4842-2247-8 / 9781484222478
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 6,4 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
A Practical Guide to Analyzing Performance in SQL Server and Azure …

von Thomas LaRock; Enrico van de Laar

eBook Download (2023)
Apress (Verlag)
62,99