Scala Programming for Big Data Analytics
Apress (publisher)
978-1-4842-4809-6 (ISBN)
The author discusses functions at length and highlights a number of associated concepts such as functional programming and anonymous functions. The book then delves deeper into Scala’s powerful collections system because many of Apache Spark’s APIs bear a strong resemblance to Scala collections.
Along the way you’ll see the development life cycle of a Scala program. This involves compiling and building programs using the industry-standard Scala Build Tool (SBT). You’ll cover guidelines related to dependency management using SBT, as this is critical for building large Apache Spark applications. Scala Programming for Big Data Analytics concludes by demonstrating how you can apply these concepts to write programs that run on the Apache Spark framework. These programs will provide distributed and parallel computing, which is critical for big data analytics.
What You Will Learn
See the fundamentals of Scala as a general-purpose programming language
Understand functional programming and object-oriented programming constructs in Scala
Use Scala collections and functions
Develop, package and run Apache Spark applications for big data analytics
Who This Book Is For
Data scientists, data analysts and data engineers who intend to use Apache Spark for large-scale analytics.
Irfan Elahi is a senior consultant at Deloitte Australia specializing in big data and machine learning. His primary focus lies in using big data and machine learning to support business growth with multifaceted and strong ties to the telecommunications, energy, retail and media industries. He has worked on a number of projects in Australia to design, prototype, develop, and deploy production-grade big data solutions in Amazon Web Services (AWS) and Azure to support a number of use cases ranging from enterprise data warehousing, ETL offloading, analytics, batch processing and stream processing while employing leading commercial Hadoop solutions such as Cloudera and Hortonworks. He has worked closely with clients’ systems and software engineering teams using DevOps to enhance the continuous integration and continuous deployment (CI/CD) processes and manage a Hadoop cluster’s operations and security. In addition to his technology competencies, Irfan has recently presented at the DataWorks Summit in Sydney on the subject of in-memory big data technologies and at a number of meetups all around the world. He also remains involved in delivering knowledge-transfer sessions, training and workshops about big data and machine learning, both within his firm and at clients. He has also launched Udemy courses on Apache Spark for big data analytics and R programming for data science with more than 18,000 students from 145 countries enrolled.
Chapter 1 - Scala Language.- Chapter 2 - Installing Scala.- Chapter 3 - Using Scala Shell.- Chapter 4 – Variables.- Chapter 5 - Data Types.- Chapter 6 - Conditional Statements.- Chapter 7 - Code Blocks.- Chapter 8 – Functions.- Chapter 9 - Scala collections.- Chapter 10 – Loops.- Chapter 11 - Using classes and packages.- Chapter 12 - Exception Handling.- Chapter 13 .- Chapter 14 - Hello World in Apache Spark.
Publication date | 25.07.2019 |
---|---|
Additional info | 125 Illustrations, black and white; XXI, 306 p. 125 illus. |
Place of publication | Berkeley |
Language | English |
Dimensions | 155 x 235 mm |
Subject area | Mathematics / Computer Science ► Computer Science ► Databases |
 | Mathematics / Computer Science ► Computer Science ► Programming Languages / Tools |
 | Computer Science ► Theory / Studies ► Compiler Construction |
 | Mathematics / Computer Science ► Mathematics ► Financial / Business Mathematics |
 | Business ► Business Administration / Management |
ISBN-10 | 1-4842-4809-0 / 1484248090 |
ISBN-13 | 978-1-4842-4809-6 / 9781484248096 |
Condition | New |