Big Data Processing Using Spark in Cloud

Big Data Processing Using Spark in Cloud
Author :
Publisher : Springer
Total Pages : 275
Release :
ISBN-10 : 9789811305504
ISBN-13 : 9811305501
Rating : 4/5 (501 Downloads)

Book Synopsis Big Data Processing Using Spark in Cloud by : Mamta Mittal

Download or read book Big Data Processing Using Spark in Cloud written by Mamta Mittal and published by Springer. This book was released on 2018-06-16 with total page 275 pages. Available in PDF, EPUB and Kindle. Book excerpt: The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the shortcomings of Hadoop that have been overcome by Spark. The book mainly focuses on the in-depth architecture of Spark and our understanding of Spark RDDs and how RDD complements big data’s immutable nature, and solves it with lazy evaluation, cacheable and type inference. It also addresses advanced topics in Spark, starting with the basics of Scala and the core Spark framework, and exploring Spark data frames, machine learning using Mllib, graph analytics using Graph X and real-time processing with Apache Kafka, AWS Kenisis, and Azure Event Hub. It then goes on to investigate Spark using PySpark and R. Focusing on the current big data stack, the book examines the interaction with current big data tools, with Spark being the core processing layer for all types of data. The book is intended for data engineers and scientists working on massive datasets and big data technologies in the cloud. In addition to industry professionals, it is helpful for aspiring data processing professionals and students working in big data processing and cloud computing environments.


Big Data Processing Using Spark in Cloud Related Books

Big Data Processing Using Spark in Cloud
Language: en
Pages: 275
Authors: Mamta Mittal
Categories: Computers
Type: BOOK - Published: 2018-06-16 - Publisher: Springer

DOWNLOAD EBOOK

The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the sh
Big Data Processing with Apache Spark
Language: en
Pages: 106
Authors: Srini Penchikala
Categories: Computers
Type: BOOK - Published: 2018-03-13 - Publisher: Lulu.com

DOWNLOAD EBOOK

Apache Spark is a popular open-source big-data processing framework thatÕs built around speed, ease of use, and unified distributed computing architecture. Not
Mastering Spark with R
Language: en
Pages: 296
Authors: Javier Luraschi
Categories: Computers
Type: BOOK - Published: 2019-10-07 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

If you’re like most R users, you have deep knowledge and love for statistics. But as your organization continues to collect huge amounts of data, adding tools
Spark: The Definitive Guide
Language: en
Pages: 594
Authors: Bill Chambers
Categories: Computers
Type: BOOK - Published: 2018-02-08 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With
Hands-On Big Data Analytics with PySpark
Language: en
Pages: 172
Authors: Rudy Lai
Categories: Computers
Type: BOOK - Published: 2019-03-29 - Publisher: Packt Publishing Ltd

DOWNLOAD EBOOK

Use PySpark to easily crush messy data at-scale and discover proven techniques to create testable, immutable, and easily parallelizable Spark jobs Key FeaturesW