Summary
The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, Second Edition, you’ll learn to take advantage of Spark’s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark’s powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop.

Foreword by Rob Thomas.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology
Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds 100 times faster than Hadoop systems. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem.

About the book
Spark in Action, Second Edition, teaches you to create end-to-end analytics applications. In this entirely new book, you’ll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you’ll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms.

What's inside

Writing Spark applications in Java
Spark application architecture
Ingestion through files, databases, streaming, and Elasticsearch
Querying distributed datasets with Spark SQL

About the reader
This book does not assume previous experience with Spark, Scala, or Hadoop.

About the author
Jean-Georges Perrin is an experienced data and software architect. He is France’s first IBM Champion and has been honored for 12 consecutive years.

Table of Contents

PART 1 - THE THEORY CRIPPLED BY AWESOME EXAMPLES

1 So, what is Spark, anyway?

2 Architecture and flow

3 The majestic role of the dataframe

4 Fundamentally lazy

5 Building a simple app for deployment

6 Deploying your simple app

PART 2 - INGESTION

7 Ingestion from files

8 Ingestion from databases

9 Advanced ingestion: finding data sources and building

your own

10 Ingestion through structured streaming

PART 3 - TRANSFORMING YOUR DATA

11 Working with SQL

12 Transforming your data

13 Transforming entire documents

14 Extending transformations with user-defined functions

15 Aggregating your data

PART 4 - GOING FURTHER

16 Cache and checkpoint: Enhancing Spark’s performances

17 Exporting data and building full data pipelines

18 Exploring deployment

About The Author

Jean-Georges Perrin

Jean-Georges “jgp” Perrin is a technology leader focusing on building innovative and modern data platforms, author, and president of AIDA User Group. He is passionate about software engineering and all things data, including Data Mesh. He is proud to have been recognized as a Lifetime IBM Champion.

Product Details

Publisher: Manning (June 2, 2020)
Length: 576 pages
ISBN13: 9781617295522

Browse Related Books

Resources and Downloads

High Resolution Images

Book Cover Image (jpg): Spark in Action, Second Edition
2nd Edition Trade Paperback 9781617295522

Spark in Action, Second Edition

Covers Apache Spark 3 with Examples in Java, Python, and Scala

By Jean-Georges Perrin

LIST PRICE $79.99

PRICE MAY VARY BY RETAILER

See More Retailers

32 Books

A Different Drummer Books

Albany

Another Story Bookshop

Armchair

Bakka Phoenix Books

Banyen Books

Beguiling

Beggar's Banquet Books

Biblioasis

Black Bond Books

Blue Heron Books

Bolen Books

Book City

Book Express

Book Keeper

BookLore

Bookmark

Books on Beechwood

Bookshelf Café

Brome Lake Books

Bryan Prince

Café Books

Chat Noir Books

Coho Books

Crockett Books

Curiosity House

Drawn & Quarterly

Epic Books

Fanfare Books

Finchers

Forster’s Book Garden

Galiano Island Books

Good Egg

Gulliver's Bookstore

Hager Books

Ivy's Book Shop

Kids Books

La Maison Anglaise

Laughing Oyster Books

Le James McGill University

Let’s Talk Books

Librairie Bonder

Librairie Clio

Librairie Paragraphe

Livres Babar

Mabel's Fables

MacNally Robinson

Mill Street Books

Millennia Books

Misty River Books

Mosaic Books

Mulberry Bush Books

Munro's Books

Novel Idea

Odyssey Books

Oink! Oink!

Open Book

Otter Books

Parent Books

Parry Sound Books

Perfect Books

Roxanne's Reflections

Self-Connection Books

Shelf Life Books

Simply AudioBooks

Tanner's Books

The Owl's Nest Books & Gifts

The University of Manitoba Bookstore

The Village Bookshop

Type Books

University of British Columbia

University of Victoria

University of Waterloo

University of Western Ontario

University of Winnipeg Bookstore

Volume One

Words Worth

Yellowknife Book Cellar

Get a FREE ebook by joining our mailing list today!

Plus, receive recommendations and exclusive offers on all of your favorite books and authors from Simon & Schuster.

By clicking 'Sign me up', I consent to Simon & Schuster sending me email offers and updates, including offers for a free eBook. I understand I can unsubscribe at any time. Any offer for an eBook may be subject to exclusions, terms, conditions, and registration with our fulfillment partners. We collect and use your information in accordance with our Privacy Policy. Free ebook offer available to NEW CA subscribers only. Offer redeemable at Simon & Schuster's ebook fulfillment partner. Must redeem within 90 days. See the full terms and conditions.