Spark in Action by Petar Zecevic

Spark in Action by Petar Zecevic

Regular price
Checking stock...
Regular price
Checking stock...
World of Books

At World of Books, you’ll find millions of preloved reads at great prices, from bestsellers to hidden gems. Every book you buy saves money and helps reduce waste, so you can read more for less while giving stories a second life.

The feel-good place to buy books
  • Free US shipping over $15
  • Buying preloved emits 41% less CO2 than new
  • Millions of affordable books
  • Give your books a new home - sell them back to us!

Spark in Action by Petar Zecevic

Working with big data can be complex and challenging, in part because of the multiple analysis frameworks and tools required. Apache Spark is a big data processing framework perfect for analyzing near-real-time streams and discovering historical patterns in batched data sets. But Spark goes much further than other frameworks. By including machine learning and graph processing capabilities, it makes many specialized data processing platforms obsolete. Spark's unified framework and programming model significantly lowers the initial infrastructure investment, and Spark's core abstractions are intuitive for most Scala, Java, and Python developers.   Spark in Action teaches readers to use Spark for stream and batch data processing. It starts with an introduction to the Spark architecture and ecosystem followed by a taste of Spark's command line interface. Readers then discover the most fundamental concepts and abstractions of Spark, particularly Resilient Distributed Datasets (RDDs) and the basic data transformations that RDDs provide. The first part of the book covers writing Spark applications using the the core APIs. Readers also learn how to work with structured data using Spark SQL, how to process near-real time data with Spark Streaming, how to apply machine learning algorithms with Spark MLlib, how to apply graph algorithms on graph-shaped data using Spark GraphX, and an introduction to Spark clustering.   Key Features: • Clear introduction to Spark • Teaches how to ingest near real-time data • Gaining value from big data • Includes real-life case studies   AUDIENCE Readers should be familiar with Java, Scala, or Python. No knowledge of Spark or streaming operations is assumed, but some acquaintance with machine learning is helpful.   ABOUT THE TECHNOLOGY Apache Spark is a big data processing framework perfect for analyzing near-real-time streams and discovering historical patterns in batched data sets. Spark also offers machine learning and graph processing capabilities.

Petar Zečević is a CTO at SV Group. During the last 14 years he has

worked on various projects as a Java developer, team leader, consultant and

software specialist. He is the founder and, with Marko, organizer of popular

Spark@Zg meetup group. Marko Bonaći has worked with Java for 13

years.He works Sematext as a Spark developer and consultant. Before that,

he was team lead for SV Group's IBM Enterprise Content Management

team.

SKU Unavailable
ISBN 13 9781617292606
ISBN 10 1617292605
Title Spark in Action
Author Petar Zecevic
Condition Unavailable
Binding Type Paperback
Publisher Manning Publications
Year published 2016-11-24
Number of pages 468
Cover note Book picture is for illustrative purposes only, actual binding, cover or edition may vary.
Note Unavailable