Learn how to process massive amounts of streaming data in real time on a cluster, using Spark Streaming! Includes a crash course in Scala, and lots of hands-on examples of connecting to various data sources such as Kafka, Flume, TCP ports, Cassandra, and more.

Published by
Frank Kane
Frank spent 9 years at Amazon and IMDb, developing and managing the technology that automatically delivers product and movie recommendations to hundreds of millions of customers, all the time. Frank holds 17 issued patents in the fields of distributed computing, data mining, and machine learning. In 2012, Frank left to start his own successful company, Sundog Software, which focuses on virtual reality environment technology, and teaching others about big data analysis.
View all posts by Frank Kane