Wednesday, December 18, 2013

Predictive Analytics using Storm, Hadoop, R and AWS

This presentation gives a quick refresher on Storm concepts; however, most of the time will be spent discussing a recent project in which Storm was a critical part of implementing a predictive analytics use case for an actual customer.


This talk provides an overview of Storm, the open source system for processing Big Data in real time. It starts with an overview of the technology, including its key components and concepts: Nimbus, ZooKeeper, Topology, Tuple, and Trident. The presentation then dives into the complex Big Data architecture into which Storm can be integrated. The result is a compelling stack of technologies that includes integrated Hadoop clusters, MPP, and NoSQL databases.