Monday, April 7, 2014

Stream Mining with Rfx Framework

Estimating the number of unique words in a stream of URLs
Using the HyperLogLog data structure, available since Redis 2.8.9
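
As a rough sketch of this idea, the Python snippet below feeds the words found in each URL to a Redis HyperLogLog with PFADD and reads the estimate back with PFCOUNT. The helper names (words_in_url, count_unique_words), the key name, and the tokenization rule are assumptions made for illustration and are not taken from the Rfx framework. The client argument is assumed to be a redis-py style object exposing pfadd and pfcount.

```python
import re


def words_in_url(url):
    """Split a URL into lowercase alphanumeric tokens.

    Hypothetical tokenizer: any run of non-alphanumeric characters
    is treated as a separator.
    """
    return [w for w in re.split(r"[^0-9a-z]+", url.lower()) if w]


def count_unique_words(client, key, urls):
    """Add every word of every URL to the HyperLogLog stored at `key`
    and return the estimated number of distinct words.

    `client` is assumed to behave like a redis-py client:
    pfadd(key, *values) and pfcount(key).
    """
    for url in urls:
        words = words_in_url(url)
        if words:
            # PFADD ignores duplicates by design; only the sketch is updated.
            client.pfadd(key, *words)
    return client.pfcount(key)
```

With the redis-py package installed and a Redis server (2.8.9 or later) running locally, this could be called as count_unique_words(redis.Redis(), "url:words", url_stream). The result is an estimate, not an exact count.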

Open Source Stream Library of AddThis

Original Paper: HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm

Mining Data Stream 

Applicable Problems:
  • Estimating the number of unique elements in a continuous data stream
  • Cardinality estimation over big data sets
  • Networking and traffic monitoring, such as detecting worm propagation, network attacks (e.g., Denial of Service), and link-based spam on the web
  • Counting distinct active flows, an important indicator for detecting attacks and monitoring traffic
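
To make the first item above concrete, here is a minimal pure-Python sketch of a HyperLogLog counter. It is a simplified illustration, not the Redis or AddThis implementation: the class name, the choice of SHA-1 as the hash, and the default precision p=12 are assumptions. It uses a 64-bit hash and the small-range (linear counting) correction only, and it assumes p is at least 7 so that the bias constant formula applies.

```python
import hashlib
import math


class HyperLogLog:
    """Minimal HyperLogLog sketch with m = 2**p registers (assumes p >= 7)."""

    def __init__(self, p=12):
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m
        # Bias-correction constant for m >= 128.
        self.alpha = 0.7213 / (1 + 1.079 / self.m)

    def add(self, item):
        # Take the first 8 bytes of SHA-1 as a 64-bit hash (illustrative choice).
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        idx = h >> (64 - self.p)                    # top p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)       # remaining 64 - p bits
        # Rank = 1-based position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def count(self):
        # Raw estimate from the harmonic mean of 2**register values.
        z = sum(2.0 ** -r for r in self.registers)
        estimate = self.alpha * self.m * self.m / z
        zeros = self.registers.count(0)
        if estimate <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            estimate = self.m * math.log(self.m / zeros)
        return estimate
```

With p=12 the sketch keeps 4096 small registers regardless of stream length, and the expected relative error is roughly 1.04 / sqrt(4096), or about 1.6%. Adding the same element twice never changes the estimate, which is what makes the structure suitable for counting distinct flows.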
Reference Links