Browsing All posts tagged under »big data«

In-Stream Big Data Processing

August 20, 2013


The shortcomings and drawbacks of batch-oriented data processing were widely recognized by the Big Data community quite a long time ago. It became clear that real-time query processing and in-stream processing is the immediate need in many practical applications. In recent years, this idea got a lot of traction and a whole bunch of solutions […]

Probabilistic Data Structures for Web Analytics and Data Mining

May 1, 2012


Statistical analysis and mining of huge multi-terabyte data sets is a common task nowadays, especially in the areas like web analytics and Internet advertising. Analysis of such large data sets often requires powerful distributed data stores like Hadoop and heavy data processing with techniques like MapReduce. This approach often leads to heavyweight high-latency¬†analytical processes and […]


Get every new post delivered to your Inbox.

Join 1,696 other followers