Your complete guide to planning, designing and building Big Data applications

Webinar: Using R with Hadoop

Revolution Analytics has just published a webinar titled “Using R with Hadoop”. Among the topics touched on in this video

Check the full webinar on the Revolution Analytics blog.

Analyzing Big Data with Twitter course

UC Berkeley has just published a full semester of recorded lectures of the course Analyzing Big Data with Twitter. The syllabus is really impressive, and includes topics and tools to build a truly comprehensive big data stack: Twitter software … [Continue reading]

Analytics on a shoestring – Part 1

Since Hadoop, HDFS and other big data analytics tools are widely discussed every day in articles, blog posts and reports, it's no surprise that every company thinking of implementing an internal analytics solution is looking at them. However, all … [Continue reading]

The Twitter Ecosystem

I wrote a few days ago about the need to create applications where analytics are taken into consideration from day one. Johan Oskarsson has an interesting blog post out explaining various bits and pieces of Twitter's architecture (and related open source … [Continue reading]

Netflix Hadoop Platform

The Netflix blog is always a great source for learning about real big data architectures. In the first post of the year, they present their internal Hadoop platform. Some distinctive solutions of their architecture: Use of Amazon S3 as data … [Continue reading]

Big Data vs the “Learning Platform”

It's January 2013, and of course we are reading plenty of "Big Data predictions for 2013" articles. Many of them are about technology: cloud-based big data and machine learning solutions, real-time data processing, new visualization tools... but I … [Continue reading]

Apache Cassandra 1.2: Virtual Nodes and Atomic Batches

One of last week's news items is the release of Apache Cassandra 1.2. Despite being a point release, the new version contains some long-awaited improvements like Virtual Nodes and Atomic Batches. Virtual Nodes are my favourite new feature, as they … [Continue reading]

Getting started

Big Data is not only about NoSQL databases or Hadoop, yet many of its other aspects aren’t well covered in blogs and articles. In Everything About Big Data I’m trying to discuss all the pieces of the puzzle, to collect articles, news and how-tos … [Continue reading]