Friday, January 27, 2012

Big Data links

A few things about big data:
  • A book on big data best practices by Marz and Ritchie. You can read it early, before it's published. I'm thinking about picking it up. [HN thread]
  • The Data Science Toolkit, a specialized Linux VM you can download, with lots of juicy databases and tools preinstalled, plus a common API.
  • Real-time feed processing with Storm. Storm abstracts out the queue-workers pattern into an infrastructure component. Interesting stuff.
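
For context, here is roughly what the queue-workers pattern looks like when you hand-roll it. This is a minimal single-process sketch using only Python's standard library, not Storm's API; the names `run_workers`, `handler`, and `num_workers` are made up for illustration. Storm's pitch, as I understand it, is that you stop writing this plumbing yourself.

```python
import queue
import threading


def run_workers(items, handler, num_workers=4):
    """Feed items through a shared queue to a pool of worker threads.

    Hypothetical helper for illustration only; results come back in
    completion order, not input order.
    """
    tasks = queue.Queue()
    results = []
    lock = threading.Lock()

    def worker():
        while True:
            item = tasks.get()
            if item is None:  # sentinel: no more work for this worker
                break
            out = handler(item)
            with lock:  # protect the shared results list
                results.append(out)

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for item in items:
        tasks.put(item)
    for _ in threads:  # one sentinel per worker so each one exits
        tasks.put(None)
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    # Toy "feed": compute the length of each message.
    print(sorted(run_workers(["a", "bb", "ccc"], len)))
```

Even this toy version has to deal with shutdown signals and shared state, and it ignores failures, retries, and spreading work across machines, which is the part that gets painful at scale.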
