Filtering 100s of megabytes to a few gigabytes of CSV data: RSQLite, queryBuildR, DT and Shiny.

Dealing with CSV files ranging from hundreds of megabytes to a few gigabytes is often difficult in R, and even worse in Excel. For example, loading a 1 GB file in R using read.table is likely to take a few minutes, and opening the same file in Excel is likely to make it crash.

Lyrics Explorer - Which are the most used words in English and French songs?

The Web interface lets you explore the lexical fields of 86 English artists (17614 songs) and 56 French artists (8837 songs). The English interface is available at and the French interface at

An Agenda for Probabilistic Programming (from Microsoft Research)

How Bayesian Inference Found Air France Flight 447 Two Years After It Crashed Into Atlantic

CERN openlab Whitepaper on Future IT Challenges in Scientific Research

The Parable of Google Flu: Traps in Big Data Analysis

6000 Companies Hiring Data Scientists


