Jeff Rothschild, Vice President of Technology at Facebook gave a great presentation at UC San Diego on "High Performance at Massive Scale – Lessons learned at Facebook". The presentation’s abstract:
Facebook has grown into one of the largest...
Facebook has the second largest installation of Hadoop (a software platform that lets one easily write and run distributed applications that process vast amounts of data), Yahoo being the first. It is also the creator of Hive, a data warehouse infrastructure...
The basic model for MapReduce derives from the map and reduce concept in functional languages like Lisp. In Lisp, a map takes as input a function and a sequence of values and applies the function to each value in the sequence. A reduce takes as input...
Designing for scale is one of the greatest challenges when building when building web applications for the Internet. The huge scale of the Internet and the amount of potentials users requires applications to be able to handle huge amounts of data and...