UC Berkeley’s RADlab retreat
I’m on my way back from UC Berkeley’s RADlab winter retreat at Lake Tahoe. I’m really grateful to Prof. Randy Katz and the UC Berkeley faculty for inviting me.
RADlab’s vision is to enable one person (possibly the next Pierre Omidyar) to single-handedly invent and operationalize the next multi-million-user service (possibly the next eBay) over the course of a long weekend. The retreat is an opportunity for industry leaders to mix with faculty and students, review progress, and provide feedback.
Over these three intense days, students and faculty gave excellent talks and demonstrations. Here is a (very) short list of things worth calling out:
- By now, it is commonplace for researchers and students to develop, test, and run on the cloud;
- Ruby has made it into the classroom. I’ve seen some impressive term projects with very little code written (and, interestingly enough, the test code has more lines than the actual code);
- Machine learning is alive! I found some encouraging proof points in scaling resources up and down, generating equivalent synthetic workloads, detecting datacenter problems in a timely manner through signature correlation, etc.;
- A scalable data store (SCADS) whose cost per user doesn’t increase, with a declarative language for setting performance/consistency tradeoffs;
- Use of a performance model to perform accurate diagnosis while using only about 10% of the log data.
I often think about the scale divide between those who can get behind the curtains of internet-scale data centers (often on the order of 10^5 servers) and those who cannot. Those who have access typically have limited freedom to chase high-risk, high-payout propositions. Next-gen infrastructure researchers must get a chance to see their artifacts operating at scale. What can researchers and practitioners do to bridge the scale divide to their mutual advantage? I’ve seen that students clearly extract a lot of value from their internships. Without a doubt, internships are a great win-win-win. I’ve also heard speakers asking for anonymized production traces. Unfortunately, this is a tough nut to crack … in the web age, the unintended ripple effects of letting traces leave the house give many corporations the heebie-jeebies.
I look forward to monitoring the continued progress of RADlab.
