This is an archived version of CCC's website. Please visit the new ccc website for the latest information.

Big-Data Computing Study Group

Under sponsorship by the CCC, the Big-Data Study Group will explore and enable opportunities for research and applications of high-performance, data-intensive computing systems, benefiting application areas ranging from astronomy to machine translation. To begin this effort, two events were held in March, 2008.

One Pager

Establishing a Big-Data Computing Study Group - [72 KB PDF]

Leads for this workshop and Lead for effort

Randy Bryant (CMU) and Thomas Kwan (Yahoo!)

CCC council liaison for this workshop and effort

Ed Lazowska (University of Washington)

Hadoop Summit

[3/25/08, Sunnyvale, CA] | Speakers, Slides and Videos

 

Hadoop is an open source project developing software that enables data-intensive computing on cluster-based systems.  It includes a distributed file system and programming support for Map/Reduce, a data-parallel notation for expressing both element-wise and aggregating operations on collections of data.

Data-Intensive Computing Symposium

[3/26/08, Sunnyvale, CA] | Speakers, Slides and Videos

 

This symposium covered a broad range of topics, with presentations by industry and academic leaders on all aspects of data-intensive computing, including systems, programming, algorithms, data management, and both scientific and information-based applications. 

Participants

Bernie Acs (NCSA), Eugene Agichtein (Emory), William Arms (Cornell), Eric Baldeschwieler (Yahoo!), Roger Barga (Microsoft), Chaitin Baru (SDSC), Sugato Basu (Google), Jacek Becla (SLAC), Emery Berger (UMass-Amherst), Fran Berman (SDSC), Christophe Bisciglia (Google), Andrei Broder (Yahoo!), Randy Bryant (CMU), Jamie Callan (CMU), Andrew Chien (Intel), Charlie Clarke (Waterloo), Andrew Connolly (UWashington), Gene Cooperman (Northeastern), Jeff Dean (Google), Tina Eliassi-Rad (LLNL), Christos Faloutsos (CMU), Usama Fayyad (Yahoo!), Ian Foster (Argonne), Jim French (NSF), Dennis Gannon (Indiana), Phil Gibbons (Intel), Garth Gibson (CMU), Ian Gorton (Pacific NW National Lab), Robert Grossman (UI-Chicago), Milton Halem (UM-BC), Jeff Hammerbacher (Facebook), Jiawei Han (UIUC), Steve Heller (Sun), Joe Hellerstein (Berkeley), Haym Hirsh (NSF/Rutgers), Chenyi Hu (Central Arkansas), Anita Jones (Virginia), Richard Karp (Berkeley), Randy Katz (Berkeley), Yoo-Ah Kim (UConn), Jay Kistler (Yahoo!), Jon Kleinberg (Cornell), Ed Lazowska (UWashington), Michael Lesk (Rutgers), Xiaozhou Li (HP Labs), Xavier Llora (NCSA), Qi Lu (Yahoo!), Chris Manning (Stanford), Steve Meacham (NSF), Jill Mesirov (Broad Institute), Marc Najork (Microsoft), Nicholas Nystrom (Pittsburgh Supercomputing), Dave O'Hallaron (Intel/CMU), Chris Olston (Yahoo!), Kunle Olukotun (Stanford), Patrick Pantel (Yahoo!), Savas Parastatidis (Microsoft), Beth Plale (Indiana), Prabhakar Raghavan (Yahoo!), Raghu Ramakrishnan (Yahoo!), Bina Ramamurthy (SUNY Buffalo), Dan Reed (Microsoft), Anne Rogers (Chicago), Mikael Ronstrom (MySQL AB), Arie Shoshani (Lawrence Berkeley Laboratory), Padhraic Smyth (UC Irvine), Raymie Stata (Yahoo!), Ravi Sundaram (Northeastern), Alex Szalay (JHU), Douglas Thain (Notre Dame), Paul Thompson (Dartmouth), Andrew Tomkins (Yahoo!), Cristian Ungureanu (NEC Labs), Stephan Vogel (CMU), Dan Weld (UWashington), John Wilkes (HP), Jeannette Wing (NSF), Jay Wylie (HP Labs), Ke-Thia Yao (ISI/USC), Hongyuan Zha (GeorgiaTech), ChengXiang Zhai (UIUC), Yi Zhang (UC Santa Cruz)

Highlights:

Milestone Week in Evolving History of Data-Intensive Scalable Computing [85 KB PDF]

Blog Post:

Big Data Computing Group Kicks Off

Status Update:

Progress Report: CCC’s Support for Data-Intensive Computing

Related Events

Data-Intensive Scalable Computing in Education (DISC 2008)
July 16 - 18, 2008, University of Washington, Seattle, WA

Cloud Computing and Its Applications 2008 (CCA-08)
October 22-23, 2008, Gleacher Center, Chicago, Illinois