Keynote Talks

Keynote Speakers

John Wilkes (Google)

Building the warehouse scale computer

Abstract

Imagine some product team inside Google wants 100,000 CPU cores + RAM + flash + accelerators + disk in a couple of months. We need to decide where to put them, when; whether to deploy new machines, or re-purpose/reconfigure old ones; ensure we have enough power, cooling, networking, physical racks, data centers and (over longer a time-frame) wind power; cope with variances in delivery times from supply logistics hiccups; do multi-year cost-optimal placement+decisions in the face of literally thousands of different machine configurations; keep track of parts; schedule repairs, upgrades, and installations; and generally make all this happen behind the scenes at minimum cost. And then after breakfast, we get to dynamically allocate resources (on the small-minutes timescale) to the product groups that need them most urgently, accurately reflecting the cost (opex/capex) of all the machines and infrastructure we just deployed, and monitoring and controlling the datacenter power and cooling systems to achieve minimum overheads - even as we replace all of these on the fly. This talk will highlight some of the exciting problems we're working on inside Google to ensure we can supply the needs of an organization that is experiencing (literally) exponential growth in computing capacity.

Bio

John Wilkes has been at Google since 2008, where he is working on automation for building warehouse scale computers. Before that, he spent a long time at HP Labs, becoming an HP and ACM Fellow in 2002. He is interested in far too many aspects of distributed systems, but a recurring theme has been technologies that allow systems to manage themselves. In his spare time he continues, stubbornly, trying to learn how to blow glass.

John Wilke's Talk (Slides)


Michael J. Carey (UC Irvine)

Big NoSQL Data, Apache AsterixDB, and Beyond

Abstract

Big Data comes in many shapes and sizes. Today's varieties of Big Data include Big Tabular Data (e.g., large enterprise-style relational data sets), Big Graph Data (e.g., large social networks), Big Textual Data (e.g., large collections of blogs or messages), and of course Big Semistructured Data (e.g., large collections of JSON objects) — a.k.a. Big NoSQL Data. This keynote will examine the NoSQL faction of the Big Data movement, describing the nature of this data and then surveying some of the platforms for storing and querying such data today — a.k.a. document database systems. To make things concrete, the talk will include a deeper look at Apache AsterixDB, an open-source Big Data Management System that originated from several University of California campuses and provides an excellent foundation for managing NoSQL data. Details covered will include its underlying storage technologies and the sorts of schema-related, ingestion-related, and query-related features that more and more such systems are beginning to offer. The talk will also discuss our current efforts to move from our current world of passive Big Data platforms to a new 'BAD' (Big Active Data) world. The keynote will close with an enumeration of some of the open technical challenges in the NoSQL data management space.

Bio

Michael J. Carey received his B.S. and M.S. degrees from Carnegie-Mellon University and his Ph.D. from the University of California, Berkeley, in 1979, 1981, and 1983, respectively. He is currently a Bren Professor of Information and Computer Sciences at the University of California, Irvine (UCI) and a Consulting Architect at Couchbase, Inc. Before joining UCI in 2008, Dr. Carey worked at BEA Systems for seven years and led the development of BEA's AquaLogic Data Services Platform product for virtual data integration. He also spent a dozen years teaching at the University of Wisconsin-Madison, five years at the IBM Almaden Research Center working on object-relational databases, and a year and a half at e-commerce platform startup Propel Software during the infamous 2000-2001 Internet bubble. Dr. Carey is an ACM Fellow, an IEEE Fellow, a member of the National Academy of Engineering, and a recipient of the ACM SIGMOD E.F. Codd Innovations Award. His current interests all center around data-intensive computing and scalable data management (a.k.a. Big Data).

Michael Carey's Talk (Slides)


Latest news