2018 ACM Symposium on Cloud Computing

Michael J. Carey (UC Irvine)

Big NoSQL Data, Apache AsterixDB, and Beyond

Abstract

Big Data comes in many shapes and sizes. Today's varieties of Big Data include Big Tabular Data (e.g., large enterprise-style relational data sets), Big Graph Data (e.g., large social networks), Big Textual Data (e.g., large collections of blogs or messages), and of course Big Semistructured Data (e.g., large collections of JSON objects) — a.k.a. Big NoSQL Data. This keynote will examine the NoSQL faction of the Big Data movement, describing the nature of this data and then surveying some of the platforms for storing and querying such data today — a.k.a. document database systems. To make things concrete, the talk will include a deeper look at Apache AsterixDB, an open-source Big Data Management System that originated from several University of California campuses and provides an excellent foundation for managing NoSQL data. Details covered will include its underlying storage technologies and the sorts of schema-related, ingestion-related, and query-related features that more and more such systems are beginning to offer. The talk will also discuss our current efforts to move from our current world of passive Big Data platforms to a new 'BAD' (Big Active Data) world. The keynote will close with an enumeration of some of the open technical challenges in the NoSQL data management space.

Bio

Michael J. Carey received his B.S. and M.S. degrees from Carnegie-Mellon University and his Ph.D. from the University of California, Berkeley, in 1979, 1981, and 1983, respectively. He is currently a Bren Professor of Information and Computer Sciences at the University of California, Irvine (UCI) and a Consulting Architect at Couchbase, Inc. Before joining UCI in 2008, Dr. Carey worked at BEA Systems for seven years and led the development of BEA's AquaLogic Data Services Platform product for virtual data integration. He also spent a dozen years teaching at the University of Wisconsin-Madison, five years at the IBM Almaden Research Center working on object-relational databases, and a year and a half at e-commerce platform startup Propel Software during the infamous 2000-2001 Internet bubble. Dr. Carey is an ACM Fellow, an IEEE Fellow, a member of the National Academy of Engineering, and a recipient of the ACM SIGMOD E.F. Codd Innovations Award. His current interests all center around data-intensive computing and scalable data management (a.k.a. Big Data).

Keynote Talks

Keynote Speakers

John Wilkes (Google)

Abstract

Bio

Michael J. Carey (UC Irvine)

Abstract

Bio

Latest news