AsterixDB: A Counter but Intuitive Approach to Big Data Management
We are living in the Big Data era, and we are witnessing a shift in the role of data management systems: rather than “just” being the systems of record at the heart of traditional enterprises, modern Big Data management systems must model, capture, track, and react to the current state of the world. Doing so requires ingesting event data arriving from a variety of devices, as well as enabling query access to the history of captured data over time. These requirements span a variety of scientific disciplines and domains, including the handling of data produced by a variety of sensors in health care, environmental monitoring, and traffic monitoring applications, the handling of dynamic social network data, and many others.
AsterixDB is an open-source Big Data Management System (BDMS) with a feature set that’s very different from those of other platforms in today’s Big Data ecosystem. The system was initially co-developed by UC Irvine and UC Riverside, starting in 2009 and leading eventually to its first beta release in mid-2013. It has recently moved to Apache, where AsterixDB is now an active incubating project. Many of the system’s key design decisions relate to the aforementioned shift. This talk will first briefly review AsterixDB’s data model, query language, and scale-out architecture. It will then examine a number of counter-cultural aspects of the AsterixDB system, including where its data lives, its runtime architecture, its approach to streaming data, its view of transactions, and its features for handling time-based data.
Michael J. Carey is a Bren Professor of Information and Computer Sciences at UC Irvine. Before joining UCI in 2008, Carey worked at BEA Systems for seven years and led the development of BEA's AquaLogic Data Services Platform product for virtual data integration. He also spent a dozen years teaching at the University of Wisconsin-Madison, five years at the IBM Almaden Research Center working on object-relational databases, and a year and a half at e-commerce platform startup Propel Software during the infamous 2000-2001 Internet bubble. Carey is an ACM Fellow, a member of the National Academy of Engineering, and a recipient of the ACM SIGMOD E.F. Codd Innovations Award. His current interests all center around data-intensive computing and scalable data management (a.k.a. Big Data).