We build System Calculators that automatically invent, design, and implement data and AI systems for end-to-end AI solutions that meet tailored workload, cloud cost, and performance targets.
Systems are the foundation of the data and AI era. This is how data is stored, how models are created, and how AI and context are managed. In short, systems define what is possible. But there is no single system that can support the massive diversity of data and AI applications, and building new systems takes years. In fact, a single AI system type can have a design space of over 10100 alternatives, yet practice still relies on a handful of “good templates,” each requiring years of manual tuning and suited to narrow scenarios.
DASlab pursues a fundamental shift: self-designing data and AI systems. We build Systems Calculators that unlock data and AI systems design and implementation by enabling understanding and reasoning about the massive design space of systems, i.e., navigating “all” possible ways to design a system.
We treat system design as a language:
We map data-structure design into a structured space of primitives, so we can reason about what exists, predict what’s missing, and ultimately calculate new designs tailored to workload and hardware.
Big data systems sit in the critical path of everything we do, i.e., in businesses, in sciences, as well as in everyday life. The lab's courses offer a comprehensive introduction to modern data systems, and a research-oriented roadmap towards building systems that "scale up" and "scale out".
So far 11 undergraduate DASlab teams have made it to the finals of the ACM SIGMOD Undergraduate Research Competition. We won the first place 6 times in 2016, 2017, 2018, 2019, 2020, and 2022. In 2020, we won both the first and the second place. In 2021, we won the third place.
If you are a Harvard undergrad interested in research with DASlab, taking CS165 and CS265 is the first step.