Exploratory Data Science
on Raw Data
Publications
2022
Materialization and Reuse Optimizations for Production Data Science Pipelines
SIGMOD 2022
Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Zoi Kaoudi, Tilmann Rabl, Volker Markl
DORIAN in action: Assisted Design of Data Science Pipelines
VLDB 2022
Sergey Redyuk, Zoi Kaoudi, Sebastian Schelter, Volker Markl
2021
Continuous Training and Deployment of Deep Learning Models.
Datenbank-Spektrum 2021
Ioannis Prapas, Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Volker Markl
ExDRa: Exploratory Data Science on Federated Raw Data.
SIGMOD 2021
Sebastian Baunsgaard, Matthias Boehm, Ankit Chaudhary, Behrouz Derakhshan, Stefan Geißelsöder, Philipp Grulich, Michael Hildebrand, Kevin Innerebner, Volker Markl, Claus Neubauer, Sarah Osterburg, Olga Ovcharenko, Sergey Redyuk, Tobias Rieger, Alireza Rezaei Mahdiraji, Sebastian Benjamin Wrede, Steffen Zeuch
LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems.
SIGMOD 2021
Arnab Phani; Benjamin Rath; Matthias Boehm
SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging.
SIGMOD 2021
Svetlana Sagadeeva; Matthias Boehm
2020
A Survey of Adaptive Sampling and Filtering Algorithms for the Internet of Things.
DEBS 2020
Dimitrios Giouroukis, Alexander Dadiani, Jonas Traub, Steffen Zeuch, Volker Markl
Grizzly: Efficient Stream Processing Through Adaptive Query Compilation.
SIGMOD 2020
Philipp M. Grulich, Sebastian Breß, Steffen Zeuch, Jonas Traub, Janis von Bleichert, Zongxiong Chen, Tilmann Rabl, Volker Markl
Rhino: Efficient Management of Very Large Distributed State for Stream Processing Engines.
SIGMOD 2020
Bonaventura Del Monte, Steffen Zeuch, Tilmann Rabl, Volker Markl
Optimizing Machine Learning Workloads in Collaborative Environments.
SIGMOD 2020
Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Ziawasch Abedjan, Tilmann Rabl, Volker Markl
SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle.
CIDR 2020
Matthias Boehm, Iulian Antonov, Mark Dokter, Robert Ginthör, Kevin Innerebner, Florijan Klezin, Stefanie Lindstaedt, Arnab Phani, Benjamin Rath
The NebulaStream Platform: Data and Application Management for the Internet of Things.
CIDR 2020
Steffen Zeuch, Ankit Chaudhary, Bonaventura Del Monte, Haralampos Gavriilidis,
Dimitrios Giouroukis, Philipp M. Grulich, Sebastian Breß, Jonas Traub, Volker Markl
2019
AJoin: Ad-hoc Stream Joins at Scale.
VLDB 2019
Jeyhun Karimov, Tilmann Rabl, Volker Markl