Exploratory Data Science on Raw Data

Publications

2022

Materialization and Reuse Optimizations for Production Data Science Pipelines.
SIGMOD 2022

Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Zoi Kaoudi, Tilmann Rabl, Volker Markl

DORIAN in action: Assisted Design of Data Science Pipelines.
VLDB 2022

Sergey Redyuk, Zoi Kaoudi, Sebastian Schelter, Volker Markl

2021

Continuous Training and Deployment of Deep Learning Models.
Datenbank-Spektrum 2021

Ioannis Prapas, Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Volker Markl

ExDRa: Exploratory Data Science on Federated Raw Data.
SIGMOD 2021

Sebastian Baunsgaard, Matthias Boehm, Ankit Chaudhary, Behrouz Derakhshan, Stefan Geißelsöder, Philipp Grulich, Michael Hildebrand, Kevin Innerebner, Volker Markl, Claus Neubauer, Sarah Osterburg, Olga Ovcharenko, Sergey Redyuk, Tobias Rieger, Alireza Rezaei Mahdiraji, Sebastian Benjamin Wrede, Steffen Zeuch

LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems.
SIGMOD 2021

Arnab Phani, Benjamin Rath, Matthias Boehm

SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging.
SIGMOD 2021

Svetlana Sagadeeva, Matthias Boehm

2020

A Survey of Adaptive Sampling and Filtering Algorithms for the Internet of Things.
DEBS 2020

Dimitrios Giouroukis, Alexander Dadiani, Jonas Traub, Steffen Zeuch, Volker Markl

Grizzly: Efficient Stream Processing Through Adaptive Query Compilation.
SIGMOD 2020

Philipp M. Grulich, Sebastian Breß, Steffen Zeuch, Jonas Traub, Janis von Bleichert, Zongxiong Chen, Tilmann Rabl, Volker Markl

Rhino: Efficient Management of Very Large Distributed State for Stream Processing Engines.
SIGMOD 2020

Bonaventura Del Monte, Steffen Zeuch, Tilmann Rabl, Volker Markl

Optimizing Machine Learning Workloads in Collaborative Environments.
SIGMOD 2020

Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Ziawasch Abedjan, Tilmann Rabl, Volker Markl

SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle.
CIDR 2020

Matthias Boehm, Iulian Antonov, Mark Dokter, Robert Ginthör, Kevin Innerebner, Florijan Klezin, Stefanie Lindstaedt, Arnab Phani, Benjamin Rath

The NebulaStream Platform: Data and Application Management for the Internet of Things.
CIDR 2020

Steffen Zeuch, Ankit Chaudhary, Bonaventura Del Monte, Haralampos Gavriilidis, Dimitrios Giouroukis, Philipp M. Grulich, Sebastian Breß, Jonas Traub, Volker Markl

2019

AJoin: Ad-hoc Stream Joins at Scale.
VLDB 2019

Jeyhun Karimov, Tilmann Rabl, Volker Markl