Exploratory Data Science
on Raw Data
of the exdra project is to investigate suitable system support for the exploratory data science process on heterogeneous and distributed raw data sources and to provide a prototype for real-world use cases.
In detail, the approach mainly includes the following research areas:
- Ad-hoc and federated data integration over raw data
- Data organization and reuse of intermediate results
- Horizontal optimizations across the entire data science life cycle
- Request planning for limited access to data