In my work, I deal with different type of data and try to develop machine learning models to learn relationship within that data. My data consist of a mix of images, tabular data or even text data. During my work I usually deal with issues, where my data runs through different processing pipelines. These pipelines on one hand can take a while (up to several hours) and also renames the files. For my work, I aim to guarantee reproducibility and also share my data.
Therefore, my question: Are there any tools that help machine learning practitioners to simplify there