I am looking for advice and resources on versioning and managing machine learning models and data.
Thanks in advance.
However, if you're more than two people, the increase in complexity is substantial, because the workflows are different.
We use our own machine learning platform, https://iko.ai, to do machine learning projects. What we've done is leverage our experience for the past seven years doing ML for large organizations, and we're removing every part that sucked and offloading it to the platform.
It really depends what kind of work you're doing. If you're experimenting in the week-end on toy projects with no stakes, then anything goes. If you're on a team and you need access to data, collaborate on notebooks, have fresh environments, track experiments, launch training jobs, deploy and monitor models, the complexity is exponential.