HACKER Q&A
📣 elric

FOSS licenses which disallow inclusion in Machine Learning datasets?


There's been a lot of discussion on HN about the fairness and legality of Github's usage of open source software in the copilot training data.

Are there any "FOSS" licenses which are free towards humans but restrictive towards machine learning/AI models? Could we even call such a thing an open source license?


  👤 RobotToaster Accepted Answer ✓
Not so much disallows, but it reminds me of a specific additional clause in the reprap license[1]:

"If any part of RepRap covered by the GPL is used to train any AI, then all the products of that AI are derivative works of RepRap and must comply with the parts of the GPL on derivative works."

[1]: https://reprap.org/wiki/RepRapGPLLicence


👤 pabs3
Such a license would not meet the Open Source Definition, in particular items 5 and 6:

https://opensource.org/osd