For software, my feeling is that this is very much a grey area, so why not make it explicit?
Should we develop licences that are crystal clear around whether you are permitted to use this codebase for the purposes of training models?
(I vote 'yes')
The lack of clear signalling around this topic leaves it ripe for OSS devs and enthusiasts to explore, but extremely scary for commercial entities to navigate. In other words, it's working exactly as GPL should.