1. Do you think model weights can actually be copyrighted?
2. Should they be?
The closest thing to authorship that happens is curation of the training data, but that's too tenuous to support a copyright. For one thing, the curation choices aren't directly targeted to shape the weights themselves, only the output when a separate program runs inference on those weights. And even for the output, there's probably not enough direct control there to support much of a claim of authorship.
And the people building the models had better hope that's true, because if that curation is strong enough to support a copyright, then it's really hard to claim that the contributions from the training data aren't strong enough to make the model a derivative work of every single training item.
2. No. The last thing we need is more ways for people with a bit of capital to get vast sweeping IPR claims.
Also, isn't about time HN got a real Markdown engine?
An ML model has no personality. It is not a legal entity. Also, its outputs are in no way the direct expression of the personality of those who trained it.
It is precisely this what differentiates ML from regular code: With regular code, it’s the programmer who solves the problem in a way that is similarly personal as someone writing a novel or an easy. But with ML, the specific implementation is found by the software itself.
You can’t have it both ways. Both not having to write the solution itself AND claim personal ownership of the results.
(But of course, we’ll see what the courts actually say.)
He said “we have no idea, and we won’t until someone with a boatload of cash burns it in a lawsuit.”
So we might find out soon!
I'd be interested to look and see if there are any old analog computer patents, and how they are worded? Like if the specify list of resistor values, or focus more on architecture and tuning. I think that's a possible precedent to look to.
Under US law, the answer is clearly "no." US requires there to be, at minimum, a "spark of creativity." (Also, as far as legal definition is concerned, machines cannot possess any creativity.) If you try to start doing some creative legal argument to work around that, you're going to run into the issue that the same lines of legal thinking will conclude that you are infringing all of your training set... and trying to argue that infringing more or less everything ever made is somehow fair use is a truly uphill task.
You might be able to make a case for it under jurisdictions that follow the sweat-of-the-brow doctrine, but I know too little about jurisprudence in such jurisdictions to make any confident predictions.
Lawyers will consider how experts make decisions that affects and shapes the (sometimes far-) downstream representation of a valuable thing: thus an mp3 recording, a compiled executable, and, yes, a neural network’s model weights are all going to be in scope of copyright discussions.
As for “should”…I don’t know. I feel like the model I train to synthesise my own voice should be copyrightable. Meanwhile I’m wary of titanic foundation models destroying entire markets in part because of a copyright moat.
As with all IP, it's one thing to be granted intellectual property rights, and a completely different thing to actually go about enforcing those rights. Without access to infringer's source code, it's challenging to make the case.
I can't answer (1) as my opinion is just noise until it works through the legal system. I suspect they will be.
For (2), I think they should. You can't copyright the algorithm (recipe) but you can copyright the output (expression) of the recipe. On that note, this allows for open sourcing of models as well.
If a model is trained on data that one owns the copyright to, it should be copyrightable.
2. Definitely no. Copyright should only apply to things entirely crafted by hand IMO.
2. NO
Absolutely not. Software should not be copyrightable because math equations are not copyrightable. But besides that, ML Models end up as polynomials at the end of the day, and that literally cannot be copyrighted.
Not to mechanically create an index of facts about other art/writing, no matter its usefulness.
1) Purely mechanistic transformations aren't copyrightable, so if a model is determined to be that (it may be debated if it is or isn't), then it has no independent protection from copyright law other than any potential restrictions imposed by fragments of other copyrighted works included in that particular model.
2) On the other hand, even the tiniest bit of creativity is sufficient to make something copyrightable. Some work on ML definitely has some creativity, however is the "creative part" fully within the code used for training and, or is part of that creativity embedded also in the weights output by that code?
3) Facts about a copyrighted work; e.g. the word frequency statistics of a copyrighted book do not violate the work's copyright and also are not copyrightable themselves, so at least some language models - the very simplest word n-gram models, which were used both not that long ago for statistical machine translation and also long, long ago in pre-computer dictionary work - aren't covered by copyright, just as any other collection of facts. Databases may have specific ('sui generis') extra protection granted by law, but then that's a separate debate than general copyright.
4) In general, there are different things which often are conflated but in this complex situation the difference may be meaningful and even critical - e.g. (a) being a derived work of the training data; (b) violating the (limited, explicitly enumerated) exclusive rights of the author of that training data; and (c) being copyrightable itself - those are three separate things that might have different answers for the same type of model.
5) "sweat of the brow" doctrine asserts that (at least in USA) it doesn't matter how much effort and money it took to make something, so the effort/expense of training a model is not relevant to it 'deserving' protection and being copyrightable.
6) Not all creative creation is protected by copyright - the law lists very many protected types, but the rest are not protected unless courts get convinced that they fall under the umbrella of something explicitly listed in copyright law; for example, copyright applies to software because (and only because) program code is considered to be a type of literary work, which is one of the things listed in https://www.copyright.gov/title17/92chap1.html#102 or equivalent legislation elsewhere. Are model weights a type of literary work? That's a tricky question, and my opinion on it doesn't really matter, we'd need to hear a precendent-setting judge assert if they are or not.
... and there's probably many more arguments that should be considered before even trying to answer the original questions. And I think that it's illustrative to see what all the megacorps with megapaid megalawyers are doing; it seems that they're writing all their legal documents from a perspective where models might not be copyrightable and trying to assert whatever conditions they want under some type of contract, since conditions properly agreed under contract law will apply no matter how the copyright law will be interpreted.