

I’m somewhat disappointed by the fair use assessment, since I think calling AI models “transformative” is a bit of a stretch from how the term is normally used, but I also see where the judge is coming from. Would the analytics that go into Google’s Ngram word frequency engine be considered infringing? You know, provided we ignore that the fuckers couldn’t be bothered to find a single goddamn copy of the book they wanted to feed into the data shredder.
We were joking about this last week, if memory serves, but at least one person out there has started a rough aggregator of different sources of pre-AI internet dumps.
It’s all gotta be in the models by now, but it’s gonna be a cool resource for something, right?