I don’t think it’s going to be public data alone. I think it’s going to be DMs and chats as well. I wondered why Reddit was suddenly pushing chats so much. Well, it makes sense now.
Yeah. I think there is a kind of power grab under way. Social media companies will try to claim that they own the IP rights to the large bodies of text used to train LLMs. That would require producers of LLM software to acquire licensing rights, which will cost many millions, which in turn restricts the free use of LLMs and, in general, any AI software that requires training data.
The end result is that as the "means of production" become less based on human work, the "means of generation" and AI will be controlled by the capitalists. If you can turn something into a commodity (like knowledge with patents and IP), you can control it. That leads to a darker timeline.