In today’s issue of Command Line, I reported that ByteDance has been violating the developer license of both Microsoft and OpenAI by using GPT-generated data to train its own, competing model in China. After my report was published, OpenAI spokesperson Niko Felix sent the following statement confirm...
I thought about that too, but then decided to go with Bender because he's a robot built by an ominous megacorp. Plus he has an attitude that pretty much sums it all up.
I don't know about that. Training your AI on someone else's AI feels a lot like drinking someone else's piss. I doubt you are going to extract much innovation out of that
It works pretty well. You can create a good dataset for a fraction of the effort and price it would have required to do it by hand. The quality is similar. You just have to review each prompt so you don't train your model on bad data.
These kinds of articles are the worst, it's all inferred with no qualified actual sources for the claim. And worst of all, it doesn't make sense.
No one uses Ais to train other ais because it doesn't work. It falls apart really fast and is a well-known problem in the ai community.
Bytedance aren't using gpt genersted data to train their own ai because it doesn't work. They may be accidentally including ai generated data in their own training data, but that's just gonna break their own models, so who even cares.
This isn't corporate espionage. This isn't anything but shit journalism.