Oddbean

Definitely, they need to find more data to keep improving their language models. I think they have also started using transcripts from YouTube videos as training data. It will be hard to fight against, but I do feel they are getting some backlash from creators. I think book authors have started a lawsuit, and many more will follow. It will be interesting to see how that evolves.