Yeah sure, but 2 years ago, video generation was barely even a thing. Of course they aren't perfect, but the major problems are disappearing at an incredible rate.
On top of that, we have models that can generate game environments (i.e. simulations) from images or video: https://video2game.github.io/