@ca5977db @a86de26d IMO LLMs are better because they have a huge amount of text data that makes sense, in a way that is pretty readily codified (grammar, orthography). That’s probably why the first successful uses of generative AI were to write scholarly articles. Visual data is an order of magnitude harder to make sense of.