i mean i can absolutely imagine incrementally editing image, video, audio content is a bigger ask of a platform than editing text, analogous with the problems with large assets in git (for text we have pretty much perfected diff algorithms, and it's dealing with much less data in the first place)