Oddbean new post about | logout
 I read the source on wikistr for the kind 1987 specification.
I like the idea about precalculating embeddings when generating content for efficiency.

One main question though:
You cannot avoid forged embeddings, can you?
This is SEO back to field one. The content “does not speak for itself”.
You cannot get around some concept of source reputation, I guess.

Or is there a fast way to check on an embedding by verifying it against the content? Like we do for hashes?