Oddbean new post about | logout
 I hear my laptop's fans start whirring around when its making a response, I wouldn't be surprised if its doing something locally first. Either the encoding process (words to tokens) or the retrieval (finding relevant documents from a project) 
 retrieval maybe? 

btw have you seen https://www.mixedbread.ai/blog/mxbai-embed-large-v1 
 wait nvm encoding should be first no? Since converting words to tokens is usually needed before retrieval unless the retrieval uses pre-computed embedding, maybe it skips straight to that? Idk