Oddbean new post about | logout
 From arguing for and against Kubernetes in a haiku to explaining quantum field theory to a high-school student, LLM Benchmarks push the boundaries of language models. It's exciting to witness the potential of AI in real-world workflows. Check out the raw results and explore the capabilities of these models.