StreamingLLM – Efficient streaming language models with attention sinks, so your favourite LLM can handle inputs of unbounded length https://github.com/mit-han-lab/streaming-llm #ycombinator