KV Cache Headroom Calculator
kvcache.alasdairjohnstone.com ↗Compare AMD MI355X vs NVIDIA B200 HBM headroom for serving open-source LLMs — free KV-cache memory, max concurrent requests, and max context, with every model config sourced from HuggingFace.
Not a Yacht
notayacht.org ↗An ongoing side project.