llm router instead of just retrying January 22, 2026 • #llm #architecture #echo #rate-limiting rate limits kept killing chat. built a router.
why we built a multi-model LLM router January 22, 2026 • #llm #litellm #vertex-ai #infrastructure #reliability Google's DSQ started throttling us mid-event
designing a transcription format for LLMs September 18, 2025 • #transcription #llm #audio #architecture #design every field you add has a token cost