English
Groq provides ultra-low latency inference for LLMs through its custom-built LPU™ architecture.