OpenLLM
🦾 OpenLLM lets developers run any open-source LLMs as OpenAI-compatible API endpoints with a single command.
- 🔬 Build for fast and production usages
- 🚂 Support llama3, qwen2, gemma, etc, and many quantized versions full list
- ⛓️ OpenAI-compatible API
- 💬 Built-in ChatGPT like UI
- 🔥 Accelerated LLM decoding with state-of-the-art inference backends