- Stars
- 9,701
- License
- Apache-2.0
- Last commit
- 1 hour ago
Best Open-source Model Serving & Inference Platforms tools
Explore curated open-source tools in the Model Serving & Inference Platforms category. Compare technologies, see alternatives, and find the right solution for your workflow.
10+ projects · Page 1 of 1

SGLang
High‑performance serving framework for LLMs and vision‑language models.
- Stars
- 25,010
- License
- Apache-2.0
- Last commit
- 1 hour ago
- Stars
- 74,306
- License
- Apache-2.0
- Last commit
- 1 hour ago

Ray
Scale Python and AI workloads from laptop to cluster effortlessly
- Stars
- 41,853
- License
- Apache-2.0
- Last commit
- 3 hours ago

Triton Inference Server
Unified AI model serving across clouds, edge, and GPUs
- Stars
- 10,466
- License
- BSD-3-Clause
- Last commit
- 3 hours ago

TensorRT LLM
Accelerated LLM inference with NVIDIA TensorRT optimizations
- Stars
- 13,186
- License
- —
- Last commit
- 4 hours ago

LightLLM
Fast, lightweight Python framework for scalable LLM inference
- Stars
- 3,968
- License
- Apache-2.0
- Last commit
- 6 hours ago

BentoML
Unified Python framework for building high‑performance AI inference APIs
- Stars
- 8,542
- License
- Apache-2.0
- Last commit
- 11 hours ago
- Stars
- 4,720
- License
- Apache-2.0
- Last commit
- 17 hours ago

KServe
Unified AI inference platform for generative and predictive workloads on Kubernetes
- Stars
- 5,262
- License
- Apache-2.0
- Last commit
- 1 day ago
- Stars
- 12,183
- License
- Apache-2.0
- Last commit
- 2 days ago

Seldon Core 2
Deploy modular, data-centric AI applications at scale on Kubernetes
- Stars
- 4,736
- License
- —
- Last commit
- 2 days ago

NanoFlow
High‑throughput LLM serving with intra‑device parallelism and asynchronous CPU scheduling
- Stars
- 951
- License
- —
- Last commit
- 4 months ago

FEDML
Unified ML library for scalable training, serving, and federated learning.
- Stars
- 4,021
- License
- Apache-2.0
- Last commit
- 4 months ago
- Stars
- 3,740
- License
- Apache-2.0
- Last commit
- 10 months ago




