All activity
Nick Stogner
āœ…ļø Drop-in replacement for OpenAI with API compatibility
šŸš€ Serve OSS LLMs on CPUs or GPUs
āš–ļø Autoscaling with scale from 0
šŸ› ļø Zero dependencies (no Istio, Knative, etc.)
šŸ¤– Operates OSS model servers (vLLM and Ollama)
šŸ”‹ Chat UI included
KubeAI: Private Open AI on K8s
KubeAI: Private Open AI on K8s
Serve LLMs privately with an OpenAI API compatible API