Skip to content
@llm-d

llm-d

llm-d is a Kubernetes-native high-performance distributed LLM inference framework

Pinned Loading

  1. llm-d llm-d Public

    llm-d is a Kubernetes-native high-performance distributed LLM inference framework

    Makefile 1k 59

  2. llm-d-inference-scheduler llm-d-inference-scheduler Public

    Inference scheduler for llm-d

    Go 48 18

  3. llm-d-deployer llm-d-deployer Public

    Helm charts for llm-d

    Shell 36 21

  4. llm-d-kv-cache-manager llm-d-kv-cache-manager Public

    Distributed KV cache coordinator

    Go 31 4

  5. llm-d-model-service llm-d-model-service Public

    Simplified model deployment on llm-d

    Go 21 7

  6. llm-d-benchmark llm-d-benchmark Public

    llm-d benchmark scripts and tooling

    Shell 12 5

Repositories

Showing 10 of 10 repositories

Most used topics

Loading…