Commit 5129314

Document Sync by Tina
1 parent 29eec69 commit 5129314

2 files changed: 1 addition, 1 deletion

docs/stable/intro.md

Lines changed: 1 addition & 1 deletion
@@ -5,7 +5,7 @@ sidebar_position: 1
 # Serverless LLM
 
 <!-- Scaled logo -->
-<img src="/img/serverlessllm.jpg" alt="ServerlessLLM" width="256px"/>
+<img src="../images/serverlessllm.jpg" alt="ServerlessLLM" width="256px"/>
 
 ServerlessLLM is a **fast** and **easy-to-use** serving system designed for **affordable** multi-LLM serving, also known as LLM-as-a-Service. ServerlessLLM is ideal for environments with multiple LLMs that need to be served on limited GPU resources, as it enables efficient dynamic loading of LLMs onto GPUs. By elastically scaling model instances and multiplexing GPUs, ServerlessLLM can significantly reduce costs compared to traditional GPU-dedicated serving systems while still providing low-latency (Time-to-First-Token, TTFT) LLM completions.
 

img

100 KB
Binary file not shown.
