Performance of Vulkan backend looks amazing #11918

foldl · 2025-02-17T04:22:43Z

I have just updated chatllm.cpp to use ggml at the latest commit (0f2bbe6). Performance of the Vulkan backend looks amazing: it is much faster than the CUDA backend (in this test, to be precise).

Command line options: -m qwen2.5-1.5b.bin -ngl all -t 0 -p "write a quick sort function in python" --max_length 200

Hardware: 2080 Ti with 22 GB.

Backend	t/s (prompt eval)	t/s (generation)
CUDA	335	77
Vulkan	802	103
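
To put the table in relative terms, here is a minimal sketch that computes the Vulkan/CUDA throughput ratio for each phase. The numbers are copied from the table above; the names `results` and `speedup` are hypothetical helpers for illustration and are not part of chatllm.cpp or ggml.

```python
# Throughput figures copied from the table above (tokens per second).
# The dictionary layout and key names are assumptions made for this sketch.
results = {
    "CUDA": {"prompt_eval": 335, "generation": 77},
    "Vulkan": {"prompt_eval": 802, "generation": 103},
}


def speedup(results, baseline, candidate):
    """Return the candidate/baseline throughput ratio for each phase."""
    return {
        phase: results[candidate][phase] / results[baseline][phase]
        for phase in results[baseline]
    }


ratios = speedup(results, "CUDA", "Vulkan")
for phase, ratio in ratios.items():
    print(f"{phase}: Vulkan is {ratio:.2f}x CUDA")
```

This works out to roughly 2.39x for prompt eval and 1.34x for generation. These ratios come from a single run with one model and one prompt, so they describe this test only.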
