vulkan: use smaller combined allocations to avoid fragmentation #11551

jeffbolznv · 2025-01-31T15:17:46Z

See #11520 for previous discussion. Large host visible vidmem allocations can lead to fragmentation and the allocations may be placed in sysmem by the OS. This change advertises a smaller size to use for combined allocations, and updates the allocator to try allocations larger than advertised.

slaren

The ggml-alloc changes look good.

ggml/src/ggml-vulkan/ggml-vulkan.cpp

…-org#11551)

jeffbolznv requested review from slaren and 0cc4m January 31, 2025 15:17

jeffbolznv mentioned this pull request Jan 31, 2025

vulkan: Avoid using too much host-visible vidmem, which can lead to fragmentation #11520

Closed

slaren approved these changes Jan 31, 2025

View reviewed changes

github-actions bot added Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Jan 31, 2025

0cc4m reviewed Feb 5, 2025

View reviewed changes

ggml/src/ggml-vulkan/ggml-vulkan.cpp Show resolved Hide resolved

vulkan: use smaller combined allocations to avoid fragmentation

7700971

jeffbolznv force-pushed the smaller_allocations branch from 01413a9 to 7700971 Compare February 5, 2025 14:53

0cc4m approved these changes Feb 6, 2025

View reviewed changes

0cc4m merged commit 1b598b3 into ggml-org:master Feb 6, 2025
46 checks passed

tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025

vulkan: use smaller combined allocations to avoid fragmentation (ggml…

0cf3f98

…-org#11551)

orca-zhang pushed a commit to orca-zhang/llama.cpp that referenced this pull request Feb 26, 2025

vulkan: use smaller combined allocations to avoid fragmentation (ggml…

ce4d852

…-org#11551)

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025

vulkan: use smaller combined allocations to avoid fragmentation (ggml…

0bc4b41

…-org#11551)

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025

vulkan: use smaller combined allocations to avoid fragmentation (ggml…

399002a

…-org#11551)

0cc4m mentioned this pull request Mar 17, 2025

Vulkan: Default to 1GB allocations instead of 4GB to avoid fragmentation and driver issues #12434

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

vulkan: use smaller combined allocations to avoid fragmentation #11551

vulkan: use smaller combined allocations to avoid fragmentation #11551

Uh oh!

jeffbolznv commented Jan 31, 2025

Uh oh!

slaren left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vulkan: use smaller combined allocations to avoid fragmentation #11551

vulkan: use smaller combined allocations to avoid fragmentation #11551

Uh oh!

Conversation

jeffbolznv commented Jan 31, 2025

Uh oh!

slaren left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!