
Update vLLM version to v0.9.1 #2061


Open · wants to merge 5 commits into main


Conversation

CICD-at-OPEA (Collaborator)

Update vLLM version to v0.9.1

Signed-off-by: CICD-at-OPEA <[email protected]>

github-actions bot commented Jun 10, 2025

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

None

louie-tsai (Collaborator) commented Jun 13, 2025

This happens because core binding requires privileged mode to run the numa_migrate_pages call; vllm-project/vllm#19241 resolved the issue. There are two options here (see the sketch after this list):

  1. Run vLLM in privileged mode.
  2. Disable core binding in vLLM by setting VLLM_CPU_OMP_THREADS_BIND=all and keeping TP/PP=1.
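
For illustration, a minimal docker-compose sketch of the two alternatives; the service name, image tag, and model variable below are assumptions for the example, not values from this PR:

```yaml
services:
  vllm-service:
    image: opea/vllm:latest            # illustrative image tag, not from this PR
    # Option 1: run privileged so core binding can issue numa_migrate_pages
    privileged: true
    # Option 2 (instead of option 1): disable core binding via the env var
    # and keep tensor/pipeline parallelism at 1
    environment:
      VLLM_CPU_OMP_THREADS_BIND: all
    command: --model ${LLM_MODEL_ID} --tensor-parallel-size 1 --pipeline-parallel-size 1
```

In practice you would pick one of the two options, not both; they are shown in a single service block here only for compactness.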

yinghu5 requested a review from louie-tsai on June 16, 2025 at 01:02
yinghu5 requested reviews from yinghu5 and Copilot on June 16, 2025 at 02:01
Copilot AI (Contributor) left a comment


Pull Request Overview

This PR updates the vLLM version from v0.9.0.1 to v0.9.1 across multiple test scripts, docker-compose files, and environment configuration files.

  • Updated vLLM version in test scripts for WorkflowExecAgent, VisualQnA, HybridRAG, DocSum, CodeTrans, CodeGen, ChatQnA, AudioQnA, etc.
  • Added a new environment variable (VLLM_CPU_OMP_THREADS_BIND) in docker-compose files.
  • Updated the build script environment variable in .github/env/_build_image.sh.

Reviewed Changes

Copilot reviewed 19 out of 19 changed files in this pull request and generated no comments.

Summary per file:

| File | Description |
| --- | --- |
| WorkflowExecAgent/tests/2_start_vllm_service.sh | Updated vLLM version. |
| VisualQnA/tests/test_compose_on_xeon.sh | Updated vLLM version. |
| HybridRAG/tests/test_compose_on_gaudi.sh | Updated vLLM version. |
| DocSum/tests/test_compose_on_xeon.sh | Updated vLLM version. |
| CodeTrans/tests/test_compose_on_xeon.sh | Updated vLLM version. |
| CodeGen/tests/test_compose_on_xeon.sh | Updated vLLM version. |
| ChatQnA/tests/test_compose_without_rerank_on_xeon.sh | Updated vLLM version. |
| ChatQnA/tests/test_compose_qdrant_on_xeon.sh | Updated vLLM version. |
| ChatQnA/tests/test_compose_pinecone_on_xeon.sh | Updated vLLM version. |
| ChatQnA/tests/test_compose_on_xeon.sh | Updated vLLM version. |
| ChatQnA/tests/test_compose_milvus_on_xeon.sh | Updated vLLM version. |
| ChatQnA/tests/test_compose_mariadb_on_xeon.sh | Updated vLLM version. |
| ChatQnA/tests/test_compose_faqgen_tgi_on_xeon.sh | Updated vLLM version. |
| ChatQnA/tests/test_compose_faqgen_on_xeon.sh | Updated vLLM version. |
| AudioQnA/tests/test_compose_on_xeon.sh | Updated vLLM version. |
| AudioQnA/tests/test_compose_multilang_on_xeon.sh | Updated vLLM version. |
| AudioQnA/docker_compose/intel/cpu/xeon/compose_multilang.yaml | Added VLLM_CPU_OMP_THREADS_BIND variable. |
| AudioQnA/docker_compose/intel/cpu/xeon/compose.yaml | Added VLLM_CPU_OMP_THREADS_BIND variable. |
| .github/env/_build_image.sh | Updated vLLM version in the environment export. |
Comments suppressed due to low confidence (3)

AudioQnA/docker_compose/intel/cpu/xeon/compose_multilang.yaml:47

  • [nitpick] Consider adding a comment or updating related documentation to clarify the purpose of the new VLLM_CPU_OMP_THREADS_BIND variable, ensuring maintainability of the configuration.
VLLM_CPU_OMP_THREADS_BIND: all

AudioQnA/docker_compose/intel/cpu/xeon/compose.yaml:43

  • [nitpick] Consider adding a comment or updating related documentation to clarify the purpose of adding VLLM_CPU_OMP_THREADS_BIND to the service configuration for easier future maintenance.
VLLM_CPU_OMP_THREADS_BIND: all
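
For illustration only, the inline comment these two nitpicks ask for could look like the following sketch (the wording is hypothetical, not part of the PR):

```yaml
environment:
  # Bind OpenMP threads across all cores, disabling vLLM's per-core binding,
  # which would otherwise require privileged mode for the numa_migrate_pages
  # call (see vllm-project/vllm#19241).
  VLLM_CPU_OMP_THREADS_BIND: all
```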

.github/env/_build_image.sh:5

  • Ensure that VLLM_FORK_VER (currently set to v0.6.6.post1+Gaudi-1.20.0) remains compatible with the updated vLLM version v0.9.1 to prevent any integration issues.
export VLLM_VER=v0.9.1

yinghu5 (Collaborator) commented Jun 16, 2025

@louie-tsai Thank you very much for the solution; both options are workable. Considering several related factors, though, we may upgrade the vLLM Docker image in the next release.
