server: continue to update other slots on embedding concurrent request #5699


Merged: 3 commits merged into master on Feb 24, 2024

Conversation

@phymbert (Collaborator) commented Feb 24, 2024

Context

If multiple slots are computing embeddings concurrently, only the first one is updated.

Changes

Continue updating the remaining slots in update_slots in the main loop when handling embedding tasks.
The test scenario was moved to the parallel feature.

Closes #5655
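The shape of the fix can be illustrated with a simplified, hypothetical sketch (the struct and function names below are stand-ins, not the actual server.cpp types): before the change, the slot loop effectively stopped after handling the first embedding slot, so concurrent embedding requests in other slots were never updated; the fix keeps iterating so every embedding slot is processed in the same pass.

```cpp
#include <vector>

// Hypothetical, simplified model of a server slot (not the real
// llama.cpp server.cpp types).
struct Slot {
    bool is_embedding = false;  // slot is running an embedding task
    bool updated      = false;  // embedding result was written back
};

// Sketch of the fixed behavior: process *all* embedding slots in one
// pass instead of exiting after the first one. Returns the number of
// slots updated.
inline int update_slots(std::vector<Slot> & slots) {
    int n_updated = 0;
    for (auto & slot : slots) {
        if (!slot.is_embedding) {
            continue;
        }
        // Stand-in for computing the embedding and sending the result.
        slot.updated = true;
        ++n_updated;
        // The bug was equivalent to an early exit here after the first
        // embedding slot; removing it lets the remaining slots with
        // concurrent embedding requests be updated as well.
    }
    return n_updated;
}
```

With three slots where slots 0 and 2 hold embedding requests, both are updated in a single pass rather than only slot 0.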

Commits:
…t request.
server: tests: add multi users embeddings as fixed
@phymbert phymbert requested review from ggerganov and ngxson February 24, 2024 12:05
@phymbert phymbert added bug Something isn't working server/webui labels Feb 24, 2024
@ggerganov (Member) left a comment:

Let's go 🚀

@phymbert (Collaborator, Author) commented:

I will take advantage of this PR to add an OAI-compatible concurrent embeddings scenario.

@ngxson (Collaborator) left a comment:

LGTM. Thanks!

@phymbert phymbert merged commit 9e359a4 into master Feb 24, 2024
@phymbert phymbert deleted the hotfix/server-issue-5655-concurrent-embedding-final branch February 24, 2024 18:16
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024

server: continue to update other slots on embedding concurrent request (ggml-org#5699)

* server: ggml-org#5655 - continue to update other slots on embedding concurrent request.

* server: tests: add multi users embeddings as fixed

* server: tests: adding OAI compatible embedding concurrent endpoint

* server: tests: adding OAI compatible embedding with multiple inputs
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024

server: continue to update other slots on embedding concurrent request (ggml-org#5699)

* server: ggml-org#5655 - continue to update other slots on embedding concurrent request.

* server: tests: add multi users embeddings as fixed

* server: tests: adding OAI compatible embedding concurrent endpoint

* server: tests: adding OAI compatible embedding with multiple inputs
Labels
bug Something isn't working server/webui
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Segmentation fault
3 participants