fix support for `litserve>0.2.4` #1994

ali-alshaar7 · 2025-04-03T01:41:33Z

LitGPT CI fails with latest LitServe (>=0.2.4), this PR fixes that.

The real issue was that LitGPT was always failing with LitServe but the tests were flaky and they passed. Post 0.2.4, LitServe added the mechanism to check if the inference worker is alive before starting the uvicorn worker and it made the LitGPT CI fail.

for more information, see https://pre-commit.ci

t-vi · 2025-04-03T21:15:35Z

Do we expect the serving to work on macos? that seems to be failing.

ali-alshaar7 · 2025-04-03T21:25:57Z

Do we expect the serving to work on macos? that seems to be failing.

Yeah, passes locally.

for more information, see https://pre-commit.ci

aniketmaurya

Love this @ali-alshaar7!! thank you so much 🚀

Also, can we set the version >=0.2.7? This way we will know if in case LitGPT stops working for a new version of LitServe again?

Borda · 2025-04-07T10:45:50Z

@aniketmaurya do you see any reason why it may hang/fail on Mac?

aniketmaurya · 2025-04-07T11:07:14Z

reran with debugging enabled @Borda! probably some issue with LitGPT setting up the devices. Ali mentioned that it works locally. will debug this in depth later today.

…to alia/lsup

aniketmaurya · 2025-04-07T11:56:10Z

I get this error locally:

 File "/Users/aniket/Projects/github/litgpt/litgpt/config.py", line 149, in from_file
    return cls(**file_kwargs)
           ^^^^^^^^^^^^^^^^^^
TypeError: Config.__init__() got an unexpected keyword argument 'sliding_window_layer_placing'

investigating more cc @ali-alshaar7 @k223kim

k223kim · 2025-04-07T11:57:23Z

@aniketmaurya Did you pull the main branch? That's coming from my gemma PR

t-vi · 2025-04-07T12:10:33Z

Also, can we set the version >=0.2.7? This way we will know if in case LitGPT stops working for a new version of LitServe again?

Note that the useful CI relation is opposite to >= dependencies: If we want to declare a >= 0.2.7 (or whatever version should be the minimum) dependency on LitServe, LitServe should ideally gain a test that updates to it don't break LitGPT.
If we don't have that, it seems more prudent to limit LitGPT compat to what we know we work with and then bump the dependency in a PR (that will check compat in the CI tests).

aniketmaurya · 2025-04-07T12:24:08Z

right @t-vi, we should aim for a test to ensure this!

Borda · 2025-04-07T13:57:34Z

we should aim for a test to ensure this!

it did not help so let's revert it back and try to debug it...

aniketmaurya · 2025-04-07T15:51:36Z

seems like the server process is not terminated properly on macos - ERROR: [Errno 48] Address already in use

ali-alshaar7 · 2025-04-07T16:10:30Z

seems like the server process is not terminated properly on macos - ERROR: [Errno 48] Address already in use

Hey, thanks for looking into this. I did add a kill_process_tree which seemed to resolve that on Ubuntu and windows. In any case, it shouldn't fail the test, since if there's already a server at that port, the test can just ping that and pass right?

aniketmaurya · 2025-04-07T16:37:37Z

I think uvicorn runs in the main process and rest of the FastAPI logic lies in the child processes so if the child processes die then it won't be reachable.

Maybe try using this logic from LitServe tests for running and stopping the tests?

ali-alshaar7 · 2025-04-07T16:40:55Z

I think uvicorn runs in the main process and rest of the FastAPI logic lies in the child processes so if the child processes die then it won't be reachable.

Yeah, so if we kill all the children, we should free the port and be able to spin up a new server and so we won't have this issue right? that's what kill_process_tree does.

aniketmaurya · 2025-04-07T16:41:27Z

yup, you're right! theoretically that's how things should go.

Borda · 2025-04-07T19:29:22Z

@aniketmaurya would be nice if litServe has a function to properly terminate itself :)

ali-alshaar7 requested review from lantiga and t-vi as code owners April 3, 2025 01:41

ali-alshaar7 changed the title ~~upgrade lit serve~~ upgrade litserve Apr 3, 2025

Borda approved these changes Apr 3, 2025

View reviewed changes

ali-alshaar7 force-pushed the alia/lsup branch from 534d793 to 5ffe470 Compare April 3, 2025 04:00

Ali Alshaarawy added 9 commits April 3, 2025 10:48

upgrade litserve

510c756

handle cpu and mps in llm api

f004409

dont timeout serving tests since wiont poersist if failed on setup

9bd4208

print

2c72d55

/teamspace/studios/this_studio/litgpt/.azure/gpu-test.yml

346bc8d

increase sleep

7b9051a

bump timeouts

8d871c8

revert

1c7079b

bump timeout

967e28f

ali-alshaar7 force-pushed the alia/lsup branch from cdfd939 to 967e28f Compare April 3, 2025 14:55

ali-alshaar7 and others added 5 commits April 3, 2025 11:35

add logging

7798912

[pre-commit.ci] auto fixes from pre-commit.com hooks

47f1529

for more information, see https://pre-commit.ci

Merge branch 'main' into alia/lsup

6a9d680

Merge branch 'main' into alia/lsup

46c2e95

update

08f2779

ali-alshaar7 force-pushed the alia/lsup branch 2 times, most recently from 0133554 to 7270c23 Compare April 4, 2025 02:15

ali-alshaar7 added 4 commits April 3, 2025 22:41

Your new commit message

bbf1449

reset test

4361aa8

update tests

771bb09

update tests

78d067d

ali-alshaar7 force-pushed the alia/lsup branch from d95814f to 78d067d Compare April 4, 2025 02:49

add test

019db38

pre-commit-ci bot and others added 3 commits April 4, 2025 14:58

[pre-commit.ci] auto fixes from pre-commit.com hooks

afbefa1

for more information, see https://pre-commit.ci

Merge branch 'main' into alia/lsup

a785665

Merge branch 'main' into alia/lsup

2c7f9a9

aniketmaurya approved these changes Apr 6, 2025

View reviewed changes

Borda added 2 commits April 7, 2025 13:32

Merge branch 'alia/lsup' of https://github.com/Lightning-AI/litgpt in…

056d5ab

…to alia/lsup

_wait_and_check_response

0df32c0

k223kim and others added 2 commits April 7, 2025 12:57

Merge branch 'main' into alia/lsup

60fe8c3

increase the timeout for macos

334f728

aniketmaurya changed the title ~~upgrade litserve~~ fix support for litserve>0.2.4 Apr 7, 2025

Merge branch 'main' into alia/lsup

9cbdd69

Borda and others added 2 commits April 15, 2025 12:22

@pytest.mark.xfail

6801fa9

Merge branch 'main' into alia/lsup

bcd44e4

Borda enabled auto-merge (squash) April 15, 2025 10:22

@pytest.mark.xfail

7664ac4

Borda merged commit 3d66f32 into main Apr 15, 2025
15 checks passed

Borda deleted the alia/lsup branch April 15, 2025 11:40

bhimrazy mentioned this pull request May 16, 2025

litserve version constraint #2045

Closed

fix support for litserve>0.2.4 #1994

fix support for litserve>0.2.4 #1994

Uh oh!

Conversation

ali-alshaar7 commented Apr 3, 2025 • edited by aniketmaurya Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

t-vi commented Apr 3, 2025 • edited by ali-alshaar7 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ali-alshaar7 commented Apr 3, 2025

Uh oh!

aniketmaurya left a comment

Choose a reason for hiding this comment

Uh oh!

Borda commented Apr 7, 2025

Uh oh!

aniketmaurya commented Apr 7, 2025

Uh oh!

aniketmaurya commented Apr 7, 2025

Uh oh!

k223kim commented Apr 7, 2025

Uh oh!

t-vi commented Apr 7, 2025

Uh oh!

aniketmaurya commented Apr 7, 2025

Uh oh!

Borda commented Apr 7, 2025

Uh oh!

aniketmaurya commented Apr 7, 2025

Uh oh!

ali-alshaar7 commented Apr 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aniketmaurya commented Apr 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ali-alshaar7 commented Apr 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aniketmaurya commented Apr 7, 2025

Uh oh!

Borda commented Apr 7, 2025

Uh oh!

Uh oh!

Uh oh!

fix support for `litserve>0.2.4` #1994

fix support for `litserve>0.2.4` #1994

ali-alshaar7 commented Apr 3, 2025 •

edited by aniketmaurya

Loading

t-vi commented Apr 3, 2025 •

edited by ali-alshaar7

Loading

ali-alshaar7 commented Apr 7, 2025 •

edited

Loading

aniketmaurya commented Apr 7, 2025 •

edited

Loading

ali-alshaar7 commented Apr 7, 2025 •

edited

Loading