How to use streaming in sync? #1502

Open
vitalik opened this issue Apr 16, 2025 · 2 comments

vitalik commented Apr 16, 2025

Question

I can't find any example of how to consume streaming text chunks in a non-async context.

I'm trying something like this:

import asyncio
from datetime import datetime
from pydantic_ai import Agent


def get_event_loop():
    try:
        event_loop = asyncio.get_event_loop()
    except RuntimeError:
        event_loop = asyncio.new_event_loop()
        asyncio.set_event_loop(event_loop)
    return event_loop


agent = Agent(
    'openai:gpt-4o',
    system_prompt=(
        'You are a helpful assistant that always answers with too many words, sometimes losing context but finally getting the right answer. '
        'Also always call the tool get_current_time to find the current time to check that the answer makes sense. '
        'Basically be a bit annoying but always correct. The answer must be at least three paragraphs long.'
    ),
)


@agent.tool
def get_current_time(ctx) -> str:
    """Get the current time."""
    print(' --> call get_current_time <--')
    return str(datetime.now())


async def agent_stream_deltas(agent):
    async with agent.run_stream('What is the capital of the UK?') as response:
        async for chunk in response.stream_text(delta=True):
            yield chunk


def agent_stream_sync(agent):
    loop = get_event_loop()
    gen = agent_stream_deltas(agent)
    while True:
        try:
            chunk = loop.run_until_complete(gen.__anext__())
            yield chunk
        except StopAsyncIteration:
            pass


if __name__ == '__main__':
    for chunk in agent_stream_sync(agent):
        print(chunk, end='', flush=True)

It prints chunks as they arrive, but at the end it crashes/freezes:

 --> call get_current_time <--
The capital of the United Kingdom
,...
. The city is an exuberant mix of old and new, tradition and innovation, known for its diverse communities and vibrant urban fabric. Therefore, while my explanation may have taken you on a slightly winding path filled with relevant details, London stands firmly as the city's heart and soul of the United Kingdom.
Failed to detach context

Traceback (most recent call last):
  File "/private/tmp/aa_differencingly_kusti/.venv/lib/python3.12/site-packages/opentelemetry/context/__init__.py", line 155, in detach
    _RUNTIME_CONTEXT.detach(token)
  File "/private/tmp/aa_differencingly_kusti/.venv/lib/python3.12/site-packages/opentelemetry/context/contextvars_context.py", line 53, in detach
    self._current_context.reset(token)
ValueError: <Token var=<ContextVar name='current_context' default={} at 0x1020d87c0> at 0x104550c00> was created in a different Context

Additional Context

No response

vitalik added the question label Apr 16, 2025
Kludex added the asyncio label Apr 17, 2025
Kludex self-assigned this Apr 17, 2025
DouweM commented Apr 17, 2025

@vitalik Thanks for reporting this!

There are two things going on here:

  1. The script freezes because it never breaks out of the while True loop: in except StopAsyncIteration, can you change pass to break? (See the sketch after this list.)

  2. agent.run_stream automatically wraps the work in a logfire span (even if you're not using logfire), and these cannot contain the yield keyword as explained on https://logfire.pydantic.dev/docs/reference/advanced/generators/.

    As @alexmojaki wrote in a comment on #674 (Handling Incorrect Error Type in validate_structured_result Causes Context Detachment Issue), which hits the same error:

    The reason is that agent.run_stream opens the span and relies on with to ensure it closes nicely. That doesn't happen in the last case because it's in a generator that gets suspended when the loop breaks and never resumes. Instead it gets closed by garbage collection in a different context, hence the error.

    There's more context on the OTel issue tracker as well: Runtime context fails to detach token (open-telemetry/opentelemetry-python#2606)
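
Putting both fixes together, here's a minimal sketch of your wrapper (same names as in your example, not an official API) with the break in place and the async generator closed explicitly on the same loop, so the span is exited in the context it was created in:

def agent_stream_sync(agent):
    loop = get_event_loop()
    gen = agent_stream_deltas(agent)
    try:
        while True:
            try:
                # Drive the async generator one step at a time on the loop.
                chunk = loop.run_until_complete(gen.__anext__())
                yield chunk
            except StopAsyncIteration:
                break  # the stream is exhausted; leave the loop
    finally:
        # Finalize the generator on the same loop instead of leaving it to
        # garbage collection in a different context, which is what produces
        # the "Failed to detach context" error.
        loop.run_until_complete(gen.aclose())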

The error is harmless, but obviously distracting and a bit scary-looking, so we're going to do two things:

  1. Stop creating a logfire span unless necessary. This will make your example work without any issues, but it's obviously not a complete solution because it'd still affect those using logfire, which we want to be everyone.
  2. Natively support running PydanticAI in a synchronous context, so you don't have to manually do things like loop.run_until_complete(gen.__anext__()): see Feature Request: Synchronous Calls (#934).

@alexmojaki I think you mentioned you're going to look at 1, and @Kludex is already looking into 2.
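
In the meantime, if you don't need to interleave the chunks with other synchronous work, the simplest workaround is to keep the whole stream inside a single coroutine and drive it with one asyncio.run call, so everything runs and is finalized in the same context. Here's a sketch using the same API as in your example:

import asyncio

async def stream_and_print(agent):
    # Everything, including the span opened by run_stream, starts and
    # finishes inside this one coroutine, on one event loop.
    async with agent.run_stream('What is the capital of the UK?') as response:
        async for chunk in response.stream_text(delta=True):
            print(chunk, end='', flush=True)

asyncio.run(stream_and_print(agent))  # `agent` as defined in your example

(For non-streaming runs, agent.run_sync already covers the synchronous case.)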

cspiecker commented

I’m also running into this issue. Would love to see proper sync support or a clean fix for the span issue. Thanks for looking into it!
