network: general fixes for failure states #10466

Open: wants to merge 6 commits into master

edsiper (Member) commented Jun 12, 2025

fixes: #10458

In addition, this PR includes other fixes: async DNS resolution (remove the stale fd from the event loop), extra validation of the socket status for synchronous network connections, and busy_flag management in the upstream interface.


Fluent Bit is licensed under Apache 2.0, by submitting this pull request I understand that this code will be released under the terms of that license.

edsiper added 2 commits June 12, 2025 11:49
…loop

When dropping a DNS lookup context, check whether the c-ares socket is
still registered in the event loop. If so, remove it from the queue and
set the ares_socket_registered flag to FLB_FALSE.
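
The drop logic described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the Fluent Bit code: the `event_loop` and `dns_lookup` structs and the `loop_*` helpers are hypothetical stand-ins, and only the `ares_socket_registered` flag name comes from the commit message.

```c
#include <stdbool.h>
#include <stddef.h>

#define MAX_FDS 16

/* Hypothetical event loop: just a small set of registered fds. */
struct event_loop {
    int fds[MAX_FDS];
    size_t count;
};

/* Hypothetical DNS lookup context; only the flag name comes from the commit. */
struct dns_lookup {
    int ares_socket_fd;
    bool ares_socket_registered;
};

/* Return the index of fd in the loop, or -1 if it is not registered. */
static int loop_find(const struct event_loop *loop, int fd)
{
    for (size_t i = 0; i < loop->count; i++) {
        if (loop->fds[i] == fd) {
            return (int) i;
        }
    }
    return -1;
}

/* Register fd; fails if the loop is full or fd is already present. */
static bool loop_add(struct event_loop *loop, int fd)
{
    if (loop->count == MAX_FDS || loop_find(loop, fd) >= 0) {
        return false;
    }
    loop->fds[loop->count++] = fd;
    return true;
}

/* Remove fd by moving the last entry into its slot. */
static bool loop_del(struct event_loop *loop, int fd)
{
    int idx = loop_find(loop, fd);

    if (idx < 0) {
        return false;
    }
    loop->fds[idx] = loop->fds[--loop->count];
    return true;
}

/* Drop a lookup: deregister its socket first so no stale fd stays behind. */
static void dns_lookup_drop(struct event_loop *loop, struct dns_lookup *lookup)
{
    if (lookup->ares_socket_registered) {
        loop_del(loop, lookup->ares_socket_fd);
        lookup->ares_socket_registered = false;
    }
    lookup->ares_socket_fd = -1;
}
```

Because the flag is cleared in the same step as the removal, calling `dns_lookup_drop` twice on the same context is harmless in this sketch.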

Signed-off-by: Eduardo Silva <[email protected]>
A race condition can happen with keepalive connections when intermediary
connections end up in a mix of failed states. One example is HTTP output
+ AWS auth, where the AWS auth failed and could not be obtained in the
DNS resolution state.

This commit ensures the busy_flag is set as soon as the connection
is created and only removed when released or an exception happens.

When destroying connections, we now check not only the busy flag but
also that the connection is not assigned to a priority queue.
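
One way to picture the busy_flag lifecycle and the destroy guard is the sketch below. It is an assumption-laden illustration rather than the upstream implementation: the `conn` struct, its `in_priority_queue` and `destroyed` fields, and the `conn_*` functions are hypothetical names; only `busy_flag` is taken from the commit message.

```c
#include <stdbool.h>

/* Hypothetical connection record; only busy_flag is named in the commit. */
struct conn {
    bool busy_flag;          /* set while a caller owns the connection */
    bool in_priority_queue;  /* assumed marker for priority queue membership */
    bool destroyed;
};

/* Creation marks the connection busy immediately, before any async step. */
static void conn_create(struct conn *c)
{
    c->busy_flag = true;
    c->in_priority_queue = false;
    c->destroyed = false;
}

/* Release (or an error path) is the only place the flag is cleared. */
static void conn_release(struct conn *c)
{
    c->busy_flag = false;
}

/* Destroy only when nobody owns the connection and no queue references it. */
static bool conn_try_destroy(struct conn *c)
{
    if (c->busy_flag || c->in_priority_queue) {
        return false;
    }
    c->destroyed = true;
    return true;
}
```

In this model a connection that is still busy, or still queued, survives a destroy attempt, which is the property the commit message describes.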

I also added a new check for the event structure, clearing up its states
before it is used again with the event loop interface.
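
A rough sketch of such a check, assuming an event structure with an fd, a mask and a status field, might look like this. All names here (`struct event`, `event_reset`, `event_prepare_for_reuse`, the `EVENT_*` values) are hypothetical and chosen for illustration; they are not the Fluent Bit event API.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical registration states for an event. */
enum event_status { EVENT_NONE = 0, EVENT_REGISTERED = 1 };

/* Hypothetical event structure handed to the event loop. */
struct event {
    int fd;
    unsigned int mask;
    enum event_status status;
    void *data;
};

/* Put the event back into a known, unregistered state. */
static void event_reset(struct event *ev)
{
    ev->fd = -1;
    ev->mask = 0;
    ev->status = EVENT_NONE;
    ev->data = NULL;
}

/* Check for leftover state before reuse; returns true if any was cleared. */
static bool event_prepare_for_reuse(struct event *ev)
{
    bool stale = (ev->status != EVENT_NONE || ev->mask != 0 || ev->fd != -1);

    if (stale) {
        event_reset(ev);
    }
    return stale;
}
```

Returning whether stale state was found makes it easy for a caller to log or count these cases while debugging.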

The fixes address the issue described in: #10458

Signed-off-by: Eduardo Silva <[email protected]>
Development

Successfully merging this pull request may close these issues.

Fluent Bit 4.0.3 using http plugin sink log to opensearch, randomly restart, [engine] caught signal (SIGSEGV)