Epoll failure on interrupted system call #134

Open
suola opened this issue Oct 12, 2022 · 3 comments

Comments

@suola

suola commented Oct 12, 2022

I ran massdns for several hours to resolve a large number of domains, and it quit prematurely with

DEBUG Epoll failure: Interrupted system call

This comes from https://github.com/blechschmidt/massdns/blob/master/src/main.c#L2013.

I'm not an expert on this, but maybe epoll_wait() failures due to signal interrupts could be ignored?

At least that's the understanding I got from e.g. https://stackoverflow.com/a/6870391

@blechschmidt
Owner

epoll_wait failures due to signal interrupts only cause this message to be printed. They do not terminate massdns. Since there is no change in state when epoll gets interrupted, the loop will just be re-entered and epoll_wait will be called again.

Did you see the error message only once or did it occur repeatedly? Is there maybe any hint in dmesg that the OOM killer killed massdns?
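For reference, the "just call it again" behaviour can also be written as an explicit retry loop. This is only a minimal sketch and not the code in main.c: `epoll_wait_retry` is a hypothetical helper name, and it assumes that retrying silently on EINTR is acceptable for the caller.

```c
#include <errno.h>
#include <sys/epoll.h>

/* Hypothetical helper, not part of massdns: call epoll_wait() again
 * whenever it was interrupted by a signal (errno == EINTR). Any other
 * failure is returned to the caller with errno left untouched. */
static int epoll_wait_retry(int epfd, struct epoll_event *events,
                            int maxevents, int timeout_ms)
{
    int ready;

    do {
        ready = epoll_wait(epfd, events, maxevents, timeout_ms);
    } while (ready < 0 && errno == EINTR);

    return ready;
}
```

Note that each retry starts a fresh wait with the full `timeout_ms`, so a caller that relies on precise timeouts would need to track the remaining time itself.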

@suola
Author

suola commented Oct 13, 2022

I ran massdns with --processes 4, and all 4 processes printed that message once and exited.

The reason I suspect this was a premature exit is that 3 out of 4 processes had already finished (their CPU usage was zero), and one of the output files was 25% smaller than the other three, which were basically equal in size.

I'm not sure whether this is relevant for this issue, but another observation: the reported massdns progress reached 100% quite a long time before the processes exited. This has occurred several times before, but the presumably premature exit with the Epoll warning has occurred only once.

@suola
Author

suola commented Oct 13, 2022

I took a look at the logs, and there were no signs of the OOM killer.

However, I noticed that the timestamp of the log message was incorrect (I redirect massdns logs to my common log file, and all the messages were buffered and flushed on exit).

Therefore, this issue is a user error. The problem described in the previous message is real, though.
