Recently we've had a lot of issues with our QN instances being unavailable or very slow to respond. This is most likely caused by expensive, long-running queries that block all other queries. To fix this, I think we should do two things:
1. Add the ability to track queries that take more than X seconds to execute: the query body and the request headers, so we can see which apps are making them. Ideally we would save this to a log file so it's easy to inspect later.
2. Add the ability to abort queries that take longer than X seconds.
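To make the two points concrete, here is a minimal sketch of what they could look like in TypeScript. It is not the QN's actual code: `SlowQueryRecord`, `slowQueryLogLine`, and `withTimeout` are hypothetical names, and the threshold X is passed in as a plain number of milliseconds.

```typescript
// Hypothetical shape of what we would record for a slow query (assumption, not QN code).
interface SlowQueryRecord {
  query: string;                    // query body
  headers: Record<string, string>;  // request headers, to identify the calling app
  durationMs: number;               // how long the query took
}

// Returns a JSON log line if the query exceeded the threshold, otherwise null.
function slowQueryLogLine(rec: SlowQueryRecord, thresholdMs: number): string | null {
  if (rec.durationMs <= thresholdMs) return null;
  return JSON.stringify({ level: "warn", msg: "slow query", ...rec });
}

// Runs `work` and rejects if it has not settled within `timeoutMs`.
// The AbortSignal is fired on timeout so `work` can cancel what it is doing.
async function withTimeout<T>(
  work: (signal: AbortSignal) => Promise<T>,
  timeoutMs: number,
): Promise<T> {
  const controller = new AbortController();
  let timer: ReturnType<typeof setTimeout> | undefined;
  const timeout = new Promise<never>((_, reject) => {
    timer = setTimeout(() => {
      controller.abort();
      reject(new Error(`query aborted after ${timeoutMs} ms`));
    }, timeoutMs);
  });
  try {
    return await Promise.race([work(controller.signal), timeout]);
  } finally {
    clearTimeout(timer); // do not keep the timer alive after the query settles
  }
}
```

Note that rejecting the promise only frees the request handler. The underlying database query keeps running unless `work` actually reacts to the signal or the database enforces its own timeout, so the abort would need to be wired through to whatever executes the query.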
About (1): as I wrote on Discord, the current OpenTelemetry implementation should already be enough for that. Can the prod QN instance be run with OpenTelemetry on? Should I change the command to also run it with pm2?