Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Suspected memory leak on frontend. #1762

Closed
ahmeteser89 opened this issue May 17, 2024 · 1 comment
Closed

Suspected memory leak on frontend. #1762

ahmeteser89 opened this issue May 17, 2024 · 1 comment
Labels
kind/bug Something isn't working

Comments

@ahmeteser89
Copy link

What happened:
Although it is not recommended to use openmatch without a game frontend, first we wanted to test openmatch for our project without a game frontend.
We are using openmatch frontend directly from our clients with a simple flow:
-> CreateTicket
-> GetTicket until there is an assignment. (one per every X seconds)
-> DeleteTicket after we got an assignment.

We were able to stabilize the open file descriptors (that are used for connections to redis) and go routines by using production values here, but we could not understand why memory usage of frontend keeps increasing until the pod is killed by kubernetes. It looks like the only thing it does is talking to redis and returning ticket information. Every other metric except memory usage of frontend looks normal.

Appending some graphs below that illustrates the issue better than words.
Memory usage of processes (as you can see frontend keeps increasing until the pod is killed):
image
Open FDs and Go Routines:
image
Client and Server request rates(as you can see they are pretty much at the same rate):
image

What you expected to happen:

We expect the frontend memory usage should be stabilized at some point because there is no other change in any of the other metrics and query rate is the same.

How to reproduce it (as minimally and precisely as possible):
Send requests to openmatch frontend from many clients with the flow below:
-> CreateTicket
-> GetTicket until there is an assignment. (once per every X seconds)
-> DeleteTicket after we got an assignment.

Output of kubectl version:

Client Version: v1.28.4
Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
Server Version: v1.29.3-eks-adc7111

Cloud Provider/Platform (AKS, GKE, Minikube etc.):
EKS

Open Match Release Version:
1.8.1

Install Method(yaml/helm):
helm

@ahmeteser89 ahmeteser89 added the kind/bug Something isn't working label May 17, 2024
@ahmeteser89
Copy link
Author

We have found the cause of the leak for the frontend, it was telemetry metrics.
After we disabled the telemetry.prometheus.enable for frontend memory increase stopped.
It might be related to a specific metric of the frontend as it doesnt cause any problems for other components but we did not have time to identify this specific metric.
So if anyone encounters this problem disabling metrics by setting telemetry.prometheus.enable to false is worth a shot.
Closing this issue...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant