-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Issues: triton-inference-server/server
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Milestones
Assignee
Sort
Issues list
Does Triton Server support Dynamic Request Batching for models which has sparse tensors as inputs
#7333
opened Jun 7, 2024 by
MorrisMLZ
Triton Server 24.05 can't detect CUDA drivers if host system has installed Nvidia driver 555.85
#7319
opened Jun 4, 2024 by
romanvelichkin
Uneven QPS leads to low throughput and high latency as well as low GPU utilization
#7318
opened Jun 4, 2024 by
SunnyGhj
When the request is large, the Triton server has a very high TTFT.
#7316
opened Jun 4, 2024 by
Godlovecui
Low QPS with momentary traffic surges cause significant increases in inference TP99 latency.
#7313
opened Jun 3, 2024 by
a1342772
unexpected datatype TYPE_INT64 for inference input ,expecting TYPE_INT32
#7307
opened May 31, 2024 by
CallmeZhangChenchen
ONNX backend with TensorRT optimizer sometimes fails to start
#7296
opened May 29, 2024 by
ShuaiShao93
How does Triton implement one instance to handle multiple requests simultaneously?
investigating
The developement team is investigating this issue
#7295
opened May 29, 2024 by
SeibertronSS
Support histogram custom metric in Python backend
enhancement
New feature or request
#7287
opened May 28, 2024 by
ShuaiShao93
What is the correct way to run inference in parallel in Triton?
#7283
opened May 28, 2024 by
sandesha-hegde
A Confusion about prefetch
performance
A possible performance tune-up
question
Further information is requested
#7282
opened May 28, 2024 by
SunnyGhj
Windows 10 docker build Error "Could not locate a complete Visual Studio instance"
investigating
The developement team is investigating this issue
#7281
opened May 28, 2024 by
jinkilee
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.