In `cmp_fmt()`, non-emulated formats with more caps are always preferred. However, some GPUs, e.g. my Intel Arc A750 and perhaps other Intel GPUs, perform better with rgba16f, which is an emulated format, than with rgba32f, which is non-emulated. My own testing confirms this.
It is not surprising that rgba16f performs better in practice even though it is emulated: the GPU can do some internal SIMD with 16-bit floats.
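To make the ordering problem concrete, here is a minimal sketch in C. It does not reproduce libplacebo's actual `cmp_fmt()`: the `fmt_info` struct, both comparators, `pick_best()`, and the `cost` field are hypothetical names made up for illustration, and the cost values used with it are placeholders, not measurements. It only shows how a "non-emulated first, then more caps" rule picks rgba32f, while a rule that consulted a per-device performance hint first would pick rgba16f.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for a GPU format description (not libplacebo's pl_fmt). */
struct fmt_info {
    const char *name;
    bool emulated;   /* true if the format is provided through emulation */
    unsigned caps;   /* bitmask of supported capabilities */
    int cost;        /* assumed per-device cost hint, lower is faster (placeholder) */
};

/* Count the set bits in a capability mask. */
int count_caps(unsigned caps)
{
    int n = 0;
    for (; caps; caps &= caps - 1)
        n++;
    return n;
}

/* Ordering as described in this report: non-emulated formats first,
 * then formats with more caps. Returns < 0 if a is preferred over b. */
int cmp_caps_first(const struct fmt_info *a, const struct fmt_info *b)
{
    if (a->emulated != b->emulated)
        return a->emulated ? 1 : -1;
    return count_caps(b->caps) - count_caps(a->caps);
}

/* Hypothetical alternative: consult the cost hint first and fall back
 * to the ordering above only when the costs are equal. */
int cmp_cost_first(const struct fmt_info *a, const struct fmt_info *b)
{
    if (a->cost != b->cost)
        return a->cost < b->cost ? -1 : 1;
    return cmp_caps_first(a, b);
}

typedef int (*fmt_cmp)(const struct fmt_info *, const struct fmt_info *);

/* Return the most preferred format in fmts[0..n-1] under the given ordering. */
const struct fmt_info *pick_best(const struct fmt_info *fmts, int n, fmt_cmp cmp)
{
    assert(n > 0);
    const struct fmt_info *best = &fmts[0];
    for (int i = 1; i < n; i++) {
        if (cmp(&fmts[i], best) < 0)
            best = &fmts[i];
    }
    return best;
}
```

Given two entries with equal caps, an emulated rgba16f with a lower placeholder cost and a non-emulated rgba32f with a higher one, `pick_best()` with `cmp_caps_first` returns rgba32f and with `cmp_cost_first` returns rgba16f. Where such a cost hint would come from on a real device is left open here.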
ruihe774 changed the title from "gpu: suboptimal selection of format in pl_find_fmt()" to "gpu: suboptimal preference of formats - emulated formats can have better performance" on May 18, 2024
Content of my `gpu->formats`: (listing not preserved)