AArch64: Separate fmls vector subtraction into vector elements #6519
As part of a research project testing the accuracy of the sleigh specifications against real hardware, we observed unexpected behaviour in the fmls instruction for AArch64. According to Section C7.2.126, the instruction is expected to operate element-wise on the several items stored within a single register, whereas the current specification treats the entire vector register as a single value.
e.g.:
53cfae0e
"fmls v19.2S, v26.2S, v14.2S" with z19=0xbff34c546c04b2a7, z26=0xc37b69b4ba630f35, z14=0xbeb4b66dc01ec6fb
Hardware Reference: 0xc2b546ac6c04b2a7
Existing Spec: 0xc2b1797b3b0cd514
Patched Spec: 0xc2b546ac6c04b2a7
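
For readers who want to check the expected value by hand, the per-element behaviour can be sketched in Python as below. This is only an illustrative model, not the sleigh patch itself: the helper names (`f32_from_bits`, `bits_from_f32`, `fmls_2s`) are made up for this example, and the fused operation is approximated by computing `d - n * m` in double precision and rounding once to single precision. That matches the hardware result for the inputs above, but it is not claimed to be bit-exact for every input.

```python
import struct

def f32_from_bits(bits):
    # Reinterpret a 32-bit pattern as an IEEE 754 single-precision value.
    return struct.unpack("<f", struct.pack("<I", bits))[0]

def bits_from_f32(value):
    # Round a Python float to single precision and return its bit pattern.
    return struct.unpack("<I", struct.pack("<f", value))[0]

def fmls_2s(vd, vn, vm):
    # Hypothetical model of "fmls Vd.2S, Vn.2S, Vm.2S":
    # each 32-bit lane is computed independently as d - n * m.
    result = 0
    for lane in range(2):
        shift = 32 * lane
        d = f32_from_bits((vd >> shift) & 0xFFFFFFFF)
        n = f32_from_bits((vn >> shift) & 0xFFFFFFFF)
        m = f32_from_bits((vm >> shift) & 0xFFFFFFFF)
        result |= bits_from_f32(d - n * m) << shift
    return result

out = fmls_2s(0xbff34c546c04b2a7, 0xc37b69b4ba630f35, 0xbeb4b66dc01ec6fb)
print(hex(out))  # 0xc2b546ac6c04b2a7, matching the hardware reference
```

In the low lane the destination element is many orders of magnitude larger than the product, so it is unchanged (0x6c04b2a7), while the high lane evaluates to roughly -90.64 (0xc2b546ac).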