Bug #98520 | Suboptimal implementations for some load/store functions for little-endian arch | ||
---|---|---|---|
Submitted: | 8 Feb 2020 9:37 | Modified: | 5 May 2020 14:51 |
Reporter: | Alexey Kopytov | Email Updates: | |
Status: | Verified | Impact on me: | |
Category: | MySQL Server: Optimizer | Severity: | S5 (Performance) |
Version: | 8.0 | OS: | Any |
Assigned to: | CPU Architecture: | ARM |
[8 Feb 2020 9:37]
Alexey Kopytov
[11 Feb 2020 7:02]
MySQL Verification Team
Hello Alexey, Thank you for the report and feedback. regards, Umesh
[16 Mar 2020 1:20]
Daniel Black
Very nice Alexey. Even on x86_64 gcc-4.1.2 this correctly inlines memcpy and generates less code than the original. Save our CPU L1 instruction caches for other stuff. Keep up the good work.
[5 May 2020 14:51]
Alexey Kopytov
I noticed that CPU architecture for this issue has been updated from "Any" to "ARM". I'd like to emphasize that the optimization proposed here applies to any little-endian architecture such as x86, amd64, arm64, ppc64le and likely others.