Add Cortex-A320 to MIDR decode table#384
Conversation
fbarchard
left a comment
There was a problem hiding this comment.
once landed i'll roll it into xnnpack so the uarch_cortex_a320 is available. or youre welcome to, but its almost easier for me to do it. If youre able to run benchmarks in xnnpack, you could select kernels that really are faster on a320. likely a55 or whatever a520 is doing.
|
Thanks for the approval! Happy to let you handle the XNNPACK side — appreciate the offer. Mapping to On benchmarks: not yet, unfortunately — we're running on the Corstone-1000-A320 FVP, which is functionally accurate but not cycle-accurate for the CPU side, so any kernel ordering we'd derive from it would be misleading. Once we have real A320 silicon we'll come back with measurements and can compare against A55 / A520 kernel paths. |
|
Also update ./tools/cpu-info.c with the string. Ideally build and run the tool to confirm detects. |
ARM Cortex-A320 (MIDR part 0xD8F) is an ARMv9.2-A efficiency core. Add its uarch enum and MIDR mapping so XNNPACK can select optimized kernels when running on this core. Signed-off-by: Nicolas Pitre <npitre@baylibre.com>
|
Done — added the |
Split out from #379 per review request.
ARM Cortex-A320 (MIDR part
0xD8F) is an ARMv9.2-A efficiency core.Add its uarch enum value and MIDR decode entry so consumers (XNNPACK,
KleidiAI, etc.) can dispatch optimised kernels when running on this
core.
The A320 implements the ARMv9.2-A mandatory feature set: NEON, SVE2,
dotprod, FP16, BF16, I8MM (per the Cortex-A320 TRM).
The new MIDR/uarch entries are inserted in numerical order alongside
the existing ARMv9 cores added by recent commits (A520, A720, X4,
X925, A725, Lumex variants).