embeded/ARM
vfp, neon cycle
구차니
2022. 1. 23. 18:08
부동 소수점 +,-,*는 그래도 1사이클에 끝나지만 /는 10사이클 소요됨.
VFP에 vector가 들어간다고 해도 NEON 수준의 병렬성은 제공하지 않음
The VFP architecture was intended to support execution of short "vector mode" instructions but these operated on each vector element sequentially and thus did not offer the performance of true single instruction, multiple data (SIMD) vector parallelism. |