It's how the Basic Linear Algebra Subprograms (BLAS) are implemented.
dewritoninja @ dewritoninja @pawb.social Posts 2Comments 159Joined 2 yr. ago
Not only is assembly very difficult to write, the resulting code is also not portable. If you write it in x86 assembly, it can't run on ARM chips without emulation, and emulation takes a significant performance hit, which defeats the point of writing assembly in the first place.
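To make the portability point concrete, here's a rough sketch in C using GCC/Clang-style inline assembly. The function names `add_portable` and `add_asm` are made up for illustration, and the fallback branch is my own assumption about how you'd handle other architectures. The plain C version compiles anywhere, while the assembly version needs a separate hand-written body for every instruction set you want to support.

```c
#include <stdint.h>

/* Plain C: the compiler picks the right instructions for whatever target you build for. */
static uint32_t add_portable(uint32_t a, uint32_t b) {
    return a + b;
}

/* Hand-written assembly: each architecture needs its own version.
   Assumes a GCC- or Clang-compatible compiler (extended inline asm syntax). */
static uint32_t add_asm(uint32_t a, uint32_t b) {
#if defined(__x86_64__) || defined(__i386__)
    /* x86 (AT&T syntax): a += b */
    __asm__("addl %1, %0" : "+r"(a) : "r"(b) : "cc");
    return a;
#elif defined(__aarch64__)
    /* 64-bit ARM: completely different mnemonic and operand layout */
    uint32_t result;
    __asm__("add %w0, %w1, %w2" : "=r"(result) : "r"(a), "r"(b));
    return result;
#else
    /* No hand-written version for this architecture, so fall back to C */
    return add_portable(a, b);
#endif
}
```

Calling `add_asm(2, 3)` should give the same result as `add_portable(2, 3)` on any of these targets, but only because someone wrote a separate branch per architecture. Remove the `#if` guards and the x86 body simply won't assemble on ARM.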