Abstract: Matrix multiplication is one of the most basic and important operations in many computational applications, but it comes with high time complexity. Several parallel algorithms have been proposed ...
Abstract: In this paper, we propose three modular multiplication algorithms that use only the IEEE 754 binary floating-point operations. Several previous studies have used floating-point operations to ...
* Program re-ordering for improved L2 cache hit rate.
* Automatic performance tuning.

# Motivations #

Matrix multiplications are a key building block of most modern high-performance computing systems.