Abstract: Large-scale matrix multiplication is a computational bottleneck in various applications including artificial intelligence and machine learning. Given the time complexity of O(n^3) for matrix ...