Abstract: Google’s Tensor Processing Unit (TPU) verifies the power of the systolic array architecture in accelerating specific computational tasks. Since the present Neural Processing Unit (NPU) ...