Learning to Optimize Tensor Programs(arxiv.org) |
Learning to Optimize Tensor Programs(arxiv.org) |
[0]: https://en.wikipedia.org/wiki/Polytope_model [1]: https://gcc.gnu.org/wiki/Graphite [2]: https://gcc.gnu.org/wiki/Graphite
The GEMM example was just there as the details of the optimization have been published, unlike most other hand-tuned assembler routines for DNN workloads.