HALP: Hardware-Aware Latency Pruning

Publication
arXiv preprint arXiv:2110.10811