Structural pruning via latency-saliency knapsack

Publication
In NeurIPS2022