Leveraging sparsity to drive fast response times at the edge