there is still some appreciable bias with large flattened conv layers and `m=1000`. Idea:

- we keep `m` moderate for geometric calculations
- ...but initially estimating what the top `p` PCs are for a layer with large `n` could benefit from a big `m`
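
A minimal sketch of that two-stage idea, under assumptions the note does not spell out: `m` is read as the number of inputs (rows), `n` as the number of units in the flattened layer (columns), and `p` as the number of PCs kept. The names `estimate_top_pcs`, `project`, `m_big`, and `m_small` are hypothetical, the data is synthetic, and a plain SVD stands in for whatever PCA routine is actually used.

```python
import numpy as np


def estimate_top_pcs(x_big, p):
    """Estimate the mean and top-p principal directions from an (m_big, n) array."""
    mean = x_big.mean(axis=0, keepdims=True)
    # Rows of vt are principal directions, ordered by decreasing singular value.
    _, _, vt = np.linalg.svd(x_big - mean, full_matrices=False)
    return mean, vt[:p].T  # basis has shape (n, p)


def project(x, mean, basis):
    """Project an (m, n) array onto a previously estimated PC basis."""
    return (x - mean) @ basis


rng = np.random.default_rng(0)
n, p = 512, 10              # assumed: layer width and number of PCs kept
m_big, m_small = 2000, 200  # assumed: big m for PCA, moderate m afterwards

# Synthetic stand-in for layer activations: low-rank signal plus noise.
mixing = rng.standard_normal((p, n))


def sample(m):
    return rng.standard_normal((m, p)) @ mixing + 0.1 * rng.standard_normal((m, n))


# Stage 1: estimate the top-p PCs once, using the large sample.
mean, basis = estimate_top_pcs(sample(m_big), p)

# Stage 2: geometric calculations run on a moderate-m sample in the p-dim subspace.
z_small = project(sample(m_small), mean, basis)
print(z_small.shape)
```

Whether this actually reduces the bias would still need to be checked empirically; the sketch only shows how the PC estimate can be decoupled from the `m` used downstream.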