In the test example Jupyter notebook, the enhanced result is smaller than the noisy one for some metrics and larger for others. For example, a higher PESQ score should be better, but that is not what the demo shows.
Also, the scale or range of each result isn't mentioned in the README. I know some lie between 0 and 5, but I'm not familiar with the others.
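
To make the question concrete, here is a rough sketch of the kind of check I would like to be able to run. The `HIGHER_IS_BETTER` table and the `improved` helper are hypothetical names of my own, not part of this repository. The table reflects my own understanding of each metric's direction (PESQ and STOI as scores where higher is better, LLR as a distance where lower is better) and would need to be confirmed against the README.

```python
# Hypothetical direction table: True means a larger value is better.
# These entries are assumptions, not taken from this repository's docs.
HIGHER_IS_BETTER = {
    "pesq": True,   # perceptual quality score
    "stoi": True,   # intelligibility score
    "llr": False,   # log-likelihood ratio, a distance measure
}


def improved(metric, noisy, enhanced):
    """Return True if `enhanced` beats `noisy` under the assumed direction."""
    if HIGHER_IS_BETTER[metric]:
        return enhanced > noisy
    return enhanced < noisy


# Placeholder numbers for illustration only, not results from the notebook.
print(improved("pesq", noisy=1.0, enhanced=2.0))
print(improved("llr", noisy=1.0, enhanced=0.5))
```

If the README listed the direction and range of each metric, a table like this could be filled in without guessing.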