The configurable policies introduced in #254 use a prefix on the utility estimator name to identify which estimators to override with the policy estimator. The epsilon module identifies its estimators the same way, with its own prefix. When a name carries both prefixes, the epsilon estimators are not recognized and so are never bound.
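As an illustration of the failure mode only, here is a minimal sketch of how two stacked prefixes can defeat a simple prefix check. The prefix strings, the function names, and the ordering of the prefixes are all hypothetical; the actual matching logic in this repository may differ.

```python
# Hypothetical prefixes; the real values used by the policy and epsilon
# modules are not given in this description.
POLICY_PREFIX = "policy:"
EPSILON_PREFIX = "epsilon:"
KNOWN_PREFIXES = (POLICY_PREFIX, EPSILON_PREFIX)


def is_epsilon_naive(name: str) -> bool:
    """Naive check: only looks at the very start of the name."""
    return name.startswith(EPSILON_PREFIX)


def split_prefixes(name: str) -> tuple[set[str], str]:
    """Peel off every known prefix, in any order, and return them
    together with the remaining base name."""
    found: set[str] = set()
    stripped = True
    while stripped:
        stripped = False
        for prefix in KNOWN_PREFIXES:
            if name.startswith(prefix):
                found.add(prefix)
                name = name[len(prefix):]
                stripped = True
    return found, name


# Assumed example name in which the policy prefix was applied
# on top of the epsilon prefix.
name = "policy:epsilon:utility"

naive_result = is_epsilon_naive(name)   # misses the epsilon prefix
found, base = split_prefixes(name)      # sees both prefixes
```

With the naive check, the epsilon prefix is hidden behind the policy prefix, so the estimator would be skipped. Peeling off all known prefixes before deciding is one possible way to make the two mechanisms coexist.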