This repository contains the code to reproduce the results in the paper Optimized Conformal Selection: Powerful Selective Inference After Conformity Score Optimization.
simulation/: Simulation experimentssimulation/Msel/: Conformity score selection with pre-trained models (Section 5.1)simulation/Full/: Conformal selection without data splitting (Section 5.2)simulation/Full-Msel/: Model training and selection with full data (Section 5.3)
real/: Real-data appliationsreal/drug/: Drug discovery with model selection (Section 6.1)real/llm/: Boosting LLM Alignment (Section 6.2)
requirements.txt: A list of required python packages
For simulation, a single job submission is sufficient to run the experiment.
For the drug discovery application:
- Use
modelpred.pyto generate and save model predictions based on different drug encodings. - Evaluate the performance of various methods using
evaluate.py.
The code for the LLM alignment application is largely adapted from this repository. After performing report generation and score extraction as described in the linked repository:
- Use
collect.pyto compile all uncertainty/confidence scores and labels into a single.csvfile. - Conduct experiments using different combinations of models via
llm_set1.pyandllm_set2.py.