sanity-check always returns pass@1: 0.0 #57

When running evaluate_functional_correctness with the provided example files (data/example_samples.jsonl and data/example_problem.jsonl), the output always shows pass@1: 0.0. According to the documentation, it should be around 0.5.
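For reference, here is a minimal sketch of how a pass@1 of 0.5 could come out of six samples. It assumes the score is the usual unbiased pass@k estimate, 1 - C(n - c, k) / C(n, k), and that three of the six example samples are meant to pass. Both are assumptions on my part, and `pass_at_k` is an illustrative helper, not necessarily the tool's own function.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem: 1 - C(n - c, k) / C(n, k).

    n: total samples generated for the problem
    c: number of samples that passed the tests
    k: number of samples drawn
    """
    if n - c < k:
        # Fewer than k failing samples: every draw of k contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


# Assumed split: 3 of the 6 example samples pass.
print(pass_at_k(n=6, c=3, k=1))  # 0.5
# A score of 0.0 means none of the 6 samples was counted as passing.
print(pass_at_k(n=6, c=0, k=1))  # 0.0
```

Under these assumptions, a reported 0.0 means that every sample was recorded as failing, not just the ones that are expected to fail.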

Command: 
evaluate_functional_correctness data/example_samples.jsonl --problem_file=data/example_problem.jsonl

Result:
Reading samples...
6it [00:00, 2636.27it/s]
Running test suites...
100%|██████████| 6/6 [00:00<00:00,  7.23it/s]
Writing results to data/example_samples.jsonl_results.jsonl...
100%|██████████| 6/6 [00:00<00:00, 11915.64it/s]
{'pass@1': 0.0}
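To see why every sample is counted as failing, the per-sample results file named in the log can be inspected. The sketch below tallies the outcome recorded for each sample. It assumes each line of the results file is a JSON object with a `result` field describing the outcome; that field name is an assumption, and `summarize_results` is a hypothetical helper, not part of the tool.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_results(path):
    """Count how often each outcome appears in a JSONL results file.

    Assumes one JSON object per line with a "result" field (an assumption
    about the file layout); records without it are counted as "<missing>".
    """
    counts = Counter()
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)
            counts[str(record.get("result", "<missing>"))] += 1
    return counts


if __name__ == "__main__":
    # Path taken from the "Writing results to ..." line in the log above.
    results_path = Path("data/example_samples.jsonl_results.jsonl")
    if results_path.exists():
        for outcome, count in summarize_results(results_path).most_common():
            print(f"{count} x {outcome}")
    else:
        print(f"{results_path} not found; run the evaluation first")
```

If all six lines carry the same error message, the problem is more likely in the execution step than in the samples themselves.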
