Fixes to parser and explicitly print parsing error #138

shubhamugare · 2024-12-24T18:26:26Z

This PR includes two main changes:

1. Parser fixes for the case where generation starts from empty output, which happens in NL-to-code tasks for general-purpose programming languages.
2. The parsing error, and the resulting switch to unconstrained generation, is now reported explicitly. On any parsing error, SynCode typically falls back to unconstrained generation (unless the dev_mode flag is on). In grammar_mask mode, a parsing error can occur when the generated output diverges from the grammar, since SynCode does not always guarantee adherence to the grammar.
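The fallback behavior described above can be pictured with a small sketch. This is not SynCode's actual code: `ParseError`, `next_token_mask`, `compute_mask`, and `vocab_size` are hypothetical names chosen for illustration, and only the `dev_mode` flag comes from the description. It assumes the grammar mask is a list of booleans over the vocabulary.

```python
class ParseError(Exception):
    """Hypothetical stand-in for whatever exception the parser raises."""


def next_token_mask(partial_output, compute_mask, vocab_size, dev_mode=False):
    """Return a boolean mask over the vocabulary for the next token.

    compute_mask is a hypothetical callable that parses partial_output and
    returns the grammar-allowed mask, raising ParseError on failure.
    """
    try:
        return compute_mask(partial_output)
    except ParseError as err:
        if dev_mode:
            # In dev mode, surface the error instead of falling back.
            raise
        # Report the error explicitly, then allow every token,
        # which is equivalent to unconstrained generation for this step.
        print(f"Parsing error: {err}. Falling back to unconstrained generation.")
        return [True] * vocab_size
```

With this shape, a silent divergence from the grammar becomes visible in the logs, while `dev_mode=True` lets the error propagate so it can be debugged.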

Test:

python3 syncode/infer.py --mode grammar_mask --model meta-llama/Llama-2-7b-hf --dataset humaneval --grammar python --max_new_tokens 400 --parser lr --parse_output_only False

results in pass@1 of ~12%

Fixes to parser and explicitly print parsing error

a88b744

shubhamugare force-pushed the instruct branch from efe46ed to a88b744 Compare December 25, 2024 06:52

shubhamugare merged commit 12a5e28 into main Dec 25, 2024
1 check passed

shubhamugare mentioned this pull request Dec 25, 2024

Issues with built-in python, java, and go grammars #137

Open
