Run files of Vanilla BERT checkpoints do not match test folds in data/robust

First of all, thanks a lot for your interesting work on CEDR and for the code in this repository. 

I downloaded the Vanilla BERT and CEDR-KNRM checkpoints from #18 and checked the query ids in the `.run` files contained in the downloaded archive. While the sets of query ids in `cedrknrm-robust-f[1-5].run` match those in `data/robust/f[1-5].test.run`, the sets of query ids in `vbert-robust-f[1-5].run` do not match those in `data/robust/f[1-5].test.run` (e.g. the set of query ids in `vbert-robust-f1.run` is **different** from the set of query ids in `data/robust/f1.test.run`, and also `cedrknrm-robust-f1.run`).

Why are the folds for Vanilla BERT and CEDR-KNRM different? On which folds have the Vanilla BERT checkpoints been trained/validated? Given that the test folds of the Vanilla BERT and CEDR-KNRM checkpoints are different I assume that the provided Vanilla BERT checkpoints have **not** been used as initial weights for obtaining the provided CEDR-KNRM checkpoints. Is this assumption correct? If yes, which Vanilla BERT checkpoints have been used to initialize CEDR-KNRM training? Do you mind sharing these checkpoints too? 

I'm currently investigate issues reproducing the results published in the paper. More on that in a separate ticket ...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run files of Vanilla BERT checkpoints do not match test folds in data/robust #21

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Run files of Vanilla BERT checkpoints do not match test folds in data/robust #21

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions