Hello,
I have tried to use ELMo instead of BERT, as you can see in my fork.
The training works, but the results are very similar to training without any contextual embedding (just GloVe).
Do you have any idea why or how to fix it?
I think I might have forgotten something in my code.
Moreover, I noticed that x_cemb and ques_cemb are never instantiated (they are always None); could this be part of the issue?
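For reference, here is a minimal sketch (the helper name and shapes are hypothetical, not from the actual repo) of what I would expect the input construction to look like. If x_cemb stays None, the model silently falls back to GloVe alone, which would explain the near-identical results:

```python
import numpy as np

def build_input(glove_emb, contextual_emb=None):
    """Concatenate GloVe embeddings with a contextual (e.g. ELMo) embedding.

    If contextual_emb is None, this returns GloVe alone, so the model
    trains exactly as it would without any contextual embedding.
    """
    if contextual_emb is None:
        return glove_emb
    return np.concatenate([glove_emb, contextual_emb], axis=-1)

glove = np.zeros((2, 5, 300))    # (batch, seq_len, glove_dim)
x_cemb = np.zeros((2, 5, 1024))  # (batch, seq_len, elmo_dim), e.g. ELMo output

print(build_input(glove).shape)          # (2, 5, 300)  -> GloVe only
print(build_input(glove, x_cemb).shape)  # (2, 5, 1324) -> GloVe + ELMo
```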
Thanks in advance