Skip to content

about the performence #2

@Kenwwww

Description

@Kenwwww

I conducted tests on the following common rag situations and made the following findings:
1.There are several sentences in the long text that are related to the question
2. In long passages, one needs to have an overall understanding of the entire text to answer the question (that is, the knowledge points related to the question are very scattered).
3. Long texts have no relation to the question

The effect is as follows:

  1. It can be found that there are no issues with the current test
  2. The performance was poor. Instead of summarizing, it directly returned to the original text snippet
  3. Poor performance, still returning the original text snippet

It seems that there is no summary process at all. It's more like "selecting sentences related to the problem from the original text".
But indeed, in terms of the performance of the task of selecting the relevant parts, it is indeed much better than the base model of the same scale.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions