Evals

This is the clone of the following OPENAI evals repo: https://github.com/openai/evals.

With Evals, we aim to make it as simple as possible to build an eval while writing as little code as possible. An "eval" is a task used to evaluate the quality of a system's behavior. To get started, we recommend that you follow these steps:

Setup

To run evals, you will need to set up and specify your OpenAI API key. You can generate one at https://platform.openai.com/account/api-keys. After you obtain an API key, please search and replace the string 'API_KEY' in the code with your own key.

Minimal Required Version: Python 3.9

Making evals

If you are going to be creating evals, we suggest cloning this repo directly from GitHub and installing the requirements using the following command:

pip install -e .

Using -e, changes you make to your eval will be reflected immediately without having to reinstall.

Running evals

oaieval gpt-3.5-turbo test-jee-match

If you want to reduce the number of threads that are run concurrently (default is 10) - to avoid rate limiting errors - run the following command:

EVALS_THREADS=1 oaieval gpt-3.5-turbo test-jee-match

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
docs		docs
evals.egg-info		evals.egg-info
evals		evals
examples		examples
scripts		scripts
.DS_Store		.DS_Store
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
mypy.ini		mypy.ini
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evals

Setup

Making evals

Running evals

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

ayushdeva/gpt-jee-eval

Folders and files

Latest commit

History

Repository files navigation

Evals

Setup

Making evals

Running evals

About

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages