Add optional juryeval integration for LLM-as-Judge metrics #2465

Open
py-ai-dev wants to merge 1 commit into open-compass:main from py-ai-dev:juryeval-integration
@py-ai-dev

Adds an optional integration with juryeval — a lightweight LLM-as-Judge evaluation toolkit.

Changes:

  • New JuryEvalEvaluator in opencompass/evaluator/ registered with @ICL_EVALUATORS.register_module(), supporting pairwise and pointwise judge modes
  • requirements/juryeval.txt + juryeval extra in setup.py
  • Tests in tests/test_juryeval_evaluator.py
  • Import guarded in __init__.py
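For reviewers, here is a rough sketch of how a guarded optional import and the two judge modes could fit together. It is not the code in this PR: `optional_import`, `JudgeEvaluatorSketch`, the `judge` callable, and the `judge_score` / `win_rate` result keys are all hypothetical names chosen for illustration, and nothing about the juryeval API is assumed. The only convention borrowed from OpenCompass is an evaluator exposing `score(predictions, references)` and returning a dict. In pairwise mode the sketch assumes `references` holds baseline answers to compare against.

```python
import importlib


def optional_import(name):
    """Return the named module if it is installed, otherwise None."""
    try:
        return importlib.import_module(name)
    except ImportError:
        return None


# The optional dependency from this PR. Only its presence is checked here;
# no juryeval function or class is called (its API is not assumed).
juryeval = optional_import("juryeval")


class JudgeEvaluatorSketch:
    """Hypothetical stand-in for JuryEvalEvaluator, for illustration only."""

    MODES = ("pointwise", "pairwise")

    def __init__(self, judge, mode="pointwise"):
        if mode not in self.MODES:
            raise ValueError(f"mode must be one of {self.MODES}, got {mode!r}")
        # `judge` is an assumed callable:
        #   pointwise: judge(prediction, reference) -> numeric score
        #   pairwise:  judge(answer_a, answer_b)    -> "A", "B", or "tie"
        self.judge = judge
        self.mode = mode

    def score(self, predictions, references):
        if len(predictions) != len(references):
            return {"error": "predictions and references differ in length"}
        if not predictions:
            return {"error": "no predictions to score"}

        verdicts = [self.judge(p, r) for p, r in zip(predictions, references)]

        if self.mode == "pointwise":
            # Mean of the per-sample judge scores.
            return {"judge_score": sum(verdicts) / len(verdicts)}

        # Pairwise: fraction of samples where the prediction ("A") wins.
        wins = sum(1 for v in verdicts if v == "A")
        return {"win_rate": wins / len(verdicts)}
```

With a toy judge such as `lambda p, r: 1.0 if p == r else 0.0`, the pointwise mode reduces to exact-match accuracy, which makes the aggregation easy to check without calling an LLM.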

2 participants