Skip to content

ci: add automated evalbench evaluation pipeline#113

Merged
omkargaikwad23 merged 3 commits intomainfrom
evalbench-ci-onboarding
May 5, 2026
Merged

ci: add automated evalbench evaluation pipeline#113
omkargaikwad23 merged 3 commits intomainfrom
evalbench-ci-onboarding

Conversation

@omkargaikwad23
Copy link
Copy Markdown
Contributor

@omkargaikwad23 omkargaikwad23 commented May 4, 2026

This PR introduces an automated Evalbench CI pipeline for evaluating the Cloud SQL SQL Server extension (source).

Key Changes:

  • Cloud Build Pipeline: Adds cloudbuild.yaml to orchestrate the Evalbench standalone evaluation.
  • Evaluation Configs: Adds configurations (dataset.json, run_config.yaml, model_config.yaml) to define and test core scenarios like debugging instances and checking performance.
  • Trigger Label: Introduces the ci:run-evals GitHub label to manually trigger the evaluation pipeline on pull requests.

@omkargaikwad23 omkargaikwad23 requested review from a team as code owners May 4, 2026 10:36
@github-actions github-actions Bot requested a review from ajupazhamayil May 4, 2026 10:36
@omkargaikwad23 omkargaikwad23 added the ci:run-evals Manually trigger the evaluation CI pipeline on a PR. label May 4, 2026
@omkargaikwad23 omkargaikwad23 merged commit 0eb2d66 into main May 5, 2026
11 checks passed
@omkargaikwad23 omkargaikwad23 deleted the evalbench-ci-onboarding branch May 5, 2026 05:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci:run-evals Manually trigger the evaluation CI pipeline on a PR.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants