Skip to content

[fix][evaluation] ai native eval#487

Merged
dsf86 merged 66 commits intomainfrom
feat/ai_native_eval
Apr 15, 2026
Merged

[fix][evaluation] ai native eval#487
dsf86 merged 66 commits intomainfrom
feat/ai_native_eval

Conversation

@dsf86
Copy link
Copy Markdown
Collaborator

@dsf86 dsf86 commented Apr 9, 2026

What type of PR is this?

Check the PR title

  • This PR title match the format: [<type>][<scope>] <description>. For example: [fix][backend] flaky fix
  • The description of this PR title is user-oriented and clear enough for others to understand.
  • Add documentation if the current PR requires user awareness at the usage level.
  • This PR is written in English. PRs not in English will not be reviewed.

(Optional) Translate the PR title into Chinese

(Optional) More detailed description for this PR(en: English/zh: Chinese)

en:
zh(optional):

(Optional) Which issue(s) this PR fixes

dsf86 added 30 commits February 6, 2026 11:26
Change-Id: I3cc8ab48f8de1da5458dac51c99f29a634b23fe7
Change-Id: Icb168984120505e032e1a66d56678218ed570fbd
Change-Id: Ie3bddc0a105417fbc623c600b2ef6a7513ff32dc
Change-Id: I0d7735112495802bd500a46ebe4f54608481cd29
Change-Id: Ie68fc964c2f2adcb31f61dd0c05f7a263bfdeece
Change-Id: I98a9560b267b31583d0bf3aab5fab43c4d1de6c8
Change-Id: I42c3ada6a3caa2f34ddd5489350f099f3032206e
Change-Id: I0c56c787103f6a20887cfaaf5b6e92f0bc6d3b57
Change-Id: I151ba86cd90f5474700cd7713f4466a4273f575a
Change-Id: I24fe965761919c81cbde08d5f818a8f1dfd3fb91
Change-Id: I8a25e6837435799c63e3828c0b8167fcb9b22336
Change-Id: Ie370b035be768b3a379e01814e269189d488f2e6
Change-Id: Idec20a41250a4f1a2bf55b1b8ae90230a3ca54dc
Change-Id: Ib8a228a463fb70a2c3cfc3c22c472c8b1f354a77
Change-Id: I9378c980c541d2febf833a635e7ad6220c1e4acc
Change-Id: Iae93d7d9f525e90e6ba100a154694db6aefbfd96
Change-Id: I44f969825a48b266b850c07203d5d7d8e2067067
Change-Id: Ia683abf01b1797cf1bf4bf39bf80e3f156b80585
Change-Id: Ie432ceef38f7c5c633bbbd919a16f0ed21672fb7
Change-Id: Iaa44743df110ac9c5627dd4f4cd4f6d185acc16d
Change-Id: If4e0e9644d9f47d121b527b68dae281c0d8da30f
Change-Id: Iba43eccf72ab35c2a3371c039d5ffd6d0c057cce
Change-Id: I07abe8f4d95f8f05635685fea7d0344eb4c9ea76
Change-Id: Ie088042b956a4e749efcc081419737d76f858814
Change-Id: I2e9bef958350025b763dd22ac4ad87191a6a086e
Change-Id: Ia37f4ab62a267393ea527b1d2d2396d5b8c1a5fe
Change-Id: Iba5208c5c72699eb1d400405b1f8bcd171e14243
Change-Id: Iee07b274185cabe2fb5b26f014d4464b090016c1
Change-Id: I1acb412c868245e90452c058af3e51fff9daf23a
Change-Id: Ic9c252bca0aa59674b901b58286d6b36a26e8b1c
dsf86 added 12 commits March 31, 2026 21:11
Change-Id: Id17def64962447e70a6b2dc2c14c9d90c9d20df3
Change-Id: Ia65b19f5c7540044cd82e7c037381998c0a01ed2
Change-Id: Ia7e2c944a848084e5b6c1864e02fef8db4349c7f
Change-Id: Ic0c187c8644fe8d1c04a075feca40ccd876ce0c1
Change-Id: If8f01e58cdbd8ce11572e981aa4295e6df2f2090
Change-Id: Ic4bbcd75374d052a135198abfd5a4f0c0193956f
Change-Id: I01feb027910110fa834da90fa1d6dc1086929954
Change-Id: I3c860e171724e5619d2a5c10dd6775f84c76e362
Change-Id: I96151b3c2cc75e1386e3c64b528ef35ba0c9a022
Change-Id: Ia2755c1f51ed8d7f9ebcb4432dc415cc2cb8c7cc
Change-Id: I6b8598c2e865a160217eb2e12c831c9329b0ee1f
Change-Id: I459dc58ffd3e54187079da8f519bba49ada1c88e
@codecov
Copy link
Copy Markdown

codecov bot commented Apr 10, 2026

Codecov Report

❌ Patch coverage is 76.94064% with 101 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
...ion/domain/service/expt_run_scheduler_mode_impl.go 88.63% 9 Missing and 6 partials ⚠️
...ules/evaluation/domain/service/expt_result_impl.go 50.00% 12 Missing and 2 partials ⚠️
...valuation/application/convertor/experiment/expt.go 43.47% 8 Missing and 5 partials ⚠️
.../application/convertor/experiment/expt_template.go 36.84% 9 Missing and 3 partials ⚠️
...uation/application/convertor/target/eval_target.go 84.37% 7 Missing and 3 partials ⚠️
...luation/infra/repo/target/eval_target_repo_impl.go 0.00% 8 Missing ⚠️
...ation/application/convertor/evaluator/evaluator.go 70.58% 3 Missing and 2 partials ⚠️
...nd/modules/evaluation/application/evaluator_app.go 86.20% 3 Missing and 1 partial ⚠️
...ules/evaluation/domain/service/expt_manage_impl.go 42.85% 2 Missing and 2 partials ⚠️
...uation/application/convertor/experiment/openapi.go 0.00% 1 Missing and 2 partials ⚠️
... and 6 more

❌ Your patch status has failed because the patch coverage (76.94%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.

Impacted file tree graph

@@            Coverage Diff             @@
##             main     #487      +/-   ##
==========================================
+ Coverage   76.65%   77.01%   +0.36%     
==========================================
  Files         647      647              
  Lines       71083    71480     +397     
==========================================
+ Hits        54487    55052     +565     
+ Misses      13266    13130     -136     
+ Partials     3330     3298      -32     
Flag Coverage Δ
unittests 77.01% <76.94%> (+0.36%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines Coverage Δ
...d/modules/evaluation/application/experiment_app.go 85.29% <100.00%> (+3.56%) ⬆️
backend/modules/evaluation/domain/entity/common.go 100.00% <ø> (+12.24%) ⬆️
...kend/modules/evaluation/domain/entity/evaluator.go 100.00% <ø> (+6.79%) ⬆️
backend/modules/evaluation/domain/entity/expt.go 97.89% <100.00%> (+11.08%) ⬆️
...ckend/modules/evaluation/domain/entity/expt_run.go 98.43% <ø> (+28.64%) ⬆️
.../modules/evaluation/domain/entity/expt_template.go 98.01% <ø> (ø)
backend/modules/evaluation/domain/entity/param.go 88.88% <100.00%> (+24.60%) ⬆️
backend/modules/evaluation/domain/entity/target.go 100.00% <100.00%> (ø)
.../modules/evaluation/domain/entity/target_record.go 58.53% <ø> (ø)
...luation/domain/service/expt_run_item_event_impl.go 71.04% <100.00%> (ø)
... and 19 more

... and 3 files with indirect coverage changes


Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ebea9a3...fb68375. Read the comment docs.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

dsf86 added 13 commits April 10, 2026 14:15
Change-Id: I6cff506657c5fbbc20d84f572be15010b7b90295
Change-Id: I2af125dd931790d870e08ae10088e372006472b2
Change-Id: Ice9c1295bd1cb6398adc5dace1ceb4d05f451812
Change-Id: I9c0bbad7633ed67c1ba219431a73da31ec8cded3
Change-Id: I0dbe881aa683b10756f8a6f4a8bdadfcc0aa638d
Change-Id: Ie520d956b151d34f4971e7ab34d9f30a724de5eb
Change-Id: I29143d63486f5e22e46e63fb9fbca2964b2182ac
Change-Id: Id7d09467e9d0517c0ac25c34db59baa3b133a352
Change-Id: I8d3e2aa6dcd711f7b8b3e0d8674d986f4e2831e9
Change-Id: Ie0020a3a3b7fe7c288381cee03052b48f3176ec8
Change-Id: Ifcf5a23214d0e9c332e89be0154f26b676b0c0f9
Change-Id: Ia438a84664854a83c128552ff14601a17b4c95ee
Change-Id: Ibd64712c83f760323e6229974e92aa352d7e7ccc
@dsf86 dsf86 merged commit 6f2dd6f into main Apr 15, 2026
17 of 18 checks passed
@dsf86 dsf86 deleted the feat/ai_native_eval branch April 15, 2026 09:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants