[codex] add ERNIE image diffusers runner by HanFa · Pull Request #1115 · ModelTC/LightX2V

HanFa · 2026-06-03T06:16:59Z

Summary

Adds ERNIE-Image text-to-image support through a Diffusers-backed LightX2V runner.

This PR introduces:

- ernie_image configs and model pipeline registration
- ErnieImageRunner with a decomposed generation path instead of defaulting to ErnieImagePipeline.__call__
- LightX2V wrappers for component access, text/PE encoding, transformer inference, scheduler state, and VAE decode
- progress callback integration, image save/tensor return handling, CPU offload, and unload_modules cleanup
- focused unit tests for shape handling, PE/CFG, scheduler/VAE behavior, component loading, progress, save, tensor return, and unload cleanup

Scope

This is still intentionally Diffusers-backed. It uses diffusers.ErnieImagePipeline.from_pretrained as the underlying loader and then routes runtime access through LightX2V wrapper boundaries. It does not yet implement native ERNIE weight mapping, native transformer infer classes, quantization, LoRA, or distributed execution.
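As a rough illustration of what routing runtime access through a wrapper boundary could look like, the sketch below wraps an already-loaded pipeline object in a small container. Everything in it is hypothetical: `ComponentContainer`, `make_fake_pipeline`, and the attribute names are stand-ins chosen for this example and are not the PR's classes. In the PR the loaded object would come from diffusers.ErnieImagePipeline.from_pretrained, whose attribute layout is only assumed here.

```python
from types import SimpleNamespace


class ComponentContainer:
    """Hypothetical holder that mediates access to a loaded pipeline's parts."""

    # Names are assumptions for illustration, loosely following the
    # components mentioned in the PR's validation notes.
    NAMES = ("text_encoder", "transformer", "scheduler", "vae")

    def __init__(self, pipe):
        self._pipe = pipe
        self._components = {name: getattr(pipe, name) for name in self.NAMES}

    @property
    def loaded(self):
        return self._pipe is not None

    def get(self, name):
        # Fail loudly instead of silently returning None after an unload.
        if name not in self._components:
            raise KeyError(f"component {name!r} is not loaded")
        return self._components[name]

    def unload(self):
        # Drop every reference so the objects can be garbage-collected.
        self._components.clear()
        self._pipe = None


def make_fake_pipeline():
    """Stand-in for the object a real loader would return."""
    return SimpleNamespace(
        text_encoder=object(),
        transformer=object(),
        scheduler=object(),
        vae=object(),
    )


container = ComponentContainer(make_fake_pipeline())
vae = container.get("vae")
container.unload()
```

Keeping all lookups behind one `get` method means a later native implementation could swap out what the container holds without touching callers, which may be one reason to prefer this kind of boundary over reaching into the pipeline directly.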

Validation

Local checks:

- python -m unittest test_cases.test_ernie_image_runner -> 17 tests OK
- ruff check configs/ernie_image lightx2v/infer.py lightx2v/pipeline.py lightx2v/utils/set_config.py lightx2v/models/runners/ernie_image lightx2v/models/input_encoders/hf/ernie_image lightx2v/models/networks/ernie_image lightx2v/models/schedulers/ernie_image lightx2v/models/video_encoders/hf/ernie_image test_cases/test_ernie_image_runner.py -> all checks passed
- python -m py_compile ... for the ERNIE runner/wrapper/test files -> passed

H100 smoke regression after the component-container migration:

- PE off, no offload: LightX2V runner vs direct Diffusers was pixel-identical, MSE 0.0, max abs diff 0
- PE off, CPU offload: LightX2V runner vs direct Diffusers was pixel-identical, MSE 0.0, max abs diff 0
- PE on, CPU offload + unload_modules=true + tensor return: generation completed, progress reached (100.0, 100), and pipe/components/model/text_encoder/scheduler/vae were all unloaded
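For readers who want to reproduce the pixel-identity check, here is a minimal sketch of how MSE and max abs diff can be computed for two images. It assumes both images are available as raw uint8 byte buffers of equal length; `compare_images` is a hypothetical helper written for this example, not the script used for the H100 run.

```python
def compare_images(a: bytes, b: bytes) -> tuple[float, int]:
    """Return (MSE, max absolute difference) for two raw uint8 pixel buffers."""
    if len(a) != len(b):
        raise ValueError(f"buffer sizes differ: {len(a)} vs {len(b)}")
    if not a:
        raise ValueError("buffers are empty")
    # Iterating over bytes yields ints in 0..255, so plain arithmetic is safe.
    diffs = [abs(x - y) for x, y in zip(a, b)]
    mse = sum(d * d for d in diffs) / len(diffs)
    return mse, max(diffs)


reference = bytes([0, 128, 255, 64])
candidate = bytes([0, 128, 255, 64])
mse, max_abs = compare_images(reference, candidate)
print(mse, max_abs)  # prints "0.0 0" for identical buffers
```

Reporting both numbers is useful because MSE can look negligible while a single pixel differs by a large amount; a max abs diff of 0 rules that out.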

gemini-code-assist

Code Review

This pull request introduces support for the ERNIE-Image text-to-image model by adding dedicated configuration files, a text encoder, a transformer model wrapper, a scheduler, a VAE decoder, and a runner pipeline, along with comprehensive unit tests. The review feedback highlights three key areas for improvement: resolving a potential runtime dtype mismatch in the VAE decoder by explicitly casting batch norm statistics to the latent's dtype, replacing an assertion with a proper ValueError for runtime validation in the runner, and correcting an inconsistent variable reference from model_cls to self.model_cls in the pipeline initialization.
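The second review point, replacing an assertion with a ValueError, can be sketched as follows. The specific check is an assumption: the review summary does not say what the runner asserted, so `validate_image_size` and its divisor of 16 are purely illustrative. The general motivation is that `assert` statements are removed when Python runs with `-O`, so they are unreliable for validating runtime input.

```python
def validate_image_size(height: int, width: int, multiple: int = 16) -> None:
    """Hypothetical runtime check; the divisor of 16 is an illustrative assumption."""
    # An `assert height % multiple == 0` here would be skipped under
    # `python -O`, so an explicit exception is raised instead.
    if height % multiple != 0 or width % multiple != 0:
        raise ValueError(
            f"height and width must be multiples of {multiple}, "
            f"got {height}x{width}"
        )


validate_image_size(1024, 1024)  # passes silently

try:
    validate_image_size(1000, 1024)
except ValueError as exc:
    print(f"rejected: {exc}")
```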


HanFa · 2026-06-03T22:13:49Z

@wangshankun could you review this PR? Let me know if there is a contribution guide to follow.

feat: add ernie image diffusers runner

1af42a5

gemini-code-assist Bot reviewed Jun 3, 2026


Comment thread lightx2v/models/video_encoders/hf/ernie_image/vae.py Outdated

Comment thread lightx2v/models/runners/ernie_image/ernie_image_runner.py Outdated

Comment thread lightx2v/pipeline.py Outdated

fix: address ernie image review feedback

104413d

HanFa marked this pull request as ready for review June 3, 2026 06:47

gushiqiao closed this Jun 12, 2026

HanFa wants to merge 2 commits into ModelTC:main from sutro-planet:feature/ernie-image-diffusers
