Skip to content

[Qwen3.5] Onboard model to Checkpointing Util & Verify Correctness#3839

Open
Rohan-Bierneni wants to merge 2 commits intomainfrom
rbierneni-qwen35-checkpoint
Open

[Qwen3.5] Onboard model to Checkpointing Util & Verify Correctness#3839
Rohan-Bierneni wants to merge 2 commits intomainfrom
rbierneni-qwen35-checkpoint

Conversation

@Rohan-Bierneni
Copy link
Copy Markdown
Collaborator

Description

This PR onboard the Qwen3.5 model to the checkpointing util. Also we now use the converted hf -> maxtext checkpoint to run forward_pass_logit checker and verify model correctness.

Tests

Results of forward_pass_logit_checker:

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

Add config file for 397B model

update attentions.py with new decoder block type

Update other files with new model to ensure model initialization is correct

Update decoder block type

Train Compile test is passing

resolve nits in config file formatting

resolve formatting errors

Fix conflict in maxtext_utils

Fix linter errors

Fix linter errors

Fix linter errors

Ran pyink locally for formatting

Fix naming for config file

Add code for param_mapping for qwen3.5

Add hook fn function for Qwen3.5

Update hook logic for 1 to n mt -> hf hook fns
@codecov
Copy link
Copy Markdown

codecov Bot commented May 7, 2026

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant