Skip to content

Pull requests: SemiAnalysisAI/InferenceX

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

MinimaxM2.5-FP8-MI325x-vLLM: pin AITER FA attention backend
#1594 opened May 30, 2026 by chunfangamd Collaborator Loading…
ci(disagg): fail before writing result file + surface real failure class
#1591 opened May 29, 2026 by arygupt Collaborator Loading…
[WIP] Update DSv4 B300 vllm image tag full-sweep-enabled
#1588 opened May 29, 2026 by wzhao18 Collaborator Loading…
[AMD] improve dsr1 fp4 disagg AMD full-sweep-enabled
#1584 opened May 29, 2026 by billishyahao Collaborator Loading…
feat(power): per-worker prefill/decode power + role-split joules (stacked on #1574)
#1577 opened May 28, 2026 by arygupt Collaborator Loading…
1 of 3 tasks
[NV] Update B300 DSV4 SGLang Pareto sweep full-sweep-enabled
#1575 opened May 27, 2026 by Ankur-singh Collaborator Loading…
feat(power): multinode measured-power aggregation full-sweep-enabled
#1574 opened May 27, 2026 by arygupt Collaborator Loading…
3 of 6 tasks
[WIP] Chore/agentx v0.3
#1571 opened May 27, 2026 by cquil11 Collaborator Loading…
Update glm-5 b200 sglang image to nightly-dev-cu13-20260523-c112f762 non-canary-full-sweep-enabled Run the full sweep without the canary gate (full search space, no trim)
#1567 opened May 26, 2026 by Ankur-singh Collaborator Loading…
[NV] H100 (Agg): migrate model path sweep-enabled
#1537 opened May 20, 2026 by Ankur-singh Collaborator Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.