Skip to content

Pull requests: pytorch/helion

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[pallas] direct-call hot-path squeezes: output cache + sig-lock + closure baking + speculative dispatch CLA Signed This label is managed by the Meta Open Source bot.
#2595 opened May 26, 2026 by choijon5 Contributor Draft
[pallas] bypass JaxCallable for static-shape kernels via direct call_custom_kernel CLA Signed This label is managed by the Meta Open Source bot.
#2594 opened May 26, 2026 by choijon5 Contributor Draft
[pallas] fast-path host-side launcher with pre-computed cache entries CLA Signed This label is managed by the Meta Open Source bot.
#2593 opened May 26, 2026 by choijon5 Contributor Draft
[pallas] matmul pipeline launcher VMEM strips + outer-grid strategy CLA Signed This label is managed by the Meta Open Source bot.
#2592 opened May 26, 2026 by choijon5 Contributor Draft
[Pallas] Fix synchronize_device skipping sync for tuple returns on TPU CLA Signed This label is managed by the Meta Open Source bot.
#2580 opened May 25, 2026 by thcmbs Collaborator Loading…
[test] Split cute compiler-pass tests by pass, drop kernel name from filenames CLA Signed This label is managed by the Meta Open Source bot.
#2579 opened May 25, 2026 by oulgen Contributor Loading…
[cute] Rewrite online softmax_two_pass → equivalent 3-pass form CLA Signed This label is managed by the Meta Open Source bot.
#2578 opened May 25, 2026 by oulgen Contributor Loading…
[cute] Optional carried-only gate for the load pipeline pass CLA Signed This label is managed by the Meta Open Source bot.
#2577 opened May 25, 2026 by oulgen Contributor Loading…
[cute] Software-pipeline inner vec loads to hide HBM latency CLA Signed This label is managed by the Meta Open Source bot.
#2576 opened May 25, 2026 by oulgen Contributor Loading…
[cute] Cleaner LICM: alias DCE + FMA-friendly scale hoist CLA Signed This label is managed by the Meta Open Source bot.
#2575 opened May 25, 2026 by oulgen Contributor Loading…
[cute] LICM for reciprocals: hoist 1/divisor out of inner loops CLA Signed This label is managed by the Meta Open Source bot.
#2574 opened May 25, 2026 by oulgen Contributor Loading…
Fix fbcode CI torch.compile fusion with newer PyTorch CLA Signed This label is managed by the Meta Open Source bot.
#2567 opened May 23, 2026 by choijon5 Contributor Loading…
Speed up Helion kernel launches by avoiding repeated Python work CLA Signed This label is managed by the Meta Open Source bot.
#2565 opened May 23, 2026 by yushangdi Contributor Draft
[Pallas] Fix layernorm example tolerances and split bwd test CLA Signed This label is managed by the Meta Open Source bot.
#2560 opened May 22, 2026 by thcmbs Collaborator Draft
[Pallas] Propagate inner tile alignment min_size to bounding outer tiles CLA Signed This label is managed by the Meta Open Source bot.
#2559 opened May 22, 2026 by thcmbs Collaborator Loading…
[Pallas] Add support for non zero dim in gather CLA Signed This label is managed by the Meta Open Source bot.
#2558 opened May 22, 2026 by thcmbs Collaborator Loading…
Reject tensor_descriptor indexing when block size exceeds tensor dim (#2555) CLA Signed This label is managed by the Meta Open Source bot. fb-exported meta-exported
#2555 opened May 22, 2026 by mengluy0125 Contributor Loading…
Skip even more Python on repeated identical calls CLA Signed This label is managed by the Meta Open Source bot.
#2537 opened May 20, 2026 by choijon5 Contributor Draft
Reuse kernel output buffers instead of allocating fresh on every call CLA Signed This label is managed by the Meta Open Source bot.
#2536 opened May 20, 2026 by choijon5 Contributor Draft
Use the fast launcher during autotuning CLA Signed This label is managed by the Meta Open Source bot.
#2535 opened May 20, 2026 by choijon5 Contributor Draft
Add a C extension so launches skip more Python frames CLA Signed This label is managed by the Meta Open Source bot.
#2534 opened May 20, 2026 by choijon5 Contributor Draft
Speed up Helion kernel launches by avoiding repeated Python work CLA Signed This label is managed by the Meta Open Source bot.
#2533 opened May 20, 2026 by choijon5 Contributor Draft
[WIP] Pallas grid index map fp8 attention CLA Signed This label is managed by the Meta Open Source bot.
#2530 opened May 20, 2026 by thcmbs Collaborator Draft
[WIP] Fix Pallas grid index BlockSpecs CLA Signed This label is managed by the Meta Open Source bot.
#2529 opened May 20, 2026 by thcmbs Collaborator Draft
[Pallas] Reclaim HBM between kernels in run_tpu.py sweep CLA Signed This label is managed by the Meta Open Source bot.
#2495 opened May 20, 2026 by norx1991 Contributor Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.