enable 3d weights for NVFP4Tensor #3109

vkuzo · 2025-10-01T19:53:01Z

Summary:

Enables NVFP4Tensor to be created from a 3d weight. Note that slicing is gated to 2d tensors for now; we can enable it for 3d tensors in a future PR if needed.

This is needed for vLLM stitching 2d weights into a 3d weight for MoEs.
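To illustrate why a stitched 3d weight can be handled by carrying the leading dimension through, here is a minimal pure-Python sketch. It is not the torchao implementation: it only computes a per-block maximum absolute value along the last dimension, using the block size of 16 mentioned in the review discussion. The helper names `block_amax` and `per_expert_block_amax` and the sample expert weights are hypothetical.

```python
BLOCK_SIZE = 16  # NVFP4 block size, as stated in the review discussion


def block_amax(row, block_size=BLOCK_SIZE):
    """Per-block max absolute value along one row of length K."""
    assert len(row) % block_size == 0, "K must be a multiple of the block size"
    return [
        max(abs(v) for v in row[i:i + block_size])
        for i in range(0, len(row), block_size)
    ]


def per_expert_block_amax(experts):
    """experts: list of E 2d weights (each a list of M rows of length K).

    Returns a nested list of shape [E][M][K // BLOCK_SIZE]. The leading
    (expert) dimension is simply carried through; each row is processed
    exactly as in the 2d case.
    """
    return [[block_amax(row) for row in weight] for weight in experts]


# Two hypothetical experts, each a 2 x 32 weight.
expert_a = [[float(k) for k in range(32)], [-float(k) for k in range(32)]]
expert_b = [[0.5] * 32, [2.0] * 16 + [-3.0] * 16]

amax = per_expert_block_amax([expert_a, expert_b])
print(len(amax), len(amax[0]), len(amax[0][0]))  # 2 2 2
print(amax[1][1])  # [2.0, 3.0]
```

Because the blocks run along the last dimension only, adding an expert dimension in front does not change the per-row computation.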

Test Plan:

pytest test/prototype/mx_formats/ -s


vkuzo · 2025-10-01T19:53:02Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2025-10-01T19:53:05Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3109

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit baf7568 with merge base 8955739:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: doesn't work yet, stay tuned. This is needed for vLLM stitching 2d weights into a 3d weight for MoEs.
ghstack-source-id: 9f4b94d
ghstack-comment-id: 3357908175
Pull-Request: #3109

[ghstack-poisoned]


jcaip · 2025-10-02T20:55:33Z

torchao/prototype/mx_formats/nvfp4_tensor.py

-        assert len(data_hp.shape) == 2, "unsupported"
-        M, K = data_hp.shape[0], data_hp.shape[1]
+        assert len(data_hp.shape) in (2, 3), "unsupported"
+        leading_dims, M, K = data_hp.shape[:-2], data_hp.shape[-2], data_hp.shape[-1]


Just FYI, I think you can do:

*leading_dims, M, K = data_hp.shape
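The two forms can be compared side by side. This is a minimal sketch that uses plain tuples as stand-ins for `data_hp.shape` (an assumption made so the example runs without PyTorch); the function names are hypothetical. Both forms yield the same `M` and `K` for 2d and 3d shapes, with one difference: slicing preserves the sequence type, while starred unpacking always produces a list.

```python
def split_by_slicing(shape):
    """Form used in the diff: slice off the leading dims explicitly."""
    leading_dims, M, K = shape[:-2], shape[-2], shape[-1]
    return leading_dims, M, K


def split_by_unpacking(shape):
    """Form suggested in review: starred unpacking."""
    *leading_dims, M, K = shape
    return leading_dims, M, K


shape_2d = (128, 64)
shape_3d = (8, 128, 64)

print(split_by_slicing(shape_2d))    # ((), 128, 64)
print(split_by_unpacking(shape_2d))  # ([], 128, 64)
print(split_by_slicing(shape_3d))    # ((8,), 128, 64)
print(split_by_unpacking(shape_3d))  # ([8], 128, 64)
```

If `leading_dims` is later used to build a new shape, splatting it as `(*leading_dims, M, K)` works with either form.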

jcaip · 2025-10-02T20:58:04Z

torchao/prototype/mx_formats/nvfp4_tensor.py

+    new = NVFP4Tensor(
+        new_qdata,
+        new_scale,
+        old._block_size,


Would the block size change with transpose?

Currently, block_size is an integer for this tensor (16 for NVFP4), so transposing leaves it unchanged. If we change it to a multidimensional block, we'd have to update this code.
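As a sketch of what that update might look like: the helper below is hypothetical and is not part of torchao. It assumes a multidimensional block would be expressed as a tuple with one entry per tensor dimension, and that a transpose swaps the last two dimensions.

```python
def transpose_block_size(block_size):
    """Return the block size to use after transposing the last two dims.

    Hypothetical helper: an integer block size (the current NVFP4Tensor
    convention, e.g. 16) has no per-dimension layout, so it is returned
    unchanged. A tuple block size has its last two entries swapped.
    """
    if isinstance(block_size, int):
        return block_size
    *leading, rows, cols = block_size
    return (*leading, cols, rows)


print(transpose_block_size(16))          # 16
print(transpose_block_size((1, 16)))     # (16, 1)
print(transpose_block_size((1, 1, 16)))  # (1, 16, 1)
```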

vkuzo added 7 commits October 1, 2025 11:07

Update

f9ca2f8

[ghstack-poisoned]

Update

7da7826

[ghstack-poisoned]

Update

fa40093

[ghstack-poisoned]

Update

e10a16e

[ghstack-poisoned]

Update

9d0590b

[ghstack-poisoned]

Update

08e9d13

[ghstack-poisoned]

Update

7b76009

[ghstack-poisoned]

vkuzo mentioned this pull request Oct 1, 2025

make scale shape 2d and match qdata shape in NVFP4Tensor #3108

Merged

meta-cla bot added the CLA Signed label Oct 1, 2025

Update

55cc5e8

[ghstack-poisoned]

vkuzo changed the base branch from gh/vkuzo/134/head to main October 2, 2025 11:34

Update

7ce6dcf

[ghstack-poisoned]

vkuzo added the topic: improvement label Oct 2, 2025

Update

dffb91c

[ghstack-poisoned]

Update

55c361f

[ghstack-poisoned]

vkuzo changed the title ~~[wip] enable 3d weights for NVFP4Tensor~~ enable 3d weights for NVFP4Tensor Oct 2, 2025

Update

eb82d5f

[ghstack-poisoned]

vkuzo mentioned this pull request Oct 2, 2025

enable select for NVFP4Tensor #3117

Open

Update

baf7568

[ghstack-poisoned]

jcaip approved these changes Oct 2, 2025
