
Reapply #9842: Save some size in dtype_util when dtype selective build is not in use #10490


Open
wants to merge 2 commits into base: gh/swolchok/429/head

Conversation

swolchok
Contributor

We duplicate a lot of functions depending on the operator name so that
dtype selective build will work. We can just detect if dtype selective
build is in use and, if not, stop duplicating.
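
To illustrate the idea, here is a minimal, self-contained sketch. `EXECUTORCH_SELECTIVE_BUILD_DTYPE` and `kGenericElementwiseOpName` appear in the diff in this PR; every other name below is made up for illustration and is not the real ExecuTorch API.

```cpp
// Minimal sketch of the size-saving idea (illustrative names, not the real API).
#include <cstdio>

inline constexpr const char kGenericElementwiseOpName[] = "generic.elementwise_op";

// Templating on the operator name lets dtype selective build prune dtype
// handling per operator, but it also forces one copy of this function per
// distinct op_name in the binary.
template <const char* op_name>
void dispatch_for_op() {
  std::printf("dtype dispatch table for %s\n", op_name);
}

// When dtype selective build is NOT in use, the per-operator copies are all
// identical, so every call can be funneled through one shared instantiation.
template <const char* op_name>
void dispatch_for_op_maybe_shared() {
#ifdef EXECUTORCH_SELECTIVE_BUILD_DTYPE
  dispatch_for_op<op_name>();                   // one copy per operator
#else
  dispatch_for_op<kGenericElementwiseOpName>(); // one copy total
#endif
}

inline constexpr const char kAddName[] = "aten::add.out";
inline constexpr const char kMulName[] = "aten::mul.out";

int main() {
  // Without EXECUTORCH_SELECTIVE_BUILD_DTYPE defined, both calls collapse onto
  // the single kGenericElementwiseOpName instantiation.
  dispatch_for_op_maybe_shared<kAddName>();
  dispatch_for_op_maybe_shared<kMulName>();
}
```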

Test Plan: compared results of bash test/build_optimized_size_test.sh before/after this rev.

Before:

ExecuTorch with no ops binary size, unstripped:
-rwxr-xr-x  1 swolchok  staff  153928 Apr 25 12:24 cmake-out/test/size_test
ExecuTorch with portable ops binary size, unstripped:
-rwxr-xr-x  1 swolchok  staff  2150960 Apr 25 12:24 cmake-out/test/size_test_all_ops
ExecuTorch with optimized ops binary size, unstripped:
-rwxr-xr-x  1 swolchok  staff  5887368 Apr 25 12:24 cmake-out/test/size_test_all_optimized_ops
(.venv) swolchok@swolchok-mac ~/src/executorch> size cmake-out/test/size_test*
__TEXT	__DATA	__OBJC	others	dec	hex
81920	81920	0	4295049216	4295213056	10003c000	cmake-out/test/size_test
1474560	81920	0	4295655424	4297211904	100224000	cmake-out/test/size_test_all_ops
4489216	98304	0	4296359936	4300947456	1005b4000	cmake-out/test/size_test_all_optimized_ops

After:

ExecuTorch with no ops binary size, unstripped:
-rwxr-xr-x  1 swolchok  staff  153928 Apr 25 12:51 cmake-out/test/size_test
ExecuTorch with portable ops binary size, unstripped:
-rwxr-xr-x  1 swolchok  staff  1796928 Apr 25 12:51 cmake-out/test/size_test_all_ops
ExecuTorch with optimized ops binary size, unstripped:
-rwxr-xr-x  1 swolchok  staff  5605176 Apr 25 12:51 cmake-out/test/size_test_all_optimized_ops
(.venv) swolchok@swolchok-mac ~/src/executorch> size cmake-out/test/size_test*
__TEXT	__DATA	__OBJC	others	dec	hex
81920	81920	0	4295049216	4295213056	10003c000	cmake-out/test/size_test
1310720	81920	0	4295458816	4296851456	1001cc000	cmake-out/test/size_test_all_ops
4358144	98304	0	4296212480	4300668928	100570000	cmake-out/test/size_test_all_optimized_ops
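
For comparison: the portable-ops size test shrinks by 354,032 bytes (2,150,960 → 1,796,928) and the optimized-ops size test by 282,192 bytes (5,887,368 → 5,605,176), with __TEXT down 160 KiB and 128 KiB respectively; the no-ops baseline is unchanged.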

(This was previously reverted because the diff it was stacked on was a size
regression. This time around the stacking order is reversed, and the part of
that change that was actually regressing size has been reverted.)

[ghstack-poisoned]

pytorch-bot bot commented Apr 25, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10490

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 0277317 with merge base 1bd7260:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@@ -285,6 +289,29 @@ store_compute_to_tensor_fn<CTYPE_COMPUTE> get_store_compute_to_tensor_fn(
return nullptr;
}

#ifndef EXECUTORCH_SELECTIVE_BUILD_DTYPE
inline constexpr const char kGenericElementwiseOpName[] =
swolchok (Contributor, Author)


Marking this `inline` was needed for size: it changes the linkage so the constant isn't duplicated across translation units. This is a difference from the first attempt.
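
A small standalone illustration of the linkage point; the variable names here are placeholders, and only the `inline constexpr const char ...[]` pattern is from the diff:

```cpp
// In a header included from many .cpp files:

// A namespace-scope constexpr variable is implicitly const and therefore has
// internal linkage, so each translation unit gets its own array with its own
// address. Templates keyed on that address (e.g. via a const char* non-type
// parameter) then instantiate separately per translation unit.
constexpr const char kPerTUName[] = "generic.elementwise_op";

// `inline` (C++17) gives the variable external linkage and a single shared
// definition, so every translation unit sees the same object and the same
// address, and the linker keeps one template instantiation keyed on it.
inline constexpr const char kSharedName[] = "generic.elementwise_op";
```

Both declarations compile, but only the `inline` one avoids the duplicate instantiations that were costing binary size here.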

swolchok requested a review from kimishpatel on April 25, 2025 at 21:49
Labels: CLA Signed