[aoti-et] Add an ASR runner and an Whisper example to showcase how to use it #15486

larryliu0820 · 2025-10-31T06:36:32Z

Key Changes:

Create new ASR runner extension in extension/asr/runner/ with reusable runner components (runner.h/cpp)
Update CMake configuration files to support ASR runner builds (executorch-config.cmake, default.cmake, llm.cmake)
Add new Whisper model example in examples/models/whisper/ with CMake build, README, and main.cpp runner
Bump optimum-executorch commit pin for Whisper support
Update CUDA CI workflow for testing

This change enables automatic speech recognition (ASR) capabilities in ExecuTorch with Whisper as the first supported model, following a similar pattern to the existing LLM runner infrastructure.

pytorch-bot · 2025-10-31T06:36:36Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15486

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

ROCm failures during provisioning step due to network issues

❌ 1 New Failure, 17 Pending, 1 Unrelated Failure

As of commit 23b7c15 with merge base cc72b35 ():

NEW FAILURE - The following job has failed:

trunk / test-arm-backend (test_pytest_models_ethosu_fvp) / linux-job (gh)
RuntimeError: Command docker exec -t 5f6e7a94e300dcb5041f9faa0d1d9848da9f8b6d292cbb46500eb4fb88f84653 /exec failed with exit code 1

FLAKY - The following job failed but was likely due to flakiness present on trunk:

docker-builds / docker-build (linux.4xlarge, executorch-ubuntu-22.04-gcc9) (gh) (detected as infra flaky with no log or failing log classifier)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Gasoonjia

LGTM on my side. I will let @jackzhxng give the stamp about asr_runner.

extension/asr/runner/runner.cpp

Gasoonjia

overall LGTM

use it **Key Changes:** * Create new ASR runner extension in `extension/asr/runner/` with reusable runner components (runner.h/cpp) * Update CMake configuration files to support ASR runner builds (executorch-config.cmake, default.cmake, llm.cmake) * Add new Whisper model example in `examples/models/whisper/` with CMake build, README, and main.cpp runner * Bump optimum-executorch commit pin for Whisper support * Update CUDA CI workflow for testing This change enables automatic speech recognition (ASR) capabilities in ExecuTorch with Whisper as the first supported model, following a similar pattern to the existing LLM runner infrastructure.

.github/workflows/cuda.yml

mergennachin

Great work @larryliu0820

A few comments before landing

extension/asr/runner/runner.h

examples/models/whisper/README.md

.ci/scripts/export_model_cuda_artifact.sh

mergennachin · 2025-11-03T16:02:32Z

extension/asr/runner/runner.h

+#include <executorch/runtime/core/result.h>
+#include <pytorch/tokenizers/tokenizer.h>
+
+namespace executorch::extension::asr {


Once this lands, we can renew this PR and add language bindings

https://github.com/pytorch/executorch/pull/13525/files#diff-22a92f60f6a05a9cf1de00b0174c7d548066f60c935cd47dc3e288463b046149

extension/asr/runner/runner.h

mergennachin · 2025-11-03T16:06:17Z

examples/models/whisper/CMakeLists.txt

+if(EXECUTORCH_BUILD_METAL)
+  list(APPEND _link_libraries metal_backend)
+  executorch_target_link_options_shared_lib(metal_backend)
+endif()


Is metal supported?

@manuelcandales

.github/workflows/cuda.yml

.ci/scripts/test_model_cuda_e2e.sh

extension/asr/runner/runner.h

.ci/scripts/test_model_cuda_e2e.sh

mergennachin

Thanks

larryliu0820 requested review from jackzhxng, kirklandsign and lucylq as code owners October 31, 2025 06:36

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 31, 2025

larryliu0820 requested a review from mergennachin October 31, 2025 06:37

larryliu0820 added the release notes: llm Changes to llm utilities label Oct 31, 2025

larryliu0820 requested a review from Gasoonjia October 31, 2025 06:38

Gasoonjia reviewed Oct 31, 2025

View reviewed changes

extension/asr/runner/runner.cpp Outdated Show resolved Hide resolved

Gasoonjia approved these changes Oct 31, 2025

View reviewed changes

larryliu0820 added 5 commits November 2, 2025 10:27

Fix cuda.yml

c28431d

Fix cuda.yml

305381f

Address comment

ea90759

Create scripts for export and e2e run

f348cee

larryliu0820 force-pushed the asr_runner branch from 2484896 to f348cee Compare November 2, 2025 21:27

larryliu0820 and others added 4 commits November 2, 2025 13:46

lint

ca343e7

Install extra deps

779a7ce

Fix artifact name

cf8d6be

Fix CI

2ef76b7

jackzhxng approved these changes Nov 3, 2025

View reviewed changes

.github/workflows/cuda.yml Outdated Show resolved Hide resolved

mergennachin requested changes Nov 3, 2025

View reviewed changes

jackzhxng reviewed Nov 3, 2025

View reviewed changes

.github/workflows/cuda.yml Outdated Show resolved Hide resolved

mergennachin reviewed Nov 3, 2025

View reviewed changes

.ci/scripts/test_model_cuda_e2e.sh Show resolved Hide resolved

mergennachin reviewed Nov 3, 2025

View reviewed changes

extension/asr/runner/runner.h Outdated Show resolved Hide resolved

mergennachin reviewed Nov 3, 2025

View reviewed changes

extension/asr/runner/runner.h Outdated Show resolved Hide resolved

mergennachin reviewed Nov 3, 2025

View reviewed changes

extension/asr/runner/runner.h Show resolved Hide resolved

.ci/scripts/test_model_cuda_e2e.sh Outdated Show resolved Hide resolved

Address comments

a20f528

mergennachin approved these changes Nov 3, 2025

View reviewed changes

More fixes

9da10a3

larryliu0820 added 2 commits November 3, 2025 13:55

More fixes

81ac56f

Install ffmpeg and use unsloth tokenizer.json

23b7c15

larryliu0820 merged commit 5e06650 into main Nov 4, 2025
309 of 315 checks passed

larryliu0820 deleted the asr_runner branch November 4, 2025 00:10

[aoti-et] Add an ASR runner and an Whisper example to showcase how to use it #15486

[aoti-et] Add an ASR runner and an Whisper example to showcase how to use it #15486

Uh oh!

Conversation

larryliu0820 commented Oct 31, 2025

Uh oh!

pytorch-bot bot commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15486

❗ 1 Active SEVs

❌ 1 New Failure, 17 Pending, 1 Unrelated Failure

Uh oh!

Gasoonjia left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Gasoonjia left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mergennachin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mergennachin Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mergennachin Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mergennachin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

pytorch-bot bot commented Oct 31, 2025 •

edited

Loading