Skip to content

Conversation

@larryliu0820
Copy link
Contributor

Key Changes:

  • Create new ASR runner extension in extension/asr/runner/ with reusable runner components (runner.h/cpp)
  • Update CMake configuration files to support ASR runner builds (executorch-config.cmake, default.cmake, llm.cmake)
  • Add new Whisper model example in examples/models/whisper/ with CMake build, README, and main.cpp runner
  • Bump optimum-executorch commit pin for Whisper support
  • Update CUDA CI workflow for testing

This change enables automatic speech recognition (ASR) capabilities in ExecuTorch with Whisper as the first supported model, following a similar pattern to the existing LLM runner infrastructure.

@pytorch-bot
Copy link

pytorch-bot bot commented Oct 31, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15486

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

❌ 1 New Failure, 17 Pending, 1 Unrelated Failure

As of commit 23b7c15 with merge base cc72b35 (image):

NEW FAILURE - The following job has failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 31, 2025
@larryliu0820 larryliu0820 added the release notes: llm Changes to llm utilities label Oct 31, 2025
Copy link
Contributor

@Gasoonjia Gasoonjia left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM on my side. I will let @jackzhxng give the stamp about asr_runner.

Copy link
Contributor

@Gasoonjia Gasoonjia left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

overall LGTM

use it

**Key Changes:**

*   Create new ASR runner extension in `extension/asr/runner/` with reusable runner components (runner.h/cpp)
*   Update CMake configuration files to support ASR runner builds (executorch-config.cmake, default.cmake, llm.cmake)
*   Add new Whisper model example in `examples/models/whisper/` with CMake build, README, and main.cpp runner
*   Bump optimum-executorch commit pin for Whisper support
*   Update CUDA CI workflow for testing

This change enables automatic speech recognition (ASR) capabilities in ExecuTorch with Whisper as the first supported model, following a similar pattern to the existing LLM runner infrastructure.
Copy link
Contributor

@mergennachin mergennachin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work @larryliu0820

A few comments before landing

#include <executorch/runtime/core/result.h>
#include <pytorch/tokenizers/tokenizer.h>

namespace executorch::extension::asr {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment on lines +76 to +79
if(EXECUTORCH_BUILD_METAL)
list(APPEND _link_libraries metal_backend)
executorch_target_link_options_shared_lib(metal_backend)
endif()
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is metal supported?

@manuelcandales

Copy link
Contributor

@mergennachin mergennachin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks

@larryliu0820 larryliu0820 merged commit 5e06650 into main Nov 4, 2025
309 of 315 checks passed
@larryliu0820 larryliu0820 deleted the asr_runner branch November 4, 2025 00:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. release notes: llm Changes to llm utilities

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants