Let `torchaudio.load()` and `torchaudio.save()` rely on `load_with_torchcodec()` and `save_with_torchcodec()`. #4039

samanklesaria · 2025-08-12T19:54:46Z

This PR wraps the load_with_torchcodec and save_with_torchcodec functions with functions of the name load and save so that code that depends on the old load and save functions can continue to work in the future once we remove backend-specific code.

pytorch-bot · 2025-08-12T19:54:50Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/audio/4039

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 8 New Failures

As of commit 498ce49 with merge base 02351a6 ():

NEW FAILURES - The following jobs have failed:

.github/workflows/bandit.yml (gh)
.github/workflows/integration-test.yml (gh)
.github/workflows/lint.yml (gh)
.github/workflows/pr-labels.yml (gh)
.github/workflows/unittest-macos-cpu.yml (gh)
.github/workflows/unittest-windows-cpu.yml (gh)
.github/workflows/unittest-windows-gpu.yml (gh)
Build documentation / build / Build doc (gh)
At least one of the pre-conditions you specified did not hold

This comment was automatically generated by Dr. CI and updates every 15 minutes.

src/torchaudio/__init__.py

samanklesaria · 2025-08-13T16:34:58Z

Installing ffmpeg>4, which is necessary for torchcodec to be able to load files used during testing, seems to be incompatible with the current CI infrastructure. Perhaps we need a separate PR to install ffmpeg>4, and wait for the infrastructure to improve so that that PR can be merged.

This reverts commit 80f5eb7.

This reverts commit 74edc0a.

samanklesaria · 2025-08-14T00:55:01Z

I'm coming across an interesting discrepancy in the load behavior here. In test_load_save_torchcodec.py, the function test_save_channels_first creates a random tensor called waveform. When I try it, I get waveform[0,1345] = -4.0688. Saving and loading the result with scipy gives waveform_ta[0,1345] = -4.0688. But saving with torchcodec and loading with scipy gives waveform[0,1345] = -1.. What exactly is happening here? Are we supposed to be truncating saved values between -1 and 1? Is the normalization that torchcodec is doing different than the normalization scipy does?
I guess the docstring for save_with_torchcodec says "TorchCodec AudioEncoder expects float32 samples in [-1, 1] range". Does this mean the behavior of this test is undefined when our random waveform has larger values?

An easy fix here is just to use rand instead of randn to initialize our random tensor.

NicolasHug · 2025-08-14T08:25:16Z

Your assessment on test_load_save_torchcodec.py are correct, it's would have been best to ensure that input values of the encoder are in [-1, 1] rather than to use randn.

However, this entire file is meant to test the old torchaudio load() and save() against load_with_torchcodec() and save_with_torchcodec()

Now that we are moving load() and save() to rely on load_with_torchcodec() and save_with_torchcodec() directly, these tests aren't worth running anymore. Even more, we are switching torchcodec for scipy - so these tests are not testing what they were meant to test anymore.

Basically, we can just skip them safely. Just add a big pytest.skip() statement at the top, indicating this test file was only relevant back when load_with_torchcodec() and save_with_torchcodec() were introduced and load() and save() still had their own implementation.

src/torchaudio/__init__.py

samanklesaria · 2025-08-14T15:03:10Z

Now we're failing because core dumped) python -c "import torch; import torchaudio; import torchcodec; print(torch.__version__, torchaudio.__version__, torchcodec.__version__)" fails. Because I never defined __version__ in my mock module. But do I need it? It might be easiest to remove this check in the install script.

samanklesaria · 2025-08-14T16:54:50Z

I'm going to remove the installation of torchcodec during testing, as it shouldn't be used anyway. We should rely on the mock.

src/torchaudio/utils/wav_utils.py

NicolasHug · 2025-08-18T11:14:00Z

src/torchaudio/__init__.py

+from typing import Union, BinaryIO, Optional, Tuple
+import os
+import torch
+import sys


sys doesn't seem to be used

samanklesaria and others added 5 commits July 18, 2025 19:57

Add torchcodec mock with wav loading and saving

2e25279

Merge branch 'main' into test_wav_hack

fe375f4

Let load and save rely on *_with_torchcodec

a300221

install torchcodec in doc job

07e3b77

Add docstring and arguments for load and save

92719d3

meta-cla bot added the CLA Signed label Aug 12, 2025

NicolasHug reviewed Aug 13, 2025

View reviewed changes

src/torchaudio/__init__.py Outdated Show resolved Hide resolved

samanklesaria added 4 commits August 13, 2025 14:42

Revise docstring

4a98ee5

Add typing imports

7b02754

Try ffmpeg>4

74edc0a

Install conda deps before pip deps

80f5eb7

samanklesaria added 7 commits August 13, 2025 18:11

Add scipy hack for load and save

7f063a6

Only import scipy during testing

700c6c9

Revert "Install conda deps before pip deps"

6995b21

This reverts commit 80f5eb7.

Revert "Try ffmpeg>4"

4ab5993

This reverts commit 74edc0a.

Revert torchcodec installation changes

43c4602

Use existing wav_utils

f74f004

Support frame_offset and num_frames in load hack

953fc65

Use rand instead of randn for test_save_channels_first

dd3ff90

samanklesaria marked this pull request as ready for review August 14, 2025 03:46

samanklesaria requested a review from a team as a code owner August 14, 2025 03:46

NicolasHug reviewed Aug 14, 2025

View reviewed changes

src/torchaudio/__init__.py Outdated Show resolved Hide resolved

samanklesaria added 2 commits August 14, 2025 14:34

Merge branch 'test_wav_hack' into torchcodec_loading

72539b9

Remove pytest-aware code in src

c94e011

samanklesaria marked this pull request as draft August 14, 2025 14:58

samanklesaria added 4 commits August 14, 2025 15:08

Remove torchcodec version check

b622d82

Fix bugs in torchcodec mock

93351a2

Skip test_load_save_torchcodec

5407163

Correct call to pytest skip

bd7eb52

Remove torchcodec installation

c3d0cc2

samanklesaria mentioned this pull request Aug 14, 2025

Remove dependencies from doc job #4043

Open

NicolasHug reviewed Aug 15, 2025

View reviewed changes

src/torchaudio/utils/wav_utils.py Outdated Show resolved Hide resolved

samanklesaria force-pushed the torchcodec_loading branch from 6d2ba1b to c3d0cc2 Compare August 15, 2025 15:02

Add torchcodec to build installation

d10fc19

samanklesaria marked this pull request as ready for review August 15, 2025 16:46

samanklesaria and others added 5 commits August 15, 2025 16:48

Remove redundant wav_utils

92fee51

Merge branch 'main' of github.com:pytorch/audio into torchcodec_loading

cc37073

remove sys

2646e59

Add comments

6c43c04

clarify comment

498ce49

NicolasHug changed the title ~~Torchcodec loading~~ Let torchaudio.load() and torchaudio.save() rely on load_with_torchcodec() and save_with_torchcodec(). Aug 18, 2025

NicolasHug approved these changes Aug 18, 2025

View reviewed changes

NicolasHug merged commit 93f582c into main Aug 18, 2025
42 of 43 checks passed

NicolasHug deleted the torchcodec_loading branch August 18, 2025 12:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Let `torchaudio.load()` and `torchaudio.save()` rely on `load_with_torchcodec()` and `save_with_torchcodec()`. #4039

Let `torchaudio.load()` and `torchaudio.save()` rely on `load_with_torchcodec()` and `save_with_torchcodec()`. #4039

samanklesaria commented Aug 12, 2025

Uh oh!

pytorch-bot bot commented Aug 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

samanklesaria commented Aug 13, 2025

Uh oh!

samanklesaria commented Aug 14, 2025 •

edited

Loading

Uh oh!

NicolasHug commented Aug 14, 2025

Uh oh!

Uh oh!

samanklesaria commented Aug 14, 2025 •

edited

Loading

Uh oh!

samanklesaria commented Aug 14, 2025 •

edited

Loading

Uh oh!

Uh oh!

NicolasHug Aug 18, 2025

Uh oh!

Uh oh!

Uh oh!

Let torchaudio.load() and torchaudio.save() rely on load_with_torchcodec() and save_with_torchcodec(). #4039

Let torchaudio.load() and torchaudio.save() rely on load_with_torchcodec() and save_with_torchcodec(). #4039

Conversation

samanklesaria commented Aug 12, 2025

Uh oh!

pytorch-bot bot commented Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/audio/4039

❌ 8 New Failures

Uh oh!

Uh oh!

samanklesaria commented Aug 13, 2025

Uh oh!

samanklesaria commented Aug 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NicolasHug commented Aug 14, 2025

Uh oh!

Uh oh!

samanklesaria commented Aug 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

samanklesaria commented Aug 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

NicolasHug Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Let `torchaudio.load()` and `torchaudio.save()` rely on `load_with_torchcodec()` and `save_with_torchcodec()`. #4039

Let `torchaudio.load()` and `torchaudio.save()` rely on `load_with_torchcodec()` and `save_with_torchcodec()`. #4039

pytorch-bot bot commented Aug 12, 2025 •

edited

Loading

samanklesaria commented Aug 14, 2025 •

edited

Loading

samanklesaria commented Aug 14, 2025 •

edited

Loading

samanklesaria commented Aug 14, 2025 •

edited

Loading