Convert TF and Numpy ops in whisper_audio_convert.py to Keras Ops #2225

LakshmiKalaKadali · 2025-04-23T05:30:10Z

This PR is in continuation to the draft PR .

Converted Numpy and TF ops into Keras ops in whisper_audio_converter.py and in whisper_audio_converter_test.py

mattdangerw · 2025-04-25T01:36:19Z

keras_hub/src/models/whisper/whisper_audio_converter_test.py

        # Verify output.
        expected = [1.1656, 1.0151, -0.8343, -0.8343, -0.8343]
-        self.assertAllClose(outputs[:, 0], expected, atol=0.01, rtol=0.01)
+        self.assertAllClose(outputs[:, 0], expected, atol=0.01, rtol=0.01)


keep newlines at end of files.

mattdangerw · 2025-04-25T01:37:52Z

keras_hub/src/models/whisper/whisper_audio_converter.py


        # Pad audio.
-        audio_shape = audio.shape.as_list()
+        audio_shape = list(audio.shape)


I believe tf.shape cannot always be listified like this. Maybe call ops.shape?

mattdangerw · 2025-04-25T01:38:34Z

@abheesht17 if you have time can you review this? You probably have the most insight here.

mattdangerw · 2025-04-25T01:39:48Z

keras_hub/src/models/whisper/whisper_audio_converter.py

@@ -1,14 +1,10 @@
-import numpy as np
+import keras.ops as ops
+import tensorflow as tf


We can never do a bare import of tf like this. Check other files in the library.

mattdangerw · 2025-04-25T01:40:23Z

keras_hub/src/models/whisper/whisper_audio_converter.py

+            audio = ops.expand_dims(audio, 0)

        # Convert the tensor to a Ragged Tensor.
        if isinstance(audio, tf.Tensor):


We could probably switch to a @preprocessing_function annotation and remove this? Something to try at least.

LakshmiKalaKadali marked this pull request as ready for review April 24, 2025 06:41

mattdangerw reviewed Apr 25, 2025

View reviewed changes

mattdangerw requested a review from abheesht17 April 25, 2025 01:38

mattdangerw reviewed Apr 25, 2025

View reviewed changes

Convert TF and Numpy ops in whisper_audio_convert.py to Keras Ops

ee12be5

LakshmiKalaKadali force-pushed the lakshmikala branch from b4bfba9 to ee12be5 Compare July 30, 2025 12:52

LakshmiKalaKadali added 2 commits July 30, 2025 20:24

updated audio converter

0cc36d4

updated audio converter

655b2e7

LakshmiKalaKadali requested a review from mattdangerw July 31, 2025 04:28

sachinprasadhs added the kokoro:force-run Runs Tests on GPU label Sep 16, 2025

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Sep 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Convert TF and Numpy ops in whisper_audio_convert.py to Keras Ops #2225

Convert TF and Numpy ops in whisper_audio_convert.py to Keras Ops #2225

Uh oh!

LakshmiKalaKadali commented Apr 23, 2025

Uh oh!

mattdangerw Apr 25, 2025

Uh oh!

mattdangerw Apr 25, 2025

Uh oh!

mattdangerw commented Apr 25, 2025

Uh oh!

mattdangerw Apr 25, 2025

Uh oh!

mattdangerw Apr 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Convert TF and Numpy ops in whisper_audio_convert.py to Keras Ops #2225

Are you sure you want to change the base?

Convert TF and Numpy ops in whisper_audio_convert.py to Keras Ops #2225

Uh oh!

Conversation

LakshmiKalaKadali commented Apr 23, 2025

Uh oh!

mattdangerw Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

mattdangerw Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

mattdangerw commented Apr 25, 2025

Uh oh!

mattdangerw Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

mattdangerw Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants