Add TPU support #629
Conversation
Pull Request Overview
This PR adds support for TPU (Tensor Processing Unit) devices by introducing XLA backend support to the infinity embedding library. The changes enable automatic detection and configuration of TPU devices through Google's XLA framework.
- Add "xla" as a new device type option
- Implement TPU device detection and configuration logic
- Add optional import handling for torch_xla dependency
Reviewed Changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| primitives.py | Adds "xla" as a new device enum value |
| loading_strategy.py | Implements TPU detection logic and device configuration for XLA backend |
| _optional_imports.py | Adds torch_xla as an optional dependency with proper import checking |
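For orientation, here is a minimal sketch (not taken from the diff itself) of what the enum addition described above might look like, assuming a plain enum.Enum-style Device class; the exact set of existing members in primitives.py may differ:

```python
# primitives.py (sketch): the new device value alongside the existing ones
import enum

class Device(enum.Enum):
    cpu = "cpu"
    cuda = "cuda"
    mps = "mps"
    xla = "xla"  # new: XLA backend, used for TPUs
    auto = "auto"
```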
The comment below refers to this excerpt from the diff:

```python
from transformers import is_torch_npu_available  # type: ignore
from transformers.utils.import_utils import is_torch_xla_available

if CHECK_XLA.is_available:
```
Copilot AI commented on Aug 6, 2025:
The torch_xla import is only conditionally executed when CHECK_XLA.is_available is true, but torch_xla is used unconditionally on line 69 in the device count check. This will cause a NameError when XLA is available but torch_xla fails to import for other reasons.
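One way to make the call site robust is sketched below with a plain try/except instead of the CHECK_XLA guard (the suggested change further down takes the equivalent else-branch approach):

```python
# Sketch: bind torch_xla to None when the optional package is absent, so later
# calls can be gated on `torch_xla is not None` instead of risking a NameError.
try:
    import torch_xla
except ImportError:
    torch_xla = None

device_count = torch_xla.device_count() if torch_xla is not None else 0
```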
Greptile Summary
This PR adds TPU (Tensor Processing Unit) support to the infinity_emb library, enabling users to run embedding models on Google's specialized ML hardware accelerators. The changes introduce XLA (Accelerated Linear Algebra) device support through three key modifications:
- Device Type Addition: Adds `xla = "xla"` to the `Device` enum in `primitives.py`, following the established pattern for device types like CUDA and MPS
- Optional Import Management: Introduces `CHECK_XLA = OptionalImports("torch_xla", "torch_xla")` in `_optional_imports.py` to handle the torch_xla dependency gracefully when not available
- Loading Strategy Integration: Updates `loading_strategy.py` with XLA device auto-detection logic using `is_torch_xla_available()` from transformers, device counting via `torch_xla.device_count()`, and proper device validation
The implementation integrates with the existing device auto-detection system, allowing TPUs to be automatically selected when available or explicitly specified by users. This extends the library's deployment capabilities beyond traditional CPU/GPU setups to include Google Cloud TPU instances and Colab TPU environments, potentially offering significant performance improvements for large-scale embedding computations.
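As a rough sketch of how such auto-detection could fit together (the helper name resolve_device is hypothetical; the actual logic in loading_strategy.py may be structured differently):

```python
from transformers.utils.import_utils import is_torch_xla_available

def resolve_device(requested: str) -> tuple[str, int]:
    # Hypothetical helper: returns (device_type, device_count).
    if requested in ("auto", "xla") and is_torch_xla_available():
        try:
            import torch_xla  # optional; may be missing even if XLA is reported available
            return "xla", torch_xla.device_count()
        except ImportError:
            pass  # fall back to CPU below
    return "cpu", 1
```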
Confidence score: 2/5
- This PR has significant dependency and integration issues that could prevent TPU functionality from working properly
- Score reflects missing torch_xla dependency in pyproject.toml, potential import failures, and lack of comprehensive error handling
- Pay close attention to loading_strategy.py and the missing dependency configuration in pyproject.toml
3 files reviewed, 2 comments
The comment below refers to this excerpt:

```python
if CHECK_XLA.is_available:
    import torch_xla
```
logic: the `torch_xla` import is only guarded by `CHECK_XLA.is_available`, but `torch_xla.device_count()` is called unconditionally on line 69
Suggested change:

```python
if CHECK_XLA.is_available:
    import torch_xla
else:
    torch_xla = None
```
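Binding `torch_xla` to `None` in the else branch keeps the name defined on every code path, so downstream code such as the device-count check can test `torch_xla is not None` (or re-check `CHECK_XLA.is_available`) instead of raising a `NameError` when the optional package is missing.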
Linting problem related to this PR: pytorch/xla#9515
michaelfeil left a comment:
Have not reviewed!
Waiting for 3.12 wheels to add
Codecov Report

❌ Patch coverage is

Additional details and impacted files:

```
@@            Coverage Diff             @@
##             main     #629      +/-   ##
==========================================
- Coverage   79.60%   79.48%   -0.12%
==========================================
  Files          43       43
  Lines        3486     3495       +9
==========================================
+ Hits         2775     2778       +3
- Misses        711      717       +6
```

☔ View full report in Codecov by Sentry.
looks good to me @wirthual
@wirthual we can add it without changes to the lock file.
@michaelfeil Using the lock file from main would result in a poetry error that the lock file is outdated. Should we consider not committing the lock file at all?
NO lock file, no change to pyproject for now.
Let's merge it like this.
Add support for TPU.
Tested on a Google Colab with a Cloud TPU v6e (Trillium).
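For reference, usage on such a TPU runtime might look roughly like the sketch below; it assumes the new "xla" value can be passed through EngineArgs' device field the same way "cuda" or "cpu" are today, which is illustrative rather than a confirmed part of the public API:

```python
# Illustrative only: assumes EngineArgs accepts device="xla" after this PR,
# mirroring how "cuda" or "cpu" are passed today.
import asyncio
from infinity_emb import AsyncEngineArray, EngineArgs

array = AsyncEngineArray.from_args([
    EngineArgs(model_name_or_path="BAAI/bge-small-en-v1.5", device="xla")
])
engine = array[0]

async def main():
    async with engine:
        embeddings, usage = await engine.embed(sentences=["Embed this on a TPU."])
        print(len(embeddings), usage)

asyncio.run(main())
```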