Add support for locking kernels #10

danieldk · 2025-01-20T13:58:14Z

hf-kernels lock . locks the kernels specified in a project's pyproject.toml. Building a package with the setuptools build backend, which is the default for pyproject.toml, will add the lock file to the package's metadata. A kernel can then be downloaded and loaded at the locked version using get_locked_kernel.
hf-kernels download . downloads the locked kernels to the HF cache directory. The kernel can then be loaded using load_kernel (which is a small wrapper for get_locked_kernel).

PR uploaded as hf-kernels 0.1.1 to PyPI for testing.

In the future, we want to be able to specify versions in pyproject.toml (which then get locked), but that's for a later PR.

This change allows Python projects that use kernels to lock the kernel revisions on a per-project basis. For this to work, the user only has to include `hf-kernels` as a build dependency. During the build, a lock file is written to the package's pkg-info. At runtime we can read it out and use the corresponding revision. When the kernel is not locked, the revision that is provided as an argument is used.
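The fallback rule described above (use the locked revision if there is one, otherwise the revision passed as an argument) can be sketched roughly as follows. This is only an illustration: the lock format shown here (a JSON list of objects with `repo_id` and `sha` keys) and the function name `resolve_revision` are assumptions, not the actual hf-kernels lock file schema or API.

```python
import json
from typing import Optional


def resolve_revision(lock_text: Optional[str], repo_id: str, requested_revision: str) -> str:
    """Pick the revision to load for `repo_id`.

    `lock_text` is the lock file content read from the package metadata,
    or None when the package was built without a lock file.
    The JSON layout used here is hypothetical.
    """
    if lock_text is None:
        # No lock file was packaged: use the caller's revision.
        return requested_revision

    for entry in json.loads(lock_text):
        if entry["repo_id"] == repo_id:
            # The kernel is locked: the locked commit wins.
            return entry["sha"]

    # A lock file exists, but this kernel is not in it.
    return requested_revision
```

A project that locks `kernels-community/activation` would then always resolve to the locked commit, while an unlocked kernel keeps whatever revision the caller asked for.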

src/hf_kernels/cli.py

Narsil · 2025-01-20T15:16:57Z

src/hf_kernels/utils.py

+    if locked_sha is None:
+        raise ValueError(f"Kernel `{repo_id}` is not locked")
+
+    package_name, package_path = install_kernel(


load_kernel is supposed to ignore entirely the download. This reintroduces it.

hf_hub_download(..., local_files_only=True) is the only public API I found to IGNORE downloading.

If I don't have the model cached and use load_kernel, I get the following error:

% python -c 'import kernel_test; kernel_test.run()' [...] FileNotFoundError: [Errno 2] No such file or directory: '/scratch/daniel/.cache/huggingface/hub/models--kernels-community--activation/snapshots/a71853ecbdd899526f9810cc558ee24081a6302e/build/torch25-cxx98-cu124-x86_64-linux/activation/__init__.py'

The error could be better, but it doesn't seem to download?
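As a rough idea of what a clearer error could look like, here is a sketch that checks the cache on disk without touching the network. The directory layout is inferred from the path in the traceback above; the function name and its parameters are hypothetical and are not part of hf-kernels or huggingface_hub.

```python
from pathlib import Path


def cached_kernel_init(cache_dir: Path, repo_id: str, sha: str, variant: str, package: str) -> Path:
    """Return the path to the cached kernel's __init__.py, or fail with context.

    Assumed layout (taken from the traceback, not from an official API):
    <cache_dir>/models--<org>--<name>/snapshots/<sha>/build/<variant>/<package>/__init__.py
    """
    repo_dir = cache_dir / ("models--" + repo_id.replace("/", "--"))
    init_py = repo_dir / "snapshots" / sha / "build" / variant / package / "__init__.py"

    if not init_py.is_file():
        # Name the kernel, revision and build variant instead of only a raw path.
        raise FileNotFoundError(
            f"Kernel `{repo_id}` at revision {sha} has no cached build for "
            f"variant `{variant}` (looked for {init_py}). "
            "Run `hf-kernels download .` first."
        )
    return init_py
```

Since this only inspects the filesystem, it cannot reintroduce a download by accident.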

I did forget to pass through local_files_only to get_metadata. Pushing a fix for that now. Then it fails even earlier:

% python -c 'import kernel_test; kernel_test.run()' [...] huggingface_hub.errors.LocalEntryNotFoundError: Cannot find the requested files in the disk cache and outgoing traffic has been disabled. To enable hf.co look-ups and downloads online, set 'local_files_only' to False.

What I mean is: let's keep the actual local_files_only code path separate. Otherwise it's super easy to screw up and reintroduce the internet connection. (Better yet if we could sidestep that bad API altogether.)

Ok, made separate again.

Narsil · 2025-01-20T15:32:54Z

src/hf_kernels/lockfile.py

+
+    file_locks = []
+    for sibling in r.siblings:
+        if sibling.rfilename.startswith("build/torch"):


Shouldn't we filter by version here too? (Maybe in a subsequent PR?)

The lockfile should contain all build variants, because we don't know what Torch/CUDA version a downstream user will have.

Or did you mean kernel version? If so, one particular commit is only supposed to have one version.

Yes I meant kernel version/version range.
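To illustrate why the lock file keeps every build variant, here is a small sketch of the `build/torch` prefix filter applied to plain file names and grouped by variant directory. It works on a list of strings rather than on the `siblings` objects from the diff, and the helper name `group_build_variants` is made up for this example.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def group_build_variants(filenames: Iterable[str]) -> Dict[str, List[str]]:
    """Group repository files by build variant directory.

    Only paths under build/torch* are kept, mirroring the prefix check
    in the diff. The variant is the directory directly below build/.
    """
    variants: Dict[str, List[str]] = defaultdict(list)
    for name in filenames:
        if not name.startswith("build/torch"):
            continue  # not part of a build variant (README, sources, ...)
        parts = name.split("/")
        if len(parts) < 3:
            continue  # a file directly under build/, not inside a variant dir
        variants[parts[1]].append(name)
    return dict(variants)
```

Every Torch/CUDA/ABI combination ends up as its own key, so a downstream user can pick whichever variant matches their environment at load time.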

Narsil · 2025-01-20T15:34:09Z

src/hf_kernels/lockfile.py

+        return
+
+    lock_path = cwd / "hf-kernels.lock"
+    if not lock_path.exists():


Isn't that always true?

How can the lockfile exist before it's created?

I think it could happen in an editable install:

1. Create the lockfile.

2. Do an editable install. (The lockfile gets written into the package info.)

3. Remove the lockfile.

4. Do an editable install.

Though I still need to check whether this is the case.

Forgot to add, this checks the existence of the lock file in the project's source directory, which should exist prior to running the install if you want to lock the versions.

danieldk added 7 commits January 20, 2025 13:44

Generate lock files with hf-lock-kernels, copy to egg

085548d

Various improvements

4253af5

Name CLI hf-kernels, add download subcommand

7ae9c37

hf-kernels.lock

b65609c

Bump version to 0.1.1

5ca1a05

Use setuptools for testing the wheel

37be1e9

danieldk force-pushed the kernels-lock branch from c5266da to 37be1e9 Compare January 20, 2025 14:01

Narsil reviewed Jan 20, 2025

Narsil reviewed Jan 20, 2025

danieldk added 2 commits January 20, 2025 15:22

Factor out tomllib module selection

aa2afc1

Pass through local_files_only in get_metadata

38c02cb

Narsil reviewed Jan 20, 2025

danieldk added 3 commits January 20, 2025 15:44

Do not reuse implementation in load_kernel

f63689f

The tests install hf-kernels from PyPI, should be local

d414012

docker: package is in subdirectory

2ae0e67

Narsil approved these changes Jan 21, 2025

danieldk merged commit 544354c into main Jan 21, 2025
3 checks passed

danieldk deleted the kernels-lock branch January 21, 2025 15:08
