Add cached evaluation of tensor networks by DNA386 · Pull Request #224 · Quantinuum/lambeq

DNA386 · 2025-03-07T18:21:49Z

Add caching option by default for models that evaluate tensor networks, as per #185
Also bonus include the Sim9 and Sim9Cx ansatze from the inTask repo.

A non-cached option is also included in case the circuits are small enough that caching introduces unnecessary overhead, but must be actively selected by the user. By default this will reproduce the previous behaviour.
The user can select this as a kwarg when initialising the model. Note: the pathfinder must also be provided as a kwarg if resuming from a checkpoint - the checkpoint currently will contain the saved paths, but not the information required to re-build the same fallback path finding algorithm.

By default, the paths will only be cached in memory. The user can chose to either save them to the checkpoint, or to a separate file by supplying a configured CachedTnPathOptimizer instance.

neiljdo

Hi @DNA386 thank you for your work! 🙌🏼 I just came back so I can only do a surface-level review at the moment - kindly take a look at my initial notes. I'll have a closer look soon.

One thing we would like to have are tests specific to the classes/functions implemented in the `lambeq/training/saved_tn_optimizer.py'.

lambeq/training/saved_tn_optimizer.py

dimkart · 2025-03-13T15:53:38Z

We can make this configurable (i.e. enable/disable storing the paths in the checkpoints somehow).

This initial proposal currently saves cached paths to the model checkpoint, but this is not desirable if the datasets are very large as this will result in a lot of duplicated information (assuming checkpoints are saved per epoch). The paths can typically also be reused when training a different iteration of a model on the same dataset (with the same anstaz), which this system doesn't capture. A more natural way to implement is to have an independent checkpointing system for the paths, that overwrite the previous checkpoint each time a new path is added, however this means the user needs to specify and track the checkpoint filepath separately from the model. In this case, it would not be possible to have caching by default; the user would need to explicitly opt-in by providing a configured TnOptimizer.

lambeq/training/cached_tn_path_optimizer.py

dimkart · 2025-04-03T08:49:15Z

@neiljdo Let's run our benchmarks on this PR and compare speeds with current branch.

lambeq/training/pytorch_model.py

includes merging PytorchQuantumModel with PytorchModel so they can share the cached tn eval.

add explicit opt-einsum dependency in workflow

dimkart

This is great work, ~~the only change at this point (@neiljdo can do this) would be to make the old behaviour the default, so users can opt-in for the caching if they want.~~

We ultimately stuck with the caching path optimizer as the default.

* main: Fix no-, multiple-root prediction for parent predictions (Quantinuum#239) Add Oncilla to CLI (Quantinuum#238) Make SplitTensorAnsatz to deal with boxes with domains (Quantinuum#228) Fix bug with untokenised input sentences for `OncillaParser` (Quantinuum#235) Make `DisCoCircReader` compatible with `OncillaParser` (Quantinuum#234)

dimkart changed the title ~~Cached tn eval~~ Add cached evaluation of tensor networks Mar 11, 2025

neiljdo reviewed Mar 12, 2025

View reviewed changes

DNA386 force-pushed the cached-tn-eval branch 2 times, most recently from d6e6035 to e45d4b4 Compare March 31, 2025 16:08

DNA386 marked this pull request as ready for review March 31, 2025 16:09