
Commit 576b27d

[NPUW] Update NPUW AoT caching docs (#31628)
Co-authored-by: Tatiana Savina <[email protected]>
1 parent 92be6a1 commit 576b27d

1 file changed, +1 -0 lines changed

docs/articles_en/openvino-workflow-generative/inference-with-genai/inference-with-genai-on-npu.rst

Lines changed: 1 addition & 0 deletions
@@ -289,6 +289,7 @@ Specifying ``EXPORT_BLOB`` and ``BLOB_PATH`` parameters works similarly to ``CAC
 * To export a blob with weights you need to pass ``"CACHE_MODE" : "OPTIMIZE_SPEED"`` in the config.
 * If the blob is exported as weightless you also need to either provide
   ``"WEIGHTS_PATH" : "path\\to\\original\\model.bin"`` or ``"MODEL_PTR" : original ov::Model object``.
+* Ahead-of-time import in weightless mode has been optimized to consume less memory than during regular compilation or using ``CACHE_DIR``.

 .. tab-set::

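For readers reviewing this change, the export and import settings named in the diff context can be sketched as plain config dictionaries. This is a minimal illustration, not code from the commit: the helper names ``export_config`` and ``import_config`` are hypothetical, the ``"YES"`` value for ``EXPORT_BLOB`` is an assumption, and the ``MODEL_PTR`` alternative is left out because it takes an ``ov::Model`` object rather than a string.

```python
def export_config(blob_path, with_weights=False):
    """Config for the run that compiles the model and writes the blob.

    Hypothetical helper; the "YES" value for EXPORT_BLOB is an assumption.
    """
    config = {"EXPORT_BLOB": "YES", "BLOB_PATH": blob_path}
    if with_weights:
        # Per the documented bullet: export a blob that includes weights.
        config["CACHE_MODE"] = "OPTIMIZE_SPEED"
    return config


def import_config(blob_path, weights_path=None):
    """Config for later runs that import the previously exported blob.

    Hypothetical helper; pass weights_path when the blob is weightless.
    """
    config = {"BLOB_PATH": blob_path}
    if weights_path is not None:
        # A weightless blob needs the original weights file at import time.
        config["WEIGHTS_PATH"] = weights_path
    return config


# Placeholder paths, for illustration only.
first_run = export_config("compiled_model.blob")
later_run = import_config("compiled_model.blob", weights_path="model.bin")
```

Either dictionary would then be passed as the pipeline configuration when targeting the NPU device. The weightless import path (``later_run`` above) is the one the added bullet describes as consuming less memory.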