Disable CPU helper in AUTO when the model is LLM #29233

wgzintel · 2025-03-03T01:30:39Z

Disable CPU helper in AUTO when the model is LLM.
Fix the error "Not Implemented" when setting device to AUTO to run model InternVL2 on machines with more than two GPUs

tickets: CVS-160732

src/plugins/auto/src/plugin.cpp

…into guozhong/disable_cpu_helper

ilya-lavrenov · 2025-03-13T07:20:11Z

src/plugins/auto/src/plugin.cpp

    if (model_path.empty()) {
        support_devices = filter_device_by_model(support_devices_by_property, model, load_config);
+        is_LLM_model = ov::op::util::is_large_language_model(*model);


is it LLM specific issue or any model which has states?

@ilya-lavrenov State model has been handle here

openvino/src/plugins/auto/src/plugin.cpp

Lines 906 to 919 in 5dba745

std::vector<std::string> stateful_node_names;

for (auto& op : model->get_ops()) {

if (ov::as_type_ptr<ov::op::util::AssignBase>(op) ||

ov::as_type_ptr<ov::op::util::ReadValueBase>(op)) {

stateful_node_names.push_back(op->get_friendly_name());

}

}

if (stateful_node_names.empty()) {

// not stateful model

return meta_devices;

}

// disable CPU_HELP and runtime fallback if model is stateful

disable_startup_runtime_fallback();

and it is being updated in https://github.com/openvinotoolkit/openvino/pull/27019/files#diff-85029a9232410831627d7b7b225e30d2cd4879e55b4abda5c475d04efb12daddL837-R859

Here handle the LLM model only.

@wgzintel Why this is not handled in filter_device_by_model ?

Updated and handled in filter_device_by_model.

yangwang201911 · 2025-03-20T02:37:35Z

src/plugins/auto/src/plugin.cpp

+        if (is_LLM_model) {
+            // disable cpu helper and runtime_fallback when the model is LLM, only one device need to compile model
+            disable_startup_runtime_fallback(load_config);
+        }


consider below code lines to replace the changed lines here.
auto m_model = model_path.empty() ? model : get_core()->read_model(mode_path, std::string{}, {}); and support_devices = filter_device_by_model(support_devices_by_property, m_model, load_config);

Disable CPU helper in AUTO when the model is LLM

03810de

github-actions bot added the category: AUTO OpenVINO AUTO device selection plugin label Mar 3, 2025

wgzintel requested review from yangwang201911, songbell, peterchen-intel and wangleis March 3, 2025 01:33

yangwang201911 reviewed Mar 3, 2025

View reviewed changes

ilya-lavrenov requested changes Mar 3, 2025

View reviewed changes

src/plugins/auto/src/plugin.cpp Outdated Show resolved Hide resolved

wgzintel added 2 commits March 3, 2025 22:42

Merge branch 'master' of https://github.com/openvinotoolkit/openvino …

a6906e3

…into guozhong/disable_cpu_helper

move get_optimum_intel_version to a common API

0618583

github-actions bot added the category: transformations OpenVINO Runtime library - Transformations label Mar 3, 2025

wgzintel added 2 commits March 4, 2025 15:37

Merge branch 'master' of https://github.com/openvinotoolkit/openvino …

0b321b0

…into guozhong/disable_cpu_helper

use is_large_language_model to match LLM

f0380c4

github-actions bot removed the category: transformations OpenVINO Runtime library - Transformations label Mar 4, 2025

ilya-lavrenov self-requested a review March 4, 2025 08:24

resolve conflict

04212be

wgzintel force-pushed the guozhong/disable_cpu_helper branch from b5d71a7 to 04212be Compare March 5, 2025 15:10

Merge branch 'master' into guozhong/disable_cpu_helper

6420d0b

wgzintel marked this pull request as ready for review March 10, 2025 01:53

wgzintel requested a review from a team as a code owner March 10, 2025 01:53

wgzintel added 6 commits March 12, 2025 14:15

Merge branch 'master' into guozhong/disable_cpu_helper

eac23ab

Fix the error Not Implemented

46131fd

Add comments

9f74701

Merge branch 'master' into guozhong/disable_cpu_helper

d740d0f

Merge branch 'master' into guozhong/disable_cpu_helper

6323789

Merge branch 'master' into guozhong/disable_cpu_helper

ea3a8c3

ilya-lavrenov reviewed Mar 13, 2025

View reviewed changes

wgzintel added 3 commits March 14, 2025 10:03

Merge branch 'master' into guozhong/disable_cpu_helper

7c89221

Merge branch 'master' into guozhong/disable_cpu_helper

ddf2cca

Merge branch 'master' into guozhong/disable_cpu_helper

62f0427

wgzintel added 3 commits March 19, 2025 09:54

Merge branch 'master' into guozhong/disable_cpu_helper

c584b08

Merge branch 'master' into guozhong/disable_cpu_helper

74959c2

LLM model handled in filter_device_by_model when model path is empty

5441b0f

yangwang201911 reviewed Mar 20, 2025

View reviewed changes

wgzintel and others added 2 commits March 21, 2025 09:56

Merge branch 'master' into guozhong/disable_cpu_helper

af5e285

optimize the code

6e928a3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disable CPU helper in AUTO when the model is LLM #29233

Disable CPU helper in AUTO when the model is LLM #29233

wgzintel commented Mar 3, 2025 •

edited by peterchen-intel

Loading

ilya-lavrenov Mar 13, 2025

peterchen-intel Mar 15, 2025

wgzintel Mar 20, 2025

yangwang201911 Mar 20, 2025

wgzintel Mar 21, 2025

	std::vector<std::string> stateful_node_names;
	for (auto& op : model->get_ops()) {
	if (ov::as_type_ptr<ov::op::util::AssignBase>(op) \|\|
	ov::as_type_ptr<ov::op::util::ReadValueBase>(op)) {
	stateful_node_names.push_back(op->get_friendly_name());
	}
	}
	if (stateful_node_names.empty()) {
	// not stateful model
	return meta_devices;
	}

	// disable CPU_HELP and runtime fallback if model is stateful
	disable_startup_runtime_fallback();

Disable CPU helper in AUTO when the model is LLM #29233

Are you sure you want to change the base?

Disable CPU helper in AUTO when the model is LLM #29233

Conversation

wgzintel commented Mar 3, 2025 • edited by peterchen-intel Loading

ilya-lavrenov Mar 13, 2025

Choose a reason for hiding this comment

peterchen-intel Mar 15, 2025

Choose a reason for hiding this comment

wgzintel Mar 20, 2025

Choose a reason for hiding this comment

yangwang201911 Mar 20, 2025

Choose a reason for hiding this comment

wgzintel Mar 21, 2025

Choose a reason for hiding this comment

wgzintel commented Mar 3, 2025 •

edited by peterchen-intel

Loading