Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR introduces a Kubernetes operator that enables declarative management of Eino applications on Kubernetes. The operator automates deployment, scaling, and lifecycle management of LLM orchestrations built with the Eino framework.
What type of PR is this?
feat: A new feature
Check the PR title.
(Optional) Translate the PR title into Chinese.
feat(operator): 添加用于 EinoChainApp 管理的 Kubernetes 操作器
(Optional) More detailed description for this PR(en: English/zh: Chinese).
en:
This commit introduces a Kubernetes operator for managing EinoChainApp custom resources. Key features include:
eino_token_per_sec_avg_per_podmetric, optimized for LLM workload performanceThe operator enables users to declaratively define and manage Eino applications on Kubernetes with intelligent autoscaling based on token processing throughput, a key performance indicator for LLM workloads.
zh(optional):
此提交为管理 EinoChainApp 自定义资源引入了 Kubernetes 操作器。主要功能包括 EinoChainApp CRD 定义、自动化资源管理、基于令牌处理吞吐量的自定义自动扩缩容,以及全面的测试覆盖。
(Optional) Which issue(s) this PR fixes:
(optional) The PR that updates user documentation: