- 
                Notifications
    You must be signed in to change notification settings 
- Fork 147
refactor(ava/advisor): init new advisor pipeline & built-in modules to plugin-based architecture #784
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: refactor-advisor-pipeline
Are you sure you want to change the base?
Conversation
…o plugin-based architecture
6b28516    to
    716ab18      
    Compare
  
    716ab18    to
    85cc0ce      
    Compare
  
            
          
                packages/ava/src/advisor/advise-pipeline/plugin/presets/spec-generator/charts/area.ts
          
            Show resolved
            Hide resolved
        
      | dataProps, | ||
| data: filteredData, | ||
| chartTypeRecommendations: [{ chartType: 'pie_chart' }], | ||
| }); | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
只看调用。看不出来 .dataAnalyzer 和 .specGenerator 和 adviceAsnyc 是 pipeline 的关系。对 execute 有疑惑 🤔
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
修改为不从 advisor 直接调用内部环节,而是从 pipeline 的方法上调用,每个 component 可以从 pipeline 上拿到并单独调用,如下示例:
 // 单独使用 pipeline 中的部分环节
    const dataAnalyzer = myChartAdvisor.pipeline.getComponent(PresetComponentName.dataAnalyzer)
 dataAnalyzer.execute({...})
// 之后增加 pipe 方法,dataAnalyzer.pipe(specGenerator).execute() ...
| export type AdvisorPipelineContext = { | ||
| /** 原始数据 */ | ||
| data?: Data; | ||
| advisor: Advisor; | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
pipeline 流转过程中 advisor 也是可选的?它一定有吗?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
一定有的
| type?: 'async' | 'sync'; | ||
| execute: (data: Input, context: AdvisorPipelineContext) => Output | Promise<Output>; | ||
| /** 判断插件运行的条件 */ | ||
| condition?: (data?: Input, context?: AdvisorPipelineContext) => boolean | Promise<boolean>; | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
data 也属于 ctx 是不是不用放第一个 args 呢?上面 exec 同理。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这里的 data 是各环节 component 自己的 input, context 里原来是整个 pipeline 开头的原始数据,不一样
这个问题可能和下面哪个 pipeline context 里是否加上 input 和 output 一起看
| const chatTypes = Object.keys(chartWIKI); | ||
| const list: ScoringResultForChartType[] = chatTypes.map((chartType) => { | ||
| return scoreRules(chartType, chartWIKI, dataProps, ruleBase, options); | ||
| return scoreRules(chartType, chartWIKI, dataProps, ruleBase, options, advisorContext); | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
看到了个之前遗留的不统一的点:chartWIKI 是不是要小驼峰?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| export type ChartTypeRecommendOutput = { chartTypeRecommendations: ScoringResultForChartType[] }; | ||
|  | ||
| export type SpecGeneratorInput = { | ||
| // todo 实际上不应该需要 score 信息 | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
所以 score 是不是 for 中间结果评测的?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
是的, score 模块不应该影响 spec 生成的环节,这个后面还要再拆
| export type PipelineStageType = 'dataAnalyze' | 'chartTypeRecommend' | 'encode' | 'specGenerate'; | ||
|  | ||
| /** 基础插件接口定义 */ | ||
| export interface AdvisorPluginType<Input = any, Output = any> { | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
我有一个大胆的想法,如果 advice pipeline 是确定的,有几个 component 也是确定的。这条 pipeline 是不是都属于一个 ctx,那这条链路上的 input 和 output 其实都是从当前的 ctx 中获得,而不用区分?也就是说这里的 Input 和 Output 其实都是 ctx。况且看下面 stage 还有多个的情况。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
意思是说,pipeline context 里加上 input 和 output 属性么,这两个属性根据当前执行的环节进行更新?
每个环节的 input 和 output 可以挂到 ctx 上,不过前置环节的应该都需要挂上去,不能只挂当前环节的🤔️
每个 component 和 plugin 用类型约束,是想说用户在自定义自己的 plugin 时,能够明确需要的类型,都放 pipeline 感觉类型不好约束了?
|  | ||
| components.forEach((component) => { | ||
| if (!component) return; | ||
| this.componentsManager.tapPromise(component.name, async (previousResult) => { | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
看起来 type?: 'async' | 'sync'; 并没有区分处理?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
|  | ||
| import type { Specification } from '../../common/types'; | ||
| import type { ScoringResultForRule, Lint } from '../types'; | ||
| import type { ScoringResultForRule, Lint, AdvisorPipelineContext } from '../types'; | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lint 和 advice 现在的关系是什么样的?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lint 之前是在 spec generator 环节被内置调用的,保持原用法不变的话,它还是归属在 advise 的一环
另外,lint 也可看作一个独立的 pipeline,可以单独调用
        
          
                packages/ava/src/advisor/advisor.ts
              
                Outdated
          
        
      | pipelineComponent.registerPlugin(plugin); | ||
| return; | ||
| } | ||
| if (size(plugin.stage) === 1) { | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
所以当前还没有多 stage?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
我先拿掉了 stage 的概念,plugin 挂到 component 上,各 component 直接在 pipeline 中串行执行,好像 stage 不是必须的🤔️
| // todo 内置的 visualEncode 和 spec generate 插件需要明确支持哪些图表类型 | ||
| export const specGeneratorPlugin: AdvisorPluginType<SpecGeneratorInput, SpecGeneratorOutput> = { | ||
| name: 'defaultSpecGenerator', | ||
| stage: ['specGenerate'], | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
如果注册多个 plugin 作用于同一个 stage 它的表现会如何,顺序是怎么管理的?
问这个问题是看到下面操作生成 spec 其实也是有一个生成 or 操作顺序的,那么其实可以拆多个 plugin 实现,实现业务自定义 theme 之类的需求
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
|  | ||
| constructor( | ||
| config: AdvisorConfig = {}, | ||
| custom: { | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
当前仅 constructor 输入 custom 不知道是否支持后续 实例动态注册?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
需要的,后续打算在 advisor 上增加 updateCkb, updateRuleBase, resigterComponents 等 api
| /** extra info to pass through the pipeline | ||
| * 额外透传到推荐 pipeline 中的业务信息 | ||
| */ | ||
| extra?: Record<string, any>; | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
同理,extra 直接放 ctx 上?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
是放在 context 上的,这里是 constructor 时进行初始化传参
| return Promise.resolve(this.execute(params)); | ||
| } | ||
| const pluginsOutput = {}; | ||
| return Promise.all( | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
promise.all 是并行执行的,上面的 forEach 是顺序调用的,不知道对结果有没有什么影响?🤔
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
上面是同步执行的,所以直接 forEach 了,这里异步不考虑顺序关系,如上评论,后续 component 若需要扩展出串行类型,再修改这里的执行逻辑
        
          
                packages/ava/src/advisor/advisor.ts
              
                Outdated
          
        
      |  | ||
| private initDefaultComponents() { | ||
| this.dataAnalyzer = new BaseComponent('data', { plugins: [dataProcessorPlugin], context: this.context }); | ||
| this.chartTypeRecommender = new BaseComponent('chartType', { | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
encode 和 generate 都是以动词结尾的,这个 chartType 是一整个名次,是不是叫 category 之类的包含 “分类” 含义好一些?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
已修改,收在 enum PresetComponentName 下
/** 内置默认模块名 */
export enum PresetComponentName {
  dataAnalyzer,
  chartTypeRecommender,
  chartEncoder,
  specGenerator,
}
…ents and pipeline



advisor pipeline 重构为插件式的结构:
Pipeline默认由4个阶段组成(数据处理, 图表类型推荐, 图表视觉通道映射, G2 spec 生成),每个阶段对应一个Component来处理plugins实现,通过自定义注册 plugins,实现自定义的对应环节的数据处理或推荐逻辑。每个 component 中的多个 plugins 并行执行,在afterPluginsExecutehook 中处理多个插件的结果。期望通过这样的处理,可以解耦原 adise 函数,灵活选择性地使用和定制图表推荐的部分环节。该 PR 是第一部分,初始化整体框架,拆分出 encode 和 spec generate 环节见后续 PR: WIP: refactor(ava/advisor): separate visual encodingfrom spec generator module #785
refactor advisor pipeline:
Pipelineinto four stage: data analyze, chart type recommend, encode, spec generate. Each stage is processed by aComponentComponentcould have multipleplugins, these plugins are executed in parallel. The outputs of the plugins could be handled inafterPluginsExecutehook.