[Question] [Usage]: Which DeepEP mode is MoriEP equivalent to: throughput or low_latency? #57

@tjtanaavllm

I saw that MoriEP supports intra-node and inter-node ops, and I have a few questions:

  1. Which DeepEP mode is MoriEP equivalent to: throughput or low_latency?
  2. Is MoriEP beneficial for non-PD inference?
  3. In PD inference, is MoriEP suitable for the Prefill instance, the Decode instance, or both?

Labels: question (Further information is requested)
