Skip to content

Evaluation on Objective Benchmarks #40

@jingmingzhuo

Description

@jingmingzhuo

I think this work is meaningful and provide remarkable results. However, I find all the test benchs are subjective benchs which outputs are judged by LLMs. Have you tried using MoA for objective tasks such as MMLU or MATH? I think this could make MoA even more valuable. Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions