Problems with separate evaluations for real-user data and LLM-simulated data #33

@yuanwuyuan9

  • Cause: The real-user data and the LLM-simulation data use inconsistent JSON formats.
  • Solution: Implemented two separate processing pipelines:
    • Pipeline 1 (For Real Users):
      • coverage_metrics.py
      • cohesion_metrics.py
      • main_evaluator.py
    • Pipeline 2 (For LLM Simulation):
      • coverage_metrics_llm.py
      • cohesion_metrics_llm.py
      • main_evaluator_llm.py
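
As a rough illustration of how the two pipelines above might be kept apart, here is a minimal sketch that maps a data source to the set of scripts listed for it. The file names come from the list above; the `PIPELINES` table, the source labels `"real_user"` and `"llm_simulation"`, and the `select_pipeline` helper are hypothetical names made up for this example and are not part of the repository.

```python
from typing import Dict, List

# Hypothetical lookup table: the keys are illustrative labels,
# the values are the script names listed in this issue.
PIPELINES: Dict[str, List[str]] = {
    "real_user": [
        "coverage_metrics.py",
        "cohesion_metrics.py",
        "main_evaluator.py",
    ],
    "llm_simulation": [
        "coverage_metrics_llm.py",
        "cohesion_metrics_llm.py",
        "main_evaluator_llm.py",
    ],
}


def select_pipeline(source: str) -> List[str]:
    """Return the script names for the given data source (hypothetical helper)."""
    if source not in PIPELINES:
        raise ValueError(
            f"Unknown data source {source!r}; expected one of {sorted(PIPELINES)}"
        )
    # Return a copy so callers cannot mutate the table by accident.
    return list(PIPELINES[source])


if __name__ == "__main__":
    for name in select_pipeline("llm_simulation"):
        print(name)
```

Failing loudly on an unknown source is a deliberate choice here: since the two JSON formats are incompatible, silently falling back to the wrong pipeline would likely produce misleading metrics.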
