Mentor Worker Benchmark v1.2.3

@dicnunz dicnunz released this 10 May 06:59

Adds the Agent Browser Operator OS self-serve support route to the README, the mentor-worker-benchmark support command, the package project URLs, the GitHub funding metadata, the generated leaderboard docs, and the support tests.

Validation:

  • .venv/bin/python -m pytest tests -q
  • .venv/bin/python -m pytest tests/test_provider_cli.py tests/test_community_leaderboard_builder.py tests/test_submission.py -q
  • .venv/bin/python -m mentor_worker_benchmark.tasks.task_pack_v1.validate
  • .venv/bin/python -m mentor_worker_benchmark.tasks.task_pack_v2.validate
  • .venv/bin/python -m mentor_worker_benchmark sanity --task-pack task_pack_v2 --suite quick --seed 1337
  • PYTHONPATH=. .venv/bin/python scripts/build_community_leaderboard.py --strict
  • .venv/bin/python -m mentor_worker_benchmark support
  • uv build --clear --no-create-gitignore
  • git diff --check
  • GitHub Actions CI passed: https://github.com/dicnunz/mentor-worker-benchmark/actions/runs/25622299680
  • GitHub Pages deployment passed: https://github.com/dicnunz/mentor-worker-benchmark/actions/runs/25622299422
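
The local checks above can be chained into one script. The following is a minimal sketch, not part of the release: the `run` and `validate_release` function names and the `DRY_RUN` variable are hypothetical, and it assumes the repository root as the working directory with `.venv` already created. The commands themselves are copied from the list above. By default it only prints each command; set `DRY_RUN=0` to execute them, stopping at the first failure.

```shell
#!/usr/bin/env bash
# Sketch only: chains the local validation commands from the release notes.
# Assumptions (not from the release): helper names, DRY_RUN, repo-root cwd.
set -euo pipefail

PY=".venv/bin/python"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print commands only, 0 = actually run them

# Print the command, then execute it unless in dry-run mode.
run() {
  echo "+ $*"
  if [ "$DRY_RUN" != "1" ]; then
    "$@"
  fi
}

# Run every local check in the order given in the release notes.
# With DRY_RUN=0, set -e aborts on the first failing command.
validate_release() {
  run "$PY" -m pytest tests -q
  run "$PY" -m pytest tests/test_provider_cli.py tests/test_community_leaderboard_builder.py tests/test_submission.py -q
  run "$PY" -m mentor_worker_benchmark.tasks.task_pack_v1.validate
  run "$PY" -m mentor_worker_benchmark.tasks.task_pack_v2.validate
  run "$PY" -m mentor_worker_benchmark sanity --task-pack task_pack_v2 --suite quick --seed 1337
  run env PYTHONPATH=. "$PY" scripts/build_community_leaderboard.py --strict
  run "$PY" -m mentor_worker_benchmark support
  run uv build --clear --no-create-gitignore
  run git diff --check
}

validate_release
```

The CI and Pages entries are left out because they are results of remote GitHub Actions runs, not commands that can be run locally.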

Boundaries: self-serve browser, account, and public-action control templates only. Not included: Chrome plugin repair, guaranteed automation, account access, custom setup, calls, or posting without human approval.