Repositories list
12 repositories
terminal-bench-science
Public
Terminal-Bench-Science: Evaluating AI Agents on Complex Real-World Scientific Workflows in the Terminal
Harbor is a framework for running agent evaluations and creating and using RL environments.
t-bench-docs
Public
terminal-bench-3
Public
benchmark-template
Public template
terminal-bench-2
Public
harbor-docs
Public
skills
Public
harbor-cookbook
Public
awesome-harbor
Public
terminal-bench
Public