-
Notifications
You must be signed in to change notification settings - Fork 1.9k
SWE Agent for solving SWEBench and TerminalBench #559
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
tpryan
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Couple nits. I am pulling in a python reviewer to look at the actual code.
happyhuman
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
1- Please run pylint on all the python files.
2- Please choose a less generic project name other than swe_agent (which could mean anything). Also the agent names use - instead of _.
|
Thanks for all the changes. Can we rename this sample to something less generic. |
Thanks for the reviews @happyhuman, I've renamed the agent to |
This PR introduces the SWE Agent, a new autonomous agent designed to solve software engineering problems from the SWE-bench and TerminalBench benchmarks.
The agent operates autonomously within a Dockerized environment, it has a extensible toolset providing basic functionality for solving software engineering tasks.
For detailed information on architecture, setup, testing, and evaluation, please refer to the README.md file included in the agent's directory.