


The open source version of AI programmers is here: GPT-4 blessing, ability comparable to Devin, 1.4k Stars a day
To learn more about AIGC, please visit:
51CTO AI.x Community
https://www.51cto.com/ aigc/
Recently, many people are worried about AI replacing their jobs.
Devin, the "first AI programmer" who became popular in the AI circle last month, has mastered full-stack skills by using large model capabilities. He only needs humans to give natural language instructions. Automate complex coding tasks.
The tool capabilities demonstrated by Devin are very amazing, especially for this startup company that takes the closed source route. Currently, only a few people can use this closed beta quota.
On Tuesday, researchers from the Princeton University NLP Group released SWE-agent, an open source version of the AI programmer, which received thousands of GitHub stars in less than a day. . This SWE-agent is based on deep learning technology and can automatically write efficient and reliable code. His release attracted widespread attention, and many developers expressed high recognition of his technology and performance. These achievements also prove the advancement of AI research in the field of NLP
SWE-agent is a new system for autonomously solving problems in GitHub repositories. It achieved similar accuracy to Devin on SWE-bench, taking an average of 93 seconds.
- Project website: https://swe-agent.com/
- GitHub :https://github.com/princeton-nlp/SWE-agent
John Yang, the author of the project, said that preprints of related papers The version will also be uploaded on April 10th.
In principle, SWE-agent can fix bugs and issues in real GitHub repositories by turning large models (such as GPT-4) into software engineering agents.
On the complete SWE-bench test set, SWE-agent solved 12.29% of the problems and achieved SOTA performance.
To provide automation during development, SWE-agent works by interacting with a dedicated terminal, which can open, search file contents, use automatic Syntax check, edit specific lines, and also write and execute tests.
The developers of this project carefully designed the UI interface and introduced it on GitHub.
Agent-Computer Interface (ACI)
The research team designed simple Large Model (LM)-centric commands and feedback format that enables large models to more easily browse repositories, view, edit, and execute code files, known as the Agent-Computer Interface (ACI). The research team also built a SWE agent repository to easily iterate on ACI designs of repository-level coded agents.
Just like language models require good prompt engineering, good ACI design will lead to better results when using agents. The baseline agent without well-tuned ACI performs much worse than the SWE-agent.
SWE-agent contains features that the research team found to be very useful during the design of the agent-computer interface, including:
1. Add a linter that runs when an edit command is issued and won't let the edit command go through if the code syntax is incorrect.
2. Provide the agent with a purpose-built file viewer. The research team found that this file viewer works best when it displays only 100 lines per round, and that the file editor has commands for scrolling up and down and performing searches within the file.
3. Provide specially built directory-wide string search commands for agents. The research team found it important that the tool lists matches succinctly—just list every file that has at least one match. The study showed that showing the model more context about each match would be too confusing for the model.
4. When the output of the command is empty, return a message: "Your command ran successfully, but did not produce any output."
Future published papers will detail more information.
Installation and use
To use SWE-agent, you must first set the following conditions:
1. Install Docker , and start Docker locally;
2. Install Miniconda, and use conda env create -fenvironment.yml to create the swe-agent environment;
3. Use conda activate swe-agent to activate;
4. Run ./setup.sh to create the swe-agent docker image;
5. Create a keys.cfg file in the root directory of this repository and fill in the following content:
OPENAI_API_KEY: 'OpenAI API Key Here if using OpenAI Model (optional)'ANTHROPIC_API_KEY: 'Anthropic API Key Here if using Anthropic Model (optional)'GITHUB_TOKEN: 'GitHub Token Here (required)'
The SWE-agent pipeline consists of two steps:
- Step 1: SWE-agent receives the input GitHub issue and returns a pull request to try to fix it;
- Step 2: Evaluate the pull request to verify that it actually solves the issue (currently only available for issues in the SWE-bench benchmark).
If you want to run and evaluate on the entire SWE-bench, the easiest way is to use an x86 machine.
python run.py --model_name gpt4 \--data_path https://github.com/pvlib/pvlib-python/issues/1603 --config_file config/default_from_url.yaml
python run.py --model_name gpt4 \--per_instance_cost_limit 2.00 \--config_file ./config/default.yaml
If you want to run a single question in SWE-bench, you can use --instance_filter:
python run.py --model_name gpt4 \--instance_filter marshmallow-code__marshmallow-1359
To learn more about AIGC, please visit:
51CTO AI.x Community
https://www.51cto.com/ aigc/
The above is the detailed content of The open source version of AI programmers is here: GPT-4 blessing, ability comparable to Devin, 1.4k Stars a day. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



The CentOS shutdown command is shutdown, and the syntax is shutdown [Options] Time [Information]. Options include: -h Stop the system immediately; -P Turn off the power after shutdown; -r restart; -t Waiting time. Times can be specified as immediate (now), minutes ( minutes), or a specific time (hh:mm). Added information can be displayed in system messages.

PHP and Python have their own advantages and disadvantages, and the choice depends on project needs and personal preferences. 1.PHP is suitable for rapid development and maintenance of large-scale web applications. 2. Python dominates the field of data science and machine learning.

Docker uses Linux kernel features to provide an efficient and isolated application running environment. Its working principle is as follows: 1. The mirror is used as a read-only template, which contains everything you need to run the application; 2. The Union File System (UnionFS) stacks multiple file systems, only storing the differences, saving space and speeding up; 3. The daemon manages the mirrors and containers, and the client uses them for interaction; 4. Namespaces and cgroups implement container isolation and resource limitations; 5. Multiple network modes support container interconnection. Only by understanding these core concepts can you better utilize Docker.

Complete Guide to Checking HDFS Configuration in CentOS Systems This article will guide you how to effectively check the configuration and running status of HDFS on CentOS systems. The following steps will help you fully understand the setup and operation of HDFS. Verify Hadoop environment variable: First, make sure the Hadoop environment variable is set correctly. In the terminal, execute the following command to verify that Hadoop is installed and configured correctly: hadoopversion Check HDFS configuration file: The core configuration file of HDFS is located in the /etc/hadoop/conf/ directory, where core-site.xml and hdfs-site.xml are crucial. use

Python and JavaScript have their own advantages and disadvantages in terms of community, libraries and resources. 1) The Python community is friendly and suitable for beginners, but the front-end development resources are not as rich as JavaScript. 2) Python is powerful in data science and machine learning libraries, while JavaScript is better in front-end development libraries and frameworks. 3) Both have rich learning resources, but Python is suitable for starting with official documents, while JavaScript is better with MDNWebDocs. The choice should be based on project needs and personal interests.

The command to restart the SSH service is: systemctl restart sshd. Detailed steps: 1. Access the terminal and connect to the server; 2. Enter the command: systemctl restart sshd; 3. Verify the service status: systemctl status sshd.

CentOS Installing Nginx requires following the following steps: Installing dependencies such as development tools, pcre-devel, and openssl-devel. Download the Nginx source code package, unzip it and compile and install it, and specify the installation path as /usr/local/nginx. Create Nginx users and user groups and set permissions. Modify the configuration file nginx.conf, and configure the listening port and domain name/IP address. Start the Nginx service. Common errors need to be paid attention to, such as dependency issues, port conflicts, and configuration file errors. Performance optimization needs to be adjusted according to the specific situation, such as turning on cache and adjusting the number of worker processes.

Installing MySQL on CentOS involves the following steps: Adding the appropriate MySQL yum source. Execute the yum install mysql-server command to install the MySQL server. Use the mysql_secure_installation command to make security settings, such as setting the root user password. Customize the MySQL configuration file as needed. Tune MySQL parameters and optimize databases for performance.
