GPT-4 can exploit real vulnerabilities by reading advisories (2024)

AI agents, which combine large language models with automation software, can successfully exploit real-world security vulnerabilities by reading security advisories, academics have claimed.

In a newly released paper, four University of Illinois Urbana-Champaign (UIUC) computer scientists – Richard Fang, Rohan Bindu, Akul Gupta, and Daniel Kang – report that OpenAI's GPT-4 large language model (LLM) can autonomously exploit vulnerabilities in real-world systems if given a CVE advisory describing the flaw.

"To show this, we collected a dataset of 15 one-day vulnerabilities that include ones categorized as critical severity in the CVE description," the US-based authors explain in their paper.

"When given the CVE description, GPT-4 is capable of exploiting 87 percent of these vulnerabilities compared to 0 percent for every other model we test (GPT-3.5, open-source LLMs) and open-source vulnerability scanners (ZAP and Metasploit)."

If you extrapolate to what future models can do, it seems likely they will be much more capable than what script kiddies can get access to today

The term "one-day vulnerability" refers to vulnerabilities that have been disclosed but not patched. And by CVE description, the team means a CVE-tagged advisory shared by NIST, e.g., the one for CVE-2024-28859.
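To make "CVE description" concrete, here is a minimal sketch of pulling the English description text out of a response shaped like NIST's NVD CVE API 2.0 JSON. The helper names `nvd_url` and `english_description` are hypothetical, the sample record is a placeholder rather than the real advisory text, and the field layout (`vulnerabilities[].cve.descriptions[]`) is an assumption about the NVD schema worth checking against the NVD documentation.

```python
from typing import Optional

# Assumed NVD CVE API 2.0 endpoint; verify against the NVD docs before use.
NVD_ENDPOINT = "https://services.nvd.nist.gov/rest/json/cves/2.0"


def nvd_url(cve_id: str) -> str:
    """Build the lookup URL for a single CVE identifier."""
    return f"{NVD_ENDPOINT}?cveId={cve_id}"


def english_description(nvd_response: dict) -> Optional[str]:
    """Return the first English description in an NVD-style response, if any."""
    for item in nvd_response.get("vulnerabilities", []):
        for desc in item.get("cve", {}).get("descriptions", []):
            if desc.get("lang") == "en":
                return desc.get("value")
    return None


# Placeholder record: the description text here is NOT the real advisory.
sample_response = {
    "vulnerabilities": [
        {
            "cve": {
                "id": "CVE-2024-28859",
                "descriptions": [
                    {"lang": "es", "value": "texto de ejemplo"},
                    {"lang": "en", "value": "placeholder advisory text"},
                ],
            }
        }
    ]
}

print(nvd_url("CVE-2024-28859"))
print(english_description(sample_response))
```

The description string is the only input the article says the agent was given about each flaw, which is why it is isolated here.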

Eleven of the vulnerabilities tested date from after GPT-4's training cutoff, meaning the model had not seen any data about them during training. Its success rate for these CVEs was slightly lower at 82 percent, or 9 out of 11.
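The percentages can be cross-checked with a little arithmetic. The sketch below is an inference from the reported figures, not a count stated by the researchers: it looks for every number of successes out of 15 that rounds to 87 percent, and recomputes the 9-out-of-11 rate. The function name `consistent_counts` is made up for illustration.

```python
from typing import List


def consistent_counts(percent: int, total: int) -> List[int]:
    """Return every success count whose rate rounds to `percent`."""
    return [k for k in range(total + 1) if round(100 * k / total) == percent]


# Which counts out of 15 are consistent with the reported 87 percent?
overall = consistent_counts(87, 15)

# The post-cutoff figure is given directly as 9 out of 11.
post_cutoff_rate = 100 * 9 / 11

print(overall)                     # [13]
print(f"{post_cutoff_rate:.1f}")   # 81.8
```

Only 13 successes out of 15 round to 87 percent, so the two figures imply the agent failed on two vulnerabilities overall, both of them among the 11 post-cutoff CVEs.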

As to the nature of the bugs, they are all listed in the above paper, and we're told: "Our vulnerabilities span website vulnerabilities, container vulnerabilities, and vulnerable Python packages. Over half are categorized as 'high' or 'critical' severity by the CVE description."

Kang and his colleagues computed the cost of a successful LLM agent attack and came up with a figure of $8.80 per exploit, which they say is about 2.8 times less than it would cost to hire a human penetration tester for 30 minutes.
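As a rough worked example of that comparison, the human-side cost can be backed out from the two numbers quoted. This is only arithmetic on the article's figures; the implied tester rate is derived here and is not a number given in the article.

```python
# Figures quoted in the article.
agent_cost_per_exploit = 8.80   # US dollars per successful exploit
cost_ratio = 2.8                # human cost divided by agent cost
human_minutes = 30              # duration of the human engagement compared against

# Derived values (an inference, not stated in the article).
implied_human_cost = round(agent_cost_per_exploit * cost_ratio, 2)
implied_hourly_rate = round(implied_human_cost * 60 / human_minutes, 2)

print(implied_human_cost)    # 24.64
print(implied_hourly_rate)   # 49.28
```

In other words, the comparison assumes a penetration tester costing roughly $25 for half an hour, or about $50 an hour.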

The agent code, according to Kang, consists of just 91 lines of code and 1,056 tokens for the prompt. The researchers were asked by OpenAI, the maker of GPT-4, to not release their prompts to the public, though they say they will provide them upon request.

OpenAI did not immediately respond to a request for comment. ®

FAQs

Can GPT-4 exploit real vulnerabilities by reading advisories?

OpenAI's GPT-4 can exploit real-world security vulnerabilities by reading security advisories (CVE descriptions). It achieved an 87% success rate on a set of 15 vulnerabilities. GPT-4 is significantly more successful than other large language models (LLMs) and open-source vulnerability scanners tested in the research.

In which attack does the attacker exploit vulnerabilities before the software developer can release a patch for them?

A zero-day vulnerability is a software vulnerability discovered by attackers before the vendor has become aware of it.

What is attacking a system by exploiting an otherwise unknown vulnerability?

A zero-day attack is an attempt by a threat actor to penetrate, damage, or otherwise compromise a system that is affected by an unknown vulnerability. By nature of the attack, the victim will not have defenses in place, making it highly likely to succeed.

Which attack exploits a software vulnerability that is unknown to the developer?

A zero-day (0day) exploit is a cyber attack targeting a software vulnerability which is unknown to the software vendor or to antivirus vendors.

How do hackers exploit operating system vulnerabilities?

Scanning and Enumeration: Hackers use automated tools to scan networks and systems to identify potential vulnerabilities. They look for open ports, services, and devices that might have weak security configurations.

What exploits vulnerabilities or bugs in a system or application?

An exploit is a program, or piece of code, designed to find and take advantage of a security flaw or vulnerability in an application or computer system, typically for malicious purposes such as installing malware. An exploit is not malware itself, but rather it is a method used by cybercriminals to deliver malware.

How do exploits relate to vulnerabilities?

Exploits are the means through which a vulnerability can be leveraged for malicious activity by hackers; these include pieces of software, sequences of commands, or even open-source exploit kits.

What are some common ways an attacker could exploit the system?

Common Attack Vector Examples

Compromised Credentials. Usernames and passwords are still the most common type of access credential and continue to be exposed in data leaks, phishing scams, and malware. ...
Weak Credentials. ...
Insider Threats. ...
Missing or Poor Encryption. ...
Misconfiguration. ...
Ransomware. ...
Phishing. ...
Vulnerabilities.

Which type of exploit requires prior access to the vulnerable system?

Remote exploits work over a network and exploit the vulnerability without prior access to the vulnerable system. Local exploits require prior access to the vulnerable system and increase the attacker's privileges beyond those granted by the security administrator.

What is an example of vulnerability exploitation?

A cybercriminal exploiting a vulnerability can perform various malicious actions, such as installing malicious software (malware), running malicious code, and stealing sensitive data. Common exploitation techniques include SQL injection (SQLi), cross-site scripting (XSS), and buffer overflow.

What is the best defense against social engineering?

Top 10 Ways to Prevent Social Engineering Attacks

Multi-Factor Authentication. ...
Continuously Monitor Critical System. ...
Utilize Next-Gen cloud-based WAF. ...
Verify Email Sender's Identity. ...
Identify your critical assets which attract criminals. ...
Check for SSL Certificate. ...
Penetration Testing. ...
Check and Update your Security Patches.

What is a dummy computer that is made to look vulnerable in order to deceive attackers?

A honeypot is a security mechanism that creates a virtual trap to lure attackers. An intentionally compromised computer system allows attackers to exploit vulnerabilities so you can study them to improve your security policies.

What is the most famous zero-day exploit?

Attack #1 – Sony Zero-Day Attack

One of the most famous zero-day attacks was launched in 2014 against Sony Pictures Entertainment. Through a specific unknown exploit, a team of hackers silently crept into Sony's network and got access to all vital information quickly.

Which vulnerability is most frequently exploited by hackers?

The most common security vulnerabilities that are exploited by hackers are: Injection flaws: These vulnerabilities allow attackers to inject malicious code into a system, such as through a web application or database.

How do hackers find their victims?

An attacker might choose their target list through readily available data online, such as employee count, industry, or existing vendor relationships, then narrow their search down further from there.

How do hackers penetrate networks?

One of the fastest ways cybercriminals access networks is by duping unsuspecting users into willingly downloading malicious software, embedding it within downloadable files, games or other “innocent”-looking apps. This can largely be prevented with a good firewall and employee training and monitoring.

In which attack does an attacker exploit a vulnerability in a bare-metal cloud server?

A Cloudborne attack. In this attack, an attacker exploits a vulnerability in a bare-metal cloud server to implant a malicious backdoor in its firmware. A bare-metal cloud server is a physical server that is dedicated to a single user or organization.

What can an attacker do with a software vulnerability?

An attacker can exploit a software vulnerability to steal or manipulate sensitive data, join a system to a botnet, install a backdoor, or plant other types of malware.

In which phase does the hacker exploit network or system vulnerabilities?

After scanning the system or network, penetration testers try to exploit its flaws in the “gaining access” phase.

What are the three types of software attacks?

The 3 Main Types of Cyberattacks & How to Prevent Them

Malware. An attack that involves the installation of unwanted programs or software on your system without your permission.
Social Engineering. ...
DoS and DDoS Attacks. ...
Man-In-The-Middle Attacks. ...
SQL Injections. ...
Cybersecurity Breaches. ...
Tips on Preventing Cyberattacks.
