CVEfixes: Automated Collection of Vulnerabilities and Their Fixes from Open-Source Software Paper • 2107.08760 • Published Jul 19, 2021
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future Paper • 2408.02479 • Published Aug 5, 2024
Vulnerability Detection Using Two-Stage Deep Learning Models Paper • 2305.09673 • Published May 8, 2023
Enhancing Large Language Models for Secure Code Generation: A Dataset-driven Study on Vulnerability Mitigation Paper • 2310.16263 • Published Oct 25, 2023
Vulnerability Detection with Code Language Models: How Far Are We? Paper • 2403.18624 • Published Mar 27, 2024
Automated Code-centric Software Vulnerability Assessment: How Far Are We? An Empirical Study in C/C++ Paper • 2407.17053 • Published Jul 24, 2024
Efficient Avoidance of Vulnerabilities in Auto-completed Smart Contract Code Using Vulnerability-constrained Decoding Paper • 2309.09826 • Published Sep 18, 2023 • 1
Code Security Vulnerability Repair Using Reinforcement Learning with Large Language Models Paper • 2401.07031 • Published Jan 13, 2024
A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly Paper • 2312.02003 • Published Dec 4, 2023
A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection Paper • 2512.16538 • Published Dec 18, 2025
White-Basilisk: A Hybrid Model for Code Vulnerability Detection Paper • 2507.08540 • Published Jul 11, 2025 • 1
VISION: Robust and Interpretable Code Vulnerability Detection Leveraging Counterfactual Augmentation Paper • 2508.18933 • Published Aug 26, 2025
LLM-Powered Code Vulnerability Repair with Reinforcement Learning and Semantic Reward Paper • 2401.03374 • Published Jan 7, 2024
Code Structure-Aware through Line-level Semantic Learning for Code Vulnerability Detection Paper • 2407.18877 • Published Jul 26, 2024
CodeQA: A Question Answering Dataset for Source Code Comprehension Paper • 2109.08365 • Published Sep 17, 2021
PyRadar: Towards Automatically Retrieving and Validating Source Code Repository Information for PyPI Packages Paper • 2404.16565 • Published Apr 25, 2024
Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation Paper • 2412.16135 • Published Dec 20, 2024
DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection Paper • 2304.00409 • Published Apr 1, 2023 • 1
STraceBERT: Source Code Retrieval using Semantic Application Traces Paper • 2312.04731 • Published Dec 7, 2023
CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models Paper • 2404.13161 • Published Apr 19, 2024
Comparing Human and LLM Generated Code: The Jury is Still Out! Paper • 2501.16857 • Published Jan 28, 2025 • 1
Benchmarking Large Language Models for Multi-Language Software Vulnerability Detection Paper • 2503.01449 • Published Mar 3, 2025 • 4
Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets Paper • 2501.02628 • Published Jan 5, 2025
Poisoning Programs by Un-Repairing Code: Security Concerns of AI-generated Code Paper • 2403.06675 • Published Mar 11, 2024
Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation Paper • 2308.10335 • Published Aug 20, 2023
BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems Paper • 2505.15216 • Published May 21, 2025
Assessing the Quality and Security of AI-Generated Code: A Quantitative Analysis Paper • 2508.14727 • Published Aug 20, 2025
Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis Paper • 2412.14841 • Published Dec 19, 2024
Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security Paper • 2507.19399 • Published Jul 25, 2025 • 2
An Empirical Study of Vulnerabilities in Python Packages and Their Detection Paper • 2509.04260 • Published Sep 4, 2025
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI Paper • 2410.11096 • Published Oct 14, 2024 • 13
Generate and Pray: Using SALLMS to Evaluate the Security of LLM Generated Code Paper • 2311.00889 • Published Nov 1, 2023
CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation Paper • 2501.08200 • Published Jan 14, 2025 • 1
ARVO: Atlas of Reproducible Vulnerabilities for Open Source Software Paper • 2408.02153 • Published Aug 4, 2024
ReCode: Robustness Evaluation of Code Generation Models Paper • 2212.10264 • Published Dec 20, 2022 • 1
MOCHA: Are Code Language Models Robust Against Multi-Turn Malicious Coding Prompts? Paper • 2507.19598 • Published Jul 25, 2025
The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMs Paper • 2504.11711 • Published Apr 16, 2025
IRIS: LLM-Assisted Static Analysis for Detecting Security Vulnerabilities Paper • 2405.17238 • Published May 27, 2024
QLCoder: A Query Synthesizer For Static Analysis of Security Vulnerabilities Paper • 2511.08462 • Published Nov 11, 2025
Understanding the Effectiveness of Large Language Models in Detecting Security Vulnerabilities Paper • 2311.16169 • Published Nov 16, 2023 • 1
PATCHEVAL: A New Benchmark for Evaluating LLMs on Patching Real-World Vulnerabilities Paper • 2511.11019 • Published Nov 14, 2025 • 1
RedCode: Risky Code Execution and Generation Benchmark for Code Agents Paper • 2411.07781 • Published Nov 12, 2024 • 1
Can Large Language Models Find And Fix Vulnerable Software? Paper • 2308.10345 • Published Aug 20, 2023
Deep Learning based Vulnerability Detection: Are We There Yet? Paper • 2009.07235 • Published Sep 3, 2020
On the Adversarial Robustness of Instruction-Tuned Large Language Models for Code Paper • 2411.19508 • Published Nov 29, 2024
Human-Written vs. AI-Generated Code: A Large-Scale Study of Defects, Vulnerabilities, and Complexity Paper • 2508.21634 • Published Aug 29, 2025
CodeAttack: Code-Based Adversarial Attacks for Pre-trained Programming Language Models Paper • 2206.00052 • Published May 31, 2022 • 1
Shellcode_IA32: A Dataset for Automatic Shellcode Generation Paper • 2104.13100 • Published Apr 27, 2021
MetaReflection: Learning Instructions for Language Agents using Past Reflections Paper • 2405.13009 • Published May 13, 2024
SecureBERT 2.0: Advanced Language Model for Cybersecurity Intelligence Paper • 2510.00240 • Published Sep 30, 2025 • 2
Symbol Preference Aware Generative Models for Recovering Variable Names from Stripped Binary Paper • 2306.02546 • Published Jun 5, 2023 • 1
A Repository-Level Dataset For Detecting, Classifying and Repairing Software Vulnerabilities Paper • 2401.13169 • Published Jan 24, 2024
SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks Paper • 2506.11791 • Published Jun 13, 2025
CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks Paper • 2507.05269 • Published Jul 3, 2025 • 1
RedCoder: Automated Multi-Turn Red Teaming for Code LLMs Paper • 2507.22063 • Published Jun 25, 2025 • 2
Cross-Domain Evaluation of Transformer-Based Vulnerability Detection on Open & Industry Data Paper • 2509.09313 • Published Sep 11, 2025 • 2
How Far Have We Gone in Stripped Binary Code Understanding Using Large Language Models Paper • 2404.09836 • Published Apr 15, 2024
Agent That Debugs: Dynamic State-Guided Vulnerability Repair Paper • 2504.07634 • Published Apr 10, 2025
AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI) Paper • 2506.08885 • Published Jun 10, 2025
Revisiting Pre-trained Language Models for Vulnerability Detection Paper • 2507.16887 • Published Jul 22, 2025 • 1
Leveraging multi-task learning to improve the detection of SATD and vulnerability Paper • 2501.15934 • Published Jan 27, 2025 • 2
Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning Paper • 2509.13755 • Published Sep 17, 2025 • 19
VulDeePecker: A Deep Learning-Based System for Vulnerability Detection Paper • 1801.01681 • Published Jan 5, 2018
Is Your AI-Generated Code Really Safe? Evaluating Large Language Models on Secure Code Generation with CodeSecEval Paper • 2407.02395 • Published Jul 2, 2024
Automating the Detection of Code Vulnerabilities by Analyzing GitHub Issues Paper • 2501.05258 • Published Jan 9, 2025
Devign: Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks Paper • 1909.03496 • Published Sep 8, 2019
An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation Paper • 2408.09078 • Published Aug 17, 2024
VulSolver: Vulnerability Detection via LLM-Driven Constraint Solving Paper • 2509.00882 • Published Aug 31, 2025
ProSec: Fortifying Code LLMs with Proactive Security Alignment Paper • 2411.12882 • Published Nov 19, 2024 • 2
SecureCode v2.0: A Production-Grade Dataset for Training Security-Aware Code Generation Models Paper • 2512.18542 • Published Dec 20, 2025 • 5
Reasoning with LLMs for Zero-Shot Vulnerability Detection Paper • 2503.17885 • Published Mar 22, 2025
VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection Paper • 2512.07533 • Published Dec 8, 2025 • 4
Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models Paper • 2103.15543 • Published Mar 29, 2021
Learning to Quantize Vulnerability Patterns and Match to Locate Statement-Level Vulnerabilities Paper • 2306.06109 • Published May 26, 2023
Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives Paper • 2310.01152 • Published Oct 2, 2023