
CVE-2026-2069: llama.cpp Buffer Overflow Vulnerability

CVE-2026-2069 is a stack-based buffer overflow flaw in ggml-org llama.cpp affecting the GBNF Grammar Handler. Attackers with local access can exploit this vulnerability. This article covers technical details, impact, and patches.

Published: February 13, 2026

CVE-2026-2069 Overview

A stack-based buffer overflow vulnerability has been discovered in ggml-org llama.cpp, a popular C/C++ implementation for running Large Language Model (LLM) inference. The vulnerability exists in the llama_grammar_advance_stack function of the GBNF Grammar Handler component, in the file llama.cpp/src/llama-grammar.cpp. By supplying maliciously crafted grammar input, an attacker with local access can trigger the overflow and crash the process, causing a denial of service; other impacts may be possible depending on the system's memory protections.

Critical Impact

Local attackers can exploit a stack-based buffer overflow in the GBNF Grammar Handler to crash the process and cause a denial of service. A proof-of-concept exploit has been published, and the vulnerability affects versions up to commit 55abc39.

Affected Products

  • ggml-org llama.cpp versions up to commit 55abc39
  • Applications integrating the affected llama.cpp GBNF Grammar Handler component
  • Systems running unpatched llama.cpp for LLM inference

Discovery Timeline

  • 2026-02-06 - CVE-2026-2069 published to NVD
  • 2026-02-09 - Last updated in NVD database

Technical Details for CVE-2026-2069

Vulnerability Analysis

This vulnerability is classified as CWE-119 (Improper Restriction of Operations within the Bounds of a Memory Buffer). The flaw resides in the llama_grammar_advance_stack function, which manages the grammar parsing stack during GBNF (GGML BNF) grammar processing. When handling specially crafted grammar input, the function fails to properly validate buffer boundaries, resulting in a stack-based buffer overflow.

The vulnerability requires local access to exploit, meaning an attacker would need the ability to provide malicious grammar files or input to an application using the vulnerable llama.cpp library. While the direct impact is limited to availability (denial of service), stack-based buffer overflows can potentially be leveraged for more severe attacks depending on the system's memory protection mechanisms.

Root Cause

The root cause lies in insufficient bounds checking within the llama_grammar_advance_stack function when processing grammar rules. The function operates on a stack data structure that manages grammar states during parsing, but does not adequately validate the stack depth or buffer size before performing write operations. This allows carefully constructed grammar input to overflow the allocated stack buffer.
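The kind of guard described above can be illustrated with a small sketch. It is written in Python rather than C++, it does not reproduce the llama.cpp code or the actual fix, and `MAX_DEPTH`, `GrammarDepthError`, `expand`, and the toy grammar representation are all hypothetical names chosen for illustration. The point is only that a recursive rule expansion which checks its depth before descending fails with a controlled error on a self-referential rule instead of exhausting the stack.

```python
MAX_DEPTH = 64  # hypothetical limit, chosen for illustration only


class GrammarDepthError(ValueError):
    """Raised when rule expansion nests deeper than MAX_DEPTH."""


def expand(rules, name, depth=0):
    """Collect the terminals that can start the rule `name`.

    `rules` maps a rule name to a list of alternatives, and each alternative
    is a list of symbols. A symbol that is a key of `rules` is a rule
    reference; anything else is treated as a terminal.
    """
    # Check the depth BEFORE recursing, so a left-recursive rule is rejected
    # with a normal exception rather than unbounded recursion.
    if depth > MAX_DEPTH:
        raise GrammarDepthError(f"rule {name!r} nests deeper than {MAX_DEPTH}")
    terminals = set()
    for alternative in rules[name]:
        if not alternative:
            continue  # empty alternative contributes no leading terminal
        first = alternative[0]
        if first in rules:
            terminals |= expand(rules, first, depth + 1)
        else:
            terminals.add(first)
    return terminals
```

With this guard, a toy grammar such as `{"root": [["root", "a"]]}` raises `GrammarDepthError` instead of recursing until the interpreter gives up.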

Attack Vector

The attack requires local access to the target system. An attacker must be able to supply a malicious GBNF grammar file or grammar string to an application using the vulnerable llama.cpp library. The exploit has been publicly disclosed, with a proof-of-concept available demonstrating how to trigger the overflow condition.

The attack scenario involves:

  1. Creating a specially crafted GBNF grammar file designed to exhaust or overflow the grammar stack
  2. Providing this malicious grammar to an application using llama.cpp for inference
  3. Triggering the llama_grammar_advance_stack function to process the malformed input
  4. Causing a stack-based buffer overflow that crashes the application; any impact beyond denial of service depends on the system's memory protection mechanisms

Technical details and a proof-of-concept are available in the GitHub Issue Tracker entry, and the PoC archive contains reproduction steps. Both are listed under Technical References.

Detection Methods for CVE-2026-2069

Indicators of Compromise

  • Unexpected crashes or segmentation faults in applications using llama.cpp during grammar processing
  • Presence of unusually large or malformed GBNF grammar files on the system
  • Application logs showing errors related to llama_grammar_advance_stack or grammar parsing failures
  • Core dumps or crash reports indicating stack corruption in llama.cpp components
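A minimal sketch of how the log-based indicators above might be checked. The patterns are assumptions, not messages confirmed for llama.cpp: the function name comes from this advisory, "stack smashing detected" is the message glibc prints when a stack protector trips, and the segmentation fault pattern is generic. `CRASH_PATTERNS` and `find_crash_indicators` are hypothetical names.

```python
import re

# Assumed patterns; adjust them to the log format your application emits.
CRASH_PATTERNS = [
    re.compile(r"llama_grammar_advance_stack"),
    re.compile(r"segmentation fault|SIGSEGV", re.IGNORECASE),
    re.compile(r"stack smashing detected"),
]


def find_crash_indicators(lines):
    """Return (line_number, text) pairs for lines matching any pattern."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if any(pattern.search(line) for pattern in CRASH_PATTERNS):
            hits.append((number, line.rstrip("\n")))
    return hits
```

The function accepts any iterable of strings, so an open log file can be passed directly, for example `find_crash_indicators(open("app.log", encoding="utf-8", errors="replace"))`, where `app.log` is a placeholder path. A match is a prompt for investigation, not proof of exploitation.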

Detection Strategies

  • Monitor for applications loading llama.cpp builds that predate patch #18993
  • Implement file integrity monitoring for grammar files used by LLM inference applications
  • Deploy memory corruption detection tools (AddressSanitizer, Valgrind) during development and testing
  • Use application-level logging to track grammar file sources and processing events
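File integrity monitoring for grammar files can be approximated with a hash baseline. The sketch below assumes grammar files use the `.gbnf` extension and live under one directory; `sha256_of`, `snapshot`, and `diff_snapshots` are hypothetical helper names, and a production deployment would more likely rely on a dedicated integrity monitoring tool.

```python
import hashlib
from pathlib import Path


def sha256_of(path):
    """Hash a file in chunks so large files are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def snapshot(directory, pattern="*.gbnf"):
    """Map each matching file (path relative to `directory`) to its SHA-256."""
    root = Path(directory)
    return {
        str(path.relative_to(root)): sha256_of(path)
        for path in sorted(root.rglob(pattern))
        if path.is_file()
    }


def diff_snapshots(baseline, current):
    """Report files added, removed, or changed since the baseline snapshot."""
    return {
        "added": sorted(set(current) - set(baseline)),
        "removed": sorted(set(baseline) - set(current)),
        "changed": sorted(
            name for name in baseline.keys() & current.keys()
            if baseline[name] != current[name]
        ),
    }
```

A baseline taken with `snapshot()` after a trusted deployment can be stored and later compared against a fresh snapshot; any non-empty entry in the result marks a grammar file worth reviewing before it is loaded.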

Monitoring Recommendations

  • Enable crash reporting and analysis for applications utilizing llama.cpp
  • Monitor system resource usage for abnormal memory patterns during LLM inference operations
  • Implement input validation for any user-supplied grammar files before processing
  • Deploy SentinelOne Singularity platform for real-time detection of memory corruption exploitation attempts

How to Mitigate CVE-2026-2069

Immediate Actions Required

  • Update llama.cpp to the patched version by applying patch #18993
  • Audit all applications and services using llama.cpp for grammar processing functionality
  • Restrict local access to systems running vulnerable llama.cpp versions
  • Implement input validation for grammar files to reject malformed or suspicious input
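One way to approach the input validation item is a pre-filter that rejects grammars that are unusually large or deeply nested before they reach the parser. The sketch below is a heuristic under stated assumptions: the limits `MAX_BYTES` and `MAX_NESTING` are arbitrary and not derived from the published proof-of-concept, the scanner only understands string literals, character classes, `#` comments, and parentheses, and `validate_grammar_text` is a hypothetical name. Passing these checks does not make a grammar safe, and the filter is no substitute for the patch.

```python
MAX_BYTES = 64 * 1024  # hypothetical size limit
MAX_NESTING = 32       # hypothetical limit on nested ( ... ) groups


def validate_grammar_text(text):
    """Return a list of problems found; an empty list means the checks passed."""
    problems = []
    if len(text.encode("utf-8")) > MAX_BYTES:
        problems.append("grammar exceeds size limit")

    depth = max_depth = 0
    in_string = in_class = in_comment = escaped = unbalanced = False
    for ch in text:
        if in_comment:
            in_comment = ch != "\n"        # comments run to end of line
        elif escaped:
            escaped = False                # skip the escaped character
        elif ch == "\\" and (in_string or in_class):
            escaped = True
        elif in_string:
            in_string = ch != '"'
        elif in_class:
            in_class = ch != "]"
        elif ch == "#":
            in_comment = True
        elif ch == '"':
            in_string = True
        elif ch == "[":
            in_class = True
        elif ch == "(":
            depth += 1
            max_depth = max(max_depth, depth)
        elif ch == ")":
            if depth == 0:
                unbalanced = True
            else:
                depth -= 1

    if unbalanced:
        problems.append("unbalanced ')'")
    if depth > 0 or in_string or in_class:
        problems.append("unterminated group, string, or character class")
    if max_depth > MAX_NESTING:
        problems.append("group nesting exceeds limit")
    return problems
```

A caller would load the grammar only when the returned list is empty and log the problems otherwise, which also supports the recommendation to track grammar sources and processing events.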

Patch Information

The vulnerability has been addressed in patch #18993 available in the llama.cpp GitHub repository. Organizations should update to the latest version of llama.cpp that includes this fix. The patch addresses the buffer boundary validation issue in the llama_grammar_advance_stack function.

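To apply the patch, clone the latest repository or update an existing checkout, then rebuild:

bash
# Fresh checkout: cloning already fetches the latest master
git clone https://github.com/ggml-org/llama.cpp/
cd llama.cpp

# Existing checkout: run this inside the repository instead of cloning
git pull origin master

# Rebuild the project with CMake
cmake -B build
cmake --build build --config Release -j "$(nproc)"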

Workarounds

  • Disable or restrict GBNF grammar processing functionality if not required for your use case
  • Implement strict input validation and sanitization for all grammar files before processing
  • Run llama.cpp applications in sandboxed environments with limited privileges
  • Deploy application firewalls or input filters to block potentially malicious grammar constructs

bash
# Example: Run llama.cpp in a restricted container environment
# (llama-cpp-container is a placeholder image name; substitute your own image)
docker run --read-only --security-opt=no-new-privileges \
  --cap-drop=ALL --memory=4g --cpus=2 \
  -v /safe/grammar/path:/grammar:ro \
  llama-cpp-container
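
Where a container is not available, a lighter form of containment is to run grammar-driven inference in a child process, so that a crash takes down only that child and not the calling service. The sketch below is process isolation, not a sandbox: it limits neither privileges nor file access. `run_isolated` is a hypothetical helper, and the command passed to it is whatever inference binary your deployment uses.

```python
import subprocess
import sys


def run_isolated(argv, timeout_s=60):
    """Run a command in a child process and report how it ended."""
    try:
        proc = subprocess.run(argv, capture_output=True, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising on timeout
        return {"status": "timeout", "returncode": None}
    if proc.returncode < 0:
        # On POSIX a negative return code means the child died from a signal
        # (for example -11 for SIGSEGV), which is how a memory fault appears.
        return {"status": "crashed", "returncode": proc.returncode}
    status = "ok" if proc.returncode == 0 else "error"
    return {"status": status, "returncode": proc.returncode}


if __name__ == "__main__":
    # Demonstration with the Python interpreter itself as the child process.
    print(run_isolated([sys.executable, "-c", "print('hello')"]))
```

A service could call this with its own inference command, for instance a `llama-cli` invocation that passes an untrusted file through the `--grammar-file` option (paths and model names being deployment-specific), and treat a `crashed` status as an event to log and investigate.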

Disclaimer: This content was generated using AI. While we strive for accuracy, please verify critical information with official sources.

Vulnerability Details

  • Type: Buffer Overflow
  • Vendor/Tech: Llama Cpp
  • Severity: MEDIUM
  • CVSS Score: 4.8
  • EPSS Probability: 0.01%
  • Known Exploited: No

CVSS Vector

  • CVSS:4.0/AV:L/AC:L/AT:N/PR:L/UI:N/VC:N/VI:N/VA:L/SC:N/SI:N/SA:N/E:P/CR:X/IR:X/AR:X/MAV:X/MAC:X/MAT:X/MPR:X/MUI:X/MVC:X/MVI:X/MVA:X/MSC:X/MSI:X/MSA:X/S:X/AU:X/R:X/V:X/RE:X/U:X

Impact Assessment

  • Confidentiality: Low
  • Integrity: None
  • Availability: Low

CWE References

  • CWE-119

Technical References

  • GitHub Repository
  • GitHub Issue Tracker
  • GitHub Issue Event
  • GitHub Pull Request
  • GitHub PoC Zip File
  • VulDB #344636 (CTI)
  • VulDB #344636
  • VulDB Submission #745263

Related CVEs

  • CVE-2026-27940: llama.cpp Buffer Overflow Vulnerability
  • CVE-2026-34159: llama.cpp RPC Backend RCE Vulnerability
  • CVE-2026-33298: llama.cpp Integer Overflow RCE Vulnerability
  • CVE-2026-21869: llama.cpp RCE Vulnerability

©2026 SentinelOne, All Rights Reserved.
