The SentinelOne Annual Threat Report - A Defenders Guide from the FrontlinesThe SentinelOne Annual Threat ReportGet the Report
Experiencing a Breach?Blog
Get StartedContact Us
SentinelOne
  • Platform
    Platform Overview
    • Singularity Platform
      Welcome to Integrated Enterprise Security
    • AI for Security
      Leading the Way in AI-Powered Security Solutions
    • Securing AI
      Accelerate AI Adoption with Secure AI Tools, Apps, and Agents.
    • How It Works
      The Singularity XDR Difference
    • Singularity Marketplace
      One-Click Integrations to Unlock the Power of XDR
    • Pricing & Packaging
      Comparisons and Guidance at a Glance
    Data & AI
    • Purple AI
      Accelerate SecOps with Generative AI
    • Singularity Hyperautomation
      Easily Automate Security Processes
    • AI-SIEM
      The AI SIEM for the Autonomous SOC
    • Singularity Data Lake
      AI-Powered, Unified Data Lake
    • Singularity Data Lake for Log Analytics
      Seamlessly Ingest Data from On-Prem, Cloud or Hybrid Environments
    Endpoint Security
    • Singularity Endpoint
      Autonomous Prevention, Detection, and Response
    • Singularity XDR
      Native & Open Protection, Detection, and Response
    • Singularity RemoteOps Forensics
      Orchestrate Forensics at Scale
    • Singularity Threat Intelligence
      Comprehensive Adversary Intelligence
    • Singularity Vulnerability Management
      Application & OS Vulnerability Management
    • Singularity Identity
      Identity Threat Detection and Response
    Cloud Security
    • Singularity Cloud Security
      Block Attacks with an AI-Powered CNAPP
    • Singularity Cloud Native Security
      Secure Cloud and Development Resources
    • Singularity Cloud Workload Security
      Real-Time Cloud Workload Protection Platform
    • Singularity Cloud Data Security
      AI-Powered Threat Detection for Cloud Storage
    • Singularity Cloud Security Posture Management
      Detect and Remediate Cloud Misconfigurations
    Securing AI
    • Prompt Security
      Secure AI Tools Across Your Enterprise
  • Why SentinelOne?
    Why SentinelOne?
    • Why SentinelOne?
      Cybersecurity Built for What’s Next
    • Our Customers
      Trusted by the World’s Leading Enterprises
    • Industry Recognition
      Tested and Proven by the Experts
    • About Us
      The Industry Leader in Autonomous Cybersecurity
    Compare SentinelOne
    • Arctic Wolf
    • Broadcom
    • CrowdStrike
    • Cybereason
    • Microsoft
    • Palo Alto Networks
    • Sophos
    • Splunk
    • Trellix
    • Trend Micro
    • Wiz
    Verticals
    • Energy
    • Federal Government
    • Finance
    • Healthcare
    • Higher Education
    • K-12 Education
    • Manufacturing
    • Retail
    • State and Local Government
  • Services
    Managed Services
    • Managed Services Overview
      Wayfinder Threat Detection & Response
    • Threat Hunting
      World-Class Expertise and Threat Intelligence
    • Managed Detection & Response
      24/7/365 Expert MDR Across Your Entire Environment
    • Incident Readiness & Response
      DFIR, Breach Readiness, & Compromise Assessments
    Support, Deployment, & Health
    • Technical Account Management
      Customer Success with Personalized Service
    • SentinelOne GO
      Guided Onboarding & Deployment Advisory
    • SentinelOne University
      Live and On-Demand Training
    • Services Overview
      Comprehensive Solutions for Seamless Security Operations
    • SentinelOne Community
      Community Login
  • Partners
    Our Network
    • MSSP Partners
      Succeed Faster with SentinelOne
    • Singularity Marketplace
      Extend the Power of S1 Technology
    • Cyber Risk Partners
      Enlist Pro Response and Advisory Teams
    • Technology Alliances
      Integrated, Enterprise-Scale Solutions
    • SentinelOne for AWS
      Hosted in AWS Regions Around the World
    • Channel Partners
      Deliver the Right Solutions, Together
    • SentinelOne for Google Cloud
      Unified, Autonomous Security Giving Defenders the Advantage at Global Scale
    • Partner Locator
      Your Go-to Source for Our Top Partners in Your Region
    Partner Portal→
  • Resources
    Resource Center
    • Case Studies
    • Data Sheets
    • eBooks
    • Reports
    • Videos
    • Webinars
    • Whitepapers
    • Events
    View All Resources→
    Blog
    • Feature Spotlight
    • For CISO/CIO
    • From the Front Lines
    • Identity
    • Cloud
    • macOS
    • SentinelOne Blog
    Blog→
    Tech Resources
    • SentinelLABS
    • Ransomware Anthology
    • Cybersecurity 101
  • About
    About SentinelOne
    • About SentinelOne
      The Industry Leader in Cybersecurity
    • Investor Relations
      Financial Information & Events
    • SentinelLABS
      Threat Research for the Modern Threat Hunter
    • Careers
      The Latest Job Opportunities
    • Press & News
      Company Announcements
    • Cybersecurity Blog
      The Latest Cybersecurity Threats, News, & More
    • FAQ
      Get Answers to Our Most Frequently Asked Questions
    • DataSet
      The Live Data Platform
    • S Foundation
      Securing a Safer Future for All
    • S Ventures
      Investing in the Next Generation of Security, Data and AI
  • Pricing
Get StartedContact Us
CVE Vulnerability Database
Vulnerability Database/CVE-2026-22773

CVE-2026-22773: vLLM Inference Engine DoS Vulnerability

CVE-2026-22773 is a denial of service flaw in vLLM inference engine affecting Idefics3 multimodal models. Attackers can crash servers using crafted 1x1 images. This article covers technical details, affected versions, and patches.

Updated: January 22, 2026

CVE-2026-22773 Overview

CVE-2026-22773 is a Denial of Service vulnerability affecting vLLM, a popular inference and serving engine for large language models (LLMs). The vulnerability allows authenticated users to crash vLLM engine instances serving multimodal models that utilize the Idefics3 vision model implementation by sending a specially crafted 1x1 pixel image. This triggers a tensor dimension mismatch that results in an unhandled runtime error, causing complete server termination.

Critical Impact

Authenticated attackers can cause complete denial of service to vLLM inference servers by submitting malicious image payloads, disrupting AI/ML workloads and potentially affecting production LLM services.

Affected Products

  • vLLM versions 0.6.4 through 0.11.x (prior to 0.12.0)
  • vLLM deployments using Idefics3 vision model implementation
  • Multimodal LLM serving configurations

Discovery Timeline

  • 2026-01-10 - CVE CVE-2026-22773 published to NVD
  • 2026-01-13 - Last updated in NVD database

Technical Details for CVE-2026-22773

Vulnerability Analysis

This vulnerability exists in the image processing pipeline of vLLM's Idefics3 vision model implementation. When processing multimodal inputs that combine text and images, the Idefics3 model expects image tensors with specific dimensional requirements. The vulnerability stems from improper allocation of resources without limits (CWE-770), where the system fails to validate image dimensions before tensor operations.

When a malformed 1x1 pixel image is submitted to the inference endpoint, the vision model's tensor processing logic encounters a dimension mismatch during the image embedding phase. This mismatch triggers an unhandled runtime exception that propagates up the call stack, bypassing any error recovery mechanisms and causing the entire vLLM server process to terminate.

The attack is particularly impactful because vLLM is designed to handle high-throughput inference workloads, meaning a single malicious request can disrupt service for all concurrent users and queued requests.

Root Cause

The root cause is insufficient input validation in the Idefics3 vision model's image preprocessing pipeline. The code assumes incoming images meet minimum dimensional requirements for tensor operations without explicitly validating these constraints. When boundary-case images (such as 1x1 pixel images) are processed, the resulting tensor shapes are incompatible with downstream operations, causing the runtime error.

The underlying issue is classified as CWE-770 (Allocation of Resources Without Limits or Throttling), as the system fails to properly constrain and validate the image input resources before processing them in tensor operations.

Attack Vector

The attack can be executed remotely over the network by any authenticated user with access to the vLLM inference API. The attacker needs to:

  1. Identify a vLLM deployment serving a multimodal model with Idefics3 vision capabilities
  2. Craft a valid API request containing a 1x1 pixel image payload
  3. Submit the request to the multimodal inference endpoint

The vulnerability manifests when the Idefics3 vision model attempts to process the malformed image, resulting in a tensor dimension mismatch during the embedding generation phase. This causes an unhandled runtime exception that terminates the server process. For technical implementation details, refer to the GitHub Security Advisory.

Detection Methods for CVE-2026-22773

Indicators of Compromise

  • Unexpected vLLM server process terminations or crashes
  • Error logs containing tensor dimension mismatch exceptions related to image processing
  • API requests containing unusually small image payloads (particularly 1x1 pixel images)
  • Repeated server restarts following multimodal inference requests

Detection Strategies

  • Monitor vLLM server logs for unhandled runtime errors in the Idefics3 vision model components
  • Implement request logging to capture image dimensions before processing
  • Deploy application-level health checks to detect unexpected server terminations
  • Analyze API traffic patterns for requests with minimal image payloads targeting multimodal endpoints

Monitoring Recommendations

  • Configure alerting for vLLM process crashes or unexpected restarts
  • Implement log aggregation to correlate tensor-related exceptions with incoming request payloads
  • Monitor inference API latency spikes that may indicate service degradation before crashes
  • Track request patterns from individual users for anomalous small image submissions

How to Mitigate CVE-2026-22773

Immediate Actions Required

  • Upgrade vLLM to version 0.12.0 or later immediately
  • Review access controls to restrict multimodal inference endpoints to trusted users
  • Implement request rate limiting on API endpoints as a temporary protective measure
  • Deploy health monitoring to enable rapid restart of crashed instances

Patch Information

The vulnerability has been patched in vLLM version 0.12.0. Organizations should upgrade to this version or later to remediate the vulnerability. The patch adds proper validation of image dimensions before tensor processing operations, ensuring that malformed images are rejected with an appropriate error response rather than causing server crashes.

For detailed patch information, see the GitHub Security Advisory.

Workarounds

  • Implement input validation at the API gateway level to reject images below minimum dimensional thresholds
  • Deploy vLLM instances behind a reverse proxy that filters requests with malformed image payloads
  • Use container orchestration with automatic restart policies to minimize downtime from crashes
  • Consider temporarily disabling Idefics3 vision model support if not required for production workloads
bash
# Configuration example - Upgrade vLLM to patched version
pip install --upgrade vllm>=0.12.0

# Verify installed version
pip show vllm | grep Version

Disclaimer: This content was generated using AI. While we strive for accuracy, please verify critical information with official sources.

  • Vulnerability Details
  • TypeDOS

  • Vendor/TechVllm

  • SeverityMEDIUM

  • CVSS Score6.5

  • EPSS Probability0.04%

  • Known ExploitedNo
  • CVSS Vector
  • CVSS:3.1/AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H
  • Impact Assessment
  • ConfidentialityLow
  • IntegrityNone
  • AvailabilityHigh
  • CWE References
  • CWE-770
  • Technical References
  • GitHub Security Advisory
  • Related CVEs
  • CVE-2026-27893: Vllm Vllm RCE Vulnerability

  • CVE-2026-25960: vLLM SSRF Bypass Vulnerability

  • CVE-2026-22778: vLLM ASLR Bypass and RCE Vulnerability

  • CVE-2026-24779: vLLM SSRF Vulnerability
Experience the World’s Most Advanced Cybersecurity Platform

Experience the World’s Most Advanced Cybersecurity Platform

See how our intelligent, autonomous cybersecurity platform can protect your organization now and into the future.

Try SentinelOne
  • Get Started
  • Get a Demo
  • Product Tour
  • Why SentinelOne
  • Pricing & Packaging
  • FAQ
  • Contact
  • Contact Us
  • Customer Support
  • SentinelOne Status
  • Language
  • Platform
  • Singularity Platform
  • Singularity Endpoint
  • Singularity Cloud
  • Singularity AI-SIEM
  • Singularity Identity
  • Singularity Marketplace
  • Purple AI
  • Services
  • Wayfinder TDR
  • SentinelOne GO
  • Technical Account Management
  • Support Services
  • Verticals
  • Energy
  • Federal Government
  • Finance
  • Healthcare
  • Higher Education
  • K-12 Education
  • Manufacturing
  • Retail
  • State and Local Government
  • Cybersecurity for SMB
  • Resources
  • Blog
  • Labs
  • Case Studies
  • Videos
  • Product Tours
  • Events
  • Cybersecurity 101
  • eBooks
  • Webinars
  • Whitepapers
  • Press
  • News
  • Ransomware Anthology
  • Company
  • About Us
  • Our Customers
  • Careers
  • Partners
  • Legal & Compliance
  • Security & Compliance
  • Investor Relations
  • S Foundation
  • S Ventures

©2026 SentinelOne, All Rights Reserved.

Privacy Notice Terms of Use

English