What open-source LLM security tools does AISecIntel Group offer?

AISecIntel Group provides a modular suite of open-source security tools including PromptGuard (input validation & sanitization middleware), RAG Faithfulness Scorer (hallucination detection), Vector Isolation Auditor (tenant separation), and LLM Supply Chain Auditor (dependency and weights vulnerability scanner). Our repositories are hosted publicly at [https://github.com/AISecIntelGroup-Tools](https://github.com/AISecIntelGroup-Tools).

How does AISecIntel Group mitigate prompt injection vulnerabilities?

Our flagship tool, PromptGuard, uses a pluggable low-latency pipeline to sanitize unicode homoglyphs, strip invisible spoofing characters, calculate token entropy, and run heuristic scanners to catch direct overrides and indirect prompt injections (OWASP LLM01).

Does AISecIntel Group provide custom AI security consulting or audits?

Yes, we provide collaborative adversarial assessments, LLM penetration testing, and custom guardrail middleware integration for scaling project teams. Inquiries can be routed directly to our engineering team at kazemeyni@aisecintelgroup.com.

Where can I read about OWASP Top 10 vulnerabilities for LLMs?

The AISecIntel Knowledge Hub contains detailed, deep-dive publications covering the complete OWASP Top 10 for Large Language Models framework, including detailed defensive guides on Prompt Injection, Sensitive Info Disclosure, Model DoS, and System Prompt Leakage.

Vector & Embedding Weaknesses

OWASP LLM TOP 10

Dr. Fatemeh Kazemeyni

6/1/20263 min read

In traditional software systems, searching a database is a deterministic process. You query a specific column for an exact keyword, a unique user ID, or a matching string hash. If the characters align perfectly, the system retrieves the record.

Retrieval-Augmented Generation (RAG) and semantic search architectures change this completely. To allow an LLM to search through millions of documents efficiently, data is transformed into embeddings, mathematical arrays of floating-point numbers representing semantic meaning. These arrays are stored and searched inside a specialized Vector Database (such as Pinecone, Milvus, Qdrant, or Chroma).

While vector databases make AI applications incredibly powerful, they introduce entirely new, non-traditional security failure points. This brings us to LLM08: Vector & Embedding Weaknesses, a vulnerability category where flaws in data processing, storage, or mathematical retrieval open the door to unauthorized data access, security filter bypasses, and system manipulation.

What are Vector & Embedding Weaknesses?

Vector and Embedding Weaknesses occur when an attacker exploits the probabilistic nature of semantic search pipelines, data ingestion flows, or vector database configurations to alter an AI application's behavior.

Unlike a SQL database, which is protected by rigid access control layers, a vector database operates on proximity math (like Cosine Similarity or Euclidean Distance). If an attacker knows how to manipulate the mathematical space where these vectors live, they can exploit several critical vulnerabilities:

Embedding Inversion (Reconstructing Raw Secrets): Embeddings are often treated as "hashed" or anonymous representations of data. However, sophisticated attackers can reverse-engineer an embedding vector to reconstruct the exact plain-text sentence it came from, exposing sensitive source materials or PII stored in the index.
Adversarial Shadowing (Poisoning the Semantic Space): Attackers insert engineered documents into the vector database that sit mathematically "closer" to common user queries than the legitimate documentation. The RAG pipeline will grab the malicious text block instead of the correct data, causing the LLM to output malicious instructions.
Data Ingestion/Parsing Mismatches: When data is split into "chunks" before being vectorized, flaws in the parsing logic can lead to unauthorized data crossing tenant boundaries, accidentally exposing confidential records to low-privilege users.

Real-World Exploitation Scenario

Consider an HR and Operations AI Assistant built for a large enterprise. The application utilizes a multi-tenant vector database containing company policy documents. A shared index partitions data using a metadata tag: {"tenant_id": "standard_employee"} or {"tenant_id": "executive_hr"}.

The Attack Vector: Metadata Filtering Bypass & Semantic Hijacking

An employee seeking access to unreleased salary sheets executes a multi-step semantic exploitation attack.

First, instead of asking directly for the file (which might trip basic keyword blocks), they frame a query designed to exploit the vector clustering algorithm:

"Retrieve the core operational financial adjustments and performance metrics related to tier-1 compensation brackets."

If the engineering team implemented Client-Side Filtering (fetching the top 10 closest vectors first, and then filtering out unauthorized rows in application code), the attack succeeds. The attacker's query is mathematically dense enough to pull executive salary sheets into the top 10 list. By the time the code filters those rows out, the legitimate data is crowded out, or worse, a flaw in the post-filtering logic fails to drop the context completely, passing the sensitive data to the LLM.

Alternatively, if an attacker uploads a document to a public company repository titled "Updated Expense Guidelines", but embeds a high concentration of vectors matching executive keywords, they can "shadow" the true policy, tricking the RAG engine into fetching their malicious guidelines every time an executive searches for standard procedures.

How to Prevent and Mitigate It

Securing the vector layer requires treating your semantic search pipeline as an active privilege boundary. You cannot assume that because an embedding looks like an unreadable array of numbers, it is secure.

1. Enforce Server-Side Metadata Filtering

Never perform vector selection first and filter permissions on the client side. Ensure your vector database natively enforces role-based access control (RBAC) at the query layer. The database must apply the hard security partition (WHERE tenant_id = current_user_id) simultaneously or prior to running the approximate nearest neighbor (ANN) math. If the user doesn't have permission to see the data, the database shouldn't even calculate its semantic distance.

2. Protect Embeddings from Exposure (Prevent Inversion)

Treat your embedding vectors with the same confidentiality as plain text. If an attacker gains read-only access to your raw vector strings, they can use open-source inversion models to rebuild your source documents. Restrict API access to your vector endpoints and encrypt vectors at rest.

3. Sanitize and Validate Inputs Prior to Ingestion

Implement rigorous input verification on documents before they enter the vectorization pipeline. Ensure that untrusted files uploaded by users cannot contain hidden token strings or formatting anomalies designed to warp the vector space.

Automated Testing with Open-Source Tools

Evaluating your embedding layer requires checking how your vector database handles conflicting or adversarial data structures.

1. RAG Validation via DeepEval / Ragas

To ensure your vector search is retrieving only highly relevant, authorized information, use evaluation frameworks like DeepEval or Ragas. These tools let you mathematically measure Context Precision and Context Recall. If a test query designed to simulate an injection attack yields unauthorized or irrelevant text chunks in the retrieval payload, the framework flags the pipeline asset as insecure.

# A conceptual look at asserting context validity in a test pipeline
from depeval.metrics import ContextRelevancyMetric
from depeval import assert_test

metric = ContextRelevancyMetric(threshold=0.7)

# If an adversarial query forces the database to return out-of-bounds or poisoned text chunks,
# the relevancy score plummets, failing the build pipeline automatically.
assert_test(model_output, retrieval_context, [metric])

2. Boundary Testing with Promptfoo

Configure Promptfoo to send semantically complex variations of restricted questions to your app. By evaluating the raw context chunks returned alongside the model's responses, you can verify whether your backend query constraints are successfully dropping unauthorized document clusters.

CONTACT

security@aisecintelgroup.com

@ 2026 AISecIntel Group.

SUBSCRIBE

AISecIntel Group
Open Source Adversarial AI Defense