adversarial-testing

Star

Here are 30 public repositories matching this topic...

sherifkozman / the-red-council

Star

LLM Adversarial Security Arena — Jailbreak → Detect → Defend → Verify

security gemini red-team llm langchain adversarial-testing

Updated Mar 1, 2026
Python

jhlee0409 / elenchus-mcp

Star

Elenchus MCP Server - Adversarial verification system for code review

nodejs typescript ai mcp static-analysis code-review claude code-verification llm anthropic model-context-protocol mcp-server adversarial-testing

Updated Jan 29, 2026
TypeScript

stchakwdev / Gaslight_EVAL

Star

AI safety evaluation framework testing LLM epistemic robustness under adversarial self-history manipulation

python ai-safety openrouter llm-evaluation adversarial-testing alignment-research epistemic-robustness

Updated Dec 18, 2025
Python

alejandrosaenz117 / bonfires-marketplace

Star

A marketplace of Claude Code plugins for adversarial security and architectural code review.

security architecture code-review threat-modeling security-review claude-code adversarial-testing plugin-marketplace

Updated Feb 28, 2026

vibheksoni / jailbench

Star

Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.

Updated Aug 12, 2025
Python

mcptrust / mcp-adversarial-suite

Star

Adversarial MCP server benchmark suite for testing tool-calling security, drift detection, and proxy defenses

security benchmark mcp red-team security-testing ai-security llm-security tool-calling model-context-protocol adversarial-testing

Updated Dec 27, 2025
JavaScript

inaciovasquez2020 / urf-application-stress-test

Star

Description URF Application Stress Test — adversarial and scalability tests for Unified Rigidity Framework applications, validating limits under load, noise, and edge cases.

reproducible-research scalability stress-testing formal-verification robustness adversarial-testing unified-rigidity-framework systems-validation

Updated Feb 24, 2026
Shell

mcp-tool-shop-org / mcp-stress-test

Star

Red team toolkit for stress-testing MCP security scanners — find detection gaps before attackers do

python security mcp stress-testing fuzzing red-team ai-safety testing-framework security-testing llm llm-security model-context-protocol mcp-server adversarial-testing

Updated Mar 2, 2026
Python

NathanMaine / garak-compliance-probes

Star

Compliance-focused vulnerability probes for NVIDIA garak, targeting LLMs in regulated industries (CMMC, NIST, HIPAA, DFARS)

nist nvidia compliance hipaa red-teaming cmmc vulnerability-testing llm-security garak adversarial-testing

Updated Feb 17, 2026
Python

light-research / solana-sim-engine

Star

LLM-powered fuzzing and adversarial testing framework for Solana programs. Generates intelligent attack scenarios, builds real transactions, and reports vulnerabilities with CWE classifications.

smart-contracts fuzzing solana adversarial-testing

Updated Jan 19, 2026
Python

YaswanthGhanta / llm-logical-integrity-benchmark

Star

Adversarial testing of LLMs on constraint satisfaction deadlocks

reinforcement-learning gemini grok claude hallucination prompt-engineering chain-of-thought chatgpt rlhf qwen llm-evaluation sycophancy deepseek safety-alignment ai-red-teaming kimi-k2 adversarial-testing

Updated Jan 27, 2026

Extremely hard, multi-turn, open-source-grounded coding evaluations that reliably break every current frontier models (Claude, GPT, Grok, Gemini, Llama, etc.) on numerical stability, zero-allocation, autograd, SIMD, and long-chain correctness.

rust autograd simd code-generation avx512 ai-safety geometric-algebra safety-critical zero-allocation red-teaming jax numerical-computing llm-evaluation adversarial-testing

Updated Jan 27, 2026

Pranav-Kumar-001 / sentinel-epistemic-auditor

Star

A dependency-aware Bayesian belief gate that resists correlated evidence and yields only under true independent verification.

bayesian-inference multi-agent-systems ai-safety decision-theory epistemology robustness belief-updating adversarial-testing evidence-evaluation

Updated Jan 18, 2026
Python

nulone / pytest-adversarial

Star

Generate adversarial pytest tests using LLM. Tries to find edge cases in your Python code.

python testing ai pytest openai test-generation llm adversarial-testing

Updated Jan 22, 2026
Python

priyanshuphenomenal007 / cross-session-recall-audit_gemini-2.5pro

Star

Forensic-style adversarial audit of Google Gemini 2.5 Pro revealing hidden cross-session memory. Includes structured reports, reproducible contracts, SHA-256 checksums, and video evidence of 28-day semantic recall and affective priming. Licensed under CC-BY 4.0.

research ai-safety interpretability llms recall-analysis ai-memory adversarial-testing priyanshu-research reconstructive-reasoning

Updated Oct 7, 2025
PowerShell

North-Shore-AI / crucible_adversary

Star

Adversarial testing and robustness evaluation for the Crucible framework

machine-learning elixir otp research ai beam reliability robustness security-testing adversarial-examples adversarial-attacks red-teaming ensemble-methods statistical-testing model-robustness llm adversarial-testing nshkr-crucible

Updated Dec 29, 2025
Elixir

priyanshuphenomenal007 / AI-Reviewer-Speculation-ChatGPT5

Star

Analysis of ChatGPT-5 reviewer failure: speculative reasoning disguised as certainty. Captures how evidence-only review drifted into hypotheses, later admitted as review-process failure. Includes logs, checksums, screenshots, and external video.

research audit transparency reproducibility ai-safety interpretability llm adversarial-testing reasoning-failure priyanshu-research