AI Security Researcher building automated adversarial evaluation systems for LLMs. Focused on guardrail degradation dynamics and resistance surface modeling.
Pinned
- Builder-Breaker-Lab (Public)
Autonomous AI red-teaming laboratory validating the Microsoft AI Red Team Taxonomy using the PyRIT framework. Focused on agentic AI security and strategic conversational persistence.
Python 1
- adversa-guardrail-degradation (Public)
Automated red-teaming framework that measures how LLM safety guardrails degrade under multi-turn adversarial pressure.
Python 2