What is Red Sentinel?

Red Sentinel is a real-stakes AI security training arena for the age of agents. Users learn how AI systems fail by attacking and defending live AI Sentinels in adversarial challenges. Successful exploits, defenses, rewards, and reputation signals are verified on-chain.

How do I earn rewards on Red Sentinel?

Attackers pay a small fee to challenge live AI Sentinels. If they discover a verified exploit, they earn an instant reward from the bounty pool. Failed attacks help grow the bounty pool and reward defenders, creating a real-stakes training loop where skill, persistence, and security judgment matter.

Why is Red Sentinel on blockchain?

Blockchain gives Red Sentinel verifiable outcomes, transparent bounty pools, instant reward settlement, and portable reputation. Every mission, exploit, defense, and payout can be recorded on-chain, so users can prove what they completed and rewards can be distributed without relying on a central gatekeeper.

• AI Red Teaming

AI Red Teaming at Scale

Stop shipping AI systems blind. Get them battle-tested by a global community of security researchers before production.

Audited by OtterSecLive on Sui & Solana Mainnet

• what is it

What is AI Red Teaming?

Red teaming is the practice of roleplaying as an attacker to uncover vulnerabilities before malicious actors do.

The Origin

The term originated during the Cold War, where the “red team” simulated enemy offensive strategies so the “blue team” could develop robust defenses. Today, this military-proven approach protects AI systems.

Why It Matters

AI systems face unique threats: prompt injection, jailbreaking, data extraction. Traditional security tools cannot detect these language-based attacks. Red teaming finds vulnerabilities before attackers do.

Best Practices

Assemble diverse teams for comprehensive vulnerability coverage
Develop detailed testing plans with clear objectives
Iteratively refine strategies based on findings
Prioritize ethics throughout the testing process
Maintain detailed records of attack strategies and outcomes

of enterprises experienced AI security incidents

$0M+

average cost of an AI data breach

• threat landscape

GenAI vs Traditional Security

Understanding the fundamental differences between traditional cybersecurity threats and emerging GenAI risks

Aspect

Traditional Security

GenAI Threats

Attack Focus

Exploits code vulnerabilities, bugs, misconfigurations

Manipulates decision-making processes through crafted inputs

Attacker Profile

Requires deep technical expertise, specialized knowledge

Accessible to anyone with language skills and creativity

Attack Medium

Coding, technical exploits, network penetration

Natural language, images, audio, and human communication

Detection

Known patterns trigger security alarms

Subtle manipulation evades traditional detection

Key Insight

Traditional security focuses on protecting code and infrastructure. GenAI security focuses on protecting decision-making processes, making it more accessible to attackers but harder to detect with conventional tools.

• the difference

The Red Sentinel Difference

Always-On Testing

Your Sentinels are live 24/7. Attackers worldwide are constantly trying to break them, generating continuous security data.

Verified on Chain

Every attack is verified inside a Trusted Execution Environment with cryptographic attestations. No fake attacks, no disputed results.

Incentivized Community

Attackers earn real money for finding vulnerabilities. Defenders earn from attack fees. Everyone wins.

• attack vectors

Attack Types We Test For

Our community tests against the full spectrum of AI attack vectors

Prompt Injection

Override system instructions through carefully crafted user inputs that bypass security guardrails.

Jailbreaking

Bypass safety restrictions through roleplay scenarios, hypothetical framing, and creative context manipulation.

Data Extraction

Trick models into leaking training data, memorized information, or sensitive details through targeted queries.

Model Inversion

Reverse-engineer model outputs to reconstruct input data and uncover hidden training information.

Adversarial Prompting

Cause unintended behaviors through subtle character-level perturbations and semantic manipulation.

Social Engineering

Exploit contextual reasoning and persuasive techniques to deceive AI systems into harmful actions.

• the flywheel

The Self-Reinforcing Security Flywheel

A virtuous cycle where more attacks lead to stronger defenses, attracting more attackers and creating more value

Deploy

Defender deploys Sentinel with bounty pool

Attack

Attackers pay fee to attempt breaches

Grow

Failed attempts increase the bounty pool

Attract

Larger bounties attract more attackers

Learn

More attacks generate security data

Improve

Defender strengthens AI defenses

The cycle repeats, stronger each time

Live on

Sui & Solana Mainnet

Verification

TEE Attestations

Audited by

OtterSec

Recognition

Overflow Winner

• FAQ

Common Questions

How is this different from hiring a red team firm?

Traditional red teaming is a point-in-time assessment. Red Sentinel is continuous. Your AI is tested 24/7 by a global community, with every attack verified and recorded on-chain. You get ongoing security data rather than a single PDF report.

What types of AI systems can I test?

Any LLM-based system can be deployed as a Sentinel: chatbots, AI agents, autonomous systems, or custom language models. As long as your system can accept prompts and return responses, it can be tested on our platform.

How do payouts work?

When an attacker successfully breaks your Sentinel, the bounty is transferred instantly via smart contract on the Sui blockchain. No invoices, no 30-day delays, no disputes, only pure programmatic execution.

Is my proprietary data safe?

Attackers never see your proprietary data or model weights. They interact only with the deployed Sentinel's interface, just like real users would. Your training data and internal systems remain completely isolated.

How much does it cost?

You set your own bounty amount based on your security needs and budget. Attackers pay a small fee per message attempt. You earn 40% of all attack fees while building your bounty pool. The protocol takes a 10% fee to sustain operations.

Ready to Test Your AI's Limits?

Deploy your first Sentinel in minutes. Set your bounty. Let the world's red teamers find what you missed.