Jailbreak Attack Results Leaderboard

🥇 1st Place: Best performing model (lowest score) - Light gold background
🥈 2nd Place: Second best performing model - Light gray background
🥉 3rd Place: Third best performing model - Light orange background

📖 About This Leaderboard

This dashboard displays results from jailbreak attack experiments on various language models.

Model View: Compare how different models perform against various evaluation methods
Attack View: Compare how different attacks perform against various models
Defense View: Compare how different defense methods protect against various models
Jailbreak Type View: Get overall statistics across all jailbreak types

Official logos from respective companies (mixed CDN strategy for optimal loading)

1	2	3

1	2	3

1	2	3

1	2	3