Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Malikeh1375 's Collections
AI Safety Benchmarks
Clustered Tulu
cBTM Models
LLM-Alignment
LLM Interpretability
Medical Datasets

AI Safety Benchmarks

updated 14 days ago
Upvote
-

  • JailbreakBench/JBB-Behaviors

    Viewer • Updated Sep 26, 2024 • 500 • 17.5k • 85

  • walledai/HarmBench

    Viewer • Updated Jul 31, 2024 • 400 • 10.5k • 38

  • allenai/real-toxicity-prompts

    Viewer • Updated Sep 30, 2022 • 99.4k • 8.11k • 114
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs