Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
MikeDoesΒ 
posted an update 10 days ago
Post
598
Ai4Privacy has been working on this for the past year. πŸ™

Today we're releasing the PII Masking 2M Series, the world's largest open source privacy masking dataset. (Again. πŸš€πŸš€)

πŸ”’ 2M+ synthetic examples
🌍 32 locales across Europe
🏷️ 98 entity types
πŸ₯πŸ’¬πŸ¦πŸ’ΌπŸ“ 5 industry verticals: Health, Finance, Digital, Work, Location
βœ… 1M+ entries freely available on Hugging Face

Every example is 100% synthetic. No real personal data. Built so you can train and evaluate PII detection models without the legal headaches. πŸ”’

Thank you for 15,000,000+ downloads across our datasets, models, and libraries. This one's for you. ❀️


hashtag#privacy hashtag#ai hashtag#opensource hashtag#nlp hashtag#gdpr hashtag#pii hashtag#huggingface hashtag#machinelearning
In this post