U4RASD/NeoAraBERT
Feature Extraction • 0.3B
NeoAraBERT: A Modern Foundation Model for Arabic Embeddings with Diacritics-Aware Tokenization and POS-Targeted Masking
Note: This is the NeoAraBERT_Mix checkpoint, our best-performing checkpoint overall.