view article Article Does Depth Actually Help Reasoning? A Tiny Experiment on 2× T4 wop • 11 days ago • 3
MiniCPM-V 4.6 Collection A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone • 11 items • Updated 17 days ago • 10
🤏 Smol-Data Collection Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated Mar 2 • 12
Claude 4.5 Opus Collection Distilled models and datasets for Claude 4.5 Opus. • 12 items • Updated Apr 12 • 35
PockEngine: Sparse and Efficient Fine-tuning in a Pocket Paper • 2310.17752 • Published Oct 26, 2023 • 15