aquiffoo's picture

In a Training Loop 🔄

aquiffoo

aquiffoo

·

https://aquiffoo.is-a.dev/

AI & ML interests

thanks for everything.

Recent Activity

liked a model 2 days ago

zai-org/GLM-5.1

liked a model 4 days ago

k2-fsa/OmniVoice

updated a collection 5 days ago

View all activity

Organizations

upvoted a paper 19 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 21 days ago • 66

upvoted an article about 2 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

Feb 20

•

501

upvoted a paper about 2 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 136

upvoted a collection about 2 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.47k

upvoted a paper about 2 months ago

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

Paper • 2602.04649 • Published Feb 4 • 12

upvoted a paper 2 months ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 179

upvoted a paper 3 months ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

upvoted 2 collections 3 months ago

Jamba2

Jamba2 is a highly-efficient open source family of language models built for maximum reliability and steerability in the enterprise. • 3 items • Updated Jan 8 • 5

💧 LFM2.5

Collection of post-trained and base LFM2.5 models. • 30 items • Updated 1 day ago • 123

upvoted a changelog 4 months ago

Hugging Face Changelog

HuggingChat for Docs

Dec 12, 2025

• 120

upvoted a collection 4 months ago

i3-Series

Note: The models are listed in the default order set by Hugging Face, so the latest model appears at the botSeries • 6 items • Updated Mar 2 • 2

upvoted a paper 5 months ago

MemMamba: Rethinking Memory Patterns in State Space Model

Paper • 2510.03279 • Published Sep 28, 2025 • 74

upvoted a changelog 6 months ago

Hugging Face Changelog

Cleaner Collection URLs

Oct 23, 2025

• 82

upvoted a paper 6 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513

upvoted a changelog 7 months ago

Hugging Face Changelog

Repositories total file size is now displayed

Sep 18, 2025

• 175

upvoted a collection 7 months ago

Ling 2.0

11 items • Updated 6 days ago • 37

upvoted a paper 7 months ago

Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Paper • 2507.17702 • Published Jul 23, 2025 • 6

upvoted a collection 11 months ago

Granite 4.0 Language Models

Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated 7 days ago • 216