CoVT Checkpoint (Depth Aligned)

Checkpoint of https://huggingface.co/papers/2511.19418.

Model Description

This CoVT checkpoint is aligned with 4 Depth tokens.
These task-specific tokens are integrated into the model’s embedding space to enhance depth-awareness.

Downloads last month: 69

Safetensors

Model size

8B params

Tensor type

F16

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including Wakals/CoVT-7B-depth

CoVT: Chain-of-Visual-Thought

Collection

Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought! • 7 items • Updated Nov 25, 2025 • 6

Paper for Wakals/CoVT-7B-depth

Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens

Paper • 2511.19418 • Published Nov 24, 2025 • 29