BodyMaps

university

https://www.zongweiz.com/dataset

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Saint-lsy authored a paper about 1 month ago

OmniBrainBench: A Comprehensive Multimodal Benchmark for Brain Imaging Analysis Across Multi-stage Clinical Tasks

Saint-lsy authored a paper about 1 month ago

Unveiling and Bridging the Functional Perception Gap in MLLMs: Atomic Visual Alignment and Hierarchical Evaluation via PET-Bench

Saint-lsy authored a paper about 1 month ago

MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning

View all activity

authored 3 papers about 1 month ago

OmniBrainBench: A Comprehensive Multimodal Benchmark for Brain Imaging Analysis Across Multi-stage Clinical Tasks

Paper • 2511.00846 • Published Nov 2, 2025 • 1

Unveiling and Bridging the Functional Perception Gap in MLLMs: Atomic Visual Alignment and Hierarchical Evaluation via PET-Bench

Paper • 2601.02737 • Published Jan 6

MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning

Paper • 2602.03320 • Published Feb 3 • 2

submitted a paper to Daily Papers about 1 month ago

MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning

Paper • 2602.03320 • Published Feb 3 • 2

authored a paper about 2 months ago

VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

Paper • 2601.10124 • Published Jan 15 • 4

authored a paper 3 months ago

Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension

Paper • 2512.02791 • Published Dec 2, 2025 • 1

authored a paper 4 months ago

Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models

Paper • 2511.11910 • Published Nov 14, 2025 • 35

authored 2 papers 5 months ago

Large Language Models Do NOT Really Know What They Don't Know

Paper • 2510.09033 • Published Oct 10, 2025 • 17

Scaling Language-Centric Omnimodal Representation Learning

Paper • 2510.11693 • Published Oct 13, 2025 • 104

authored 4 papers 5 months ago

Medical Reasoning in the Era of LLMs: A Systematic Review of Enhancement Techniques and Applications

Paper • 2508.00669 • Published Aug 1, 2025

Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation

Paper • 2405.06948 • Published May 11, 2024

Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion

Paper • 2501.16679 • Published Jan 28, 2025

EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis

Paper • 2505.23601 • Published May 29, 2025

authored 2 papers 5 months ago

MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs

Paper • 2504.06897 • Published Apr 9, 2025 • 1

POSTER++: A simpler and stronger facial expression recognition network

Paper • 2301.12149 • Published Jan 28, 2023 • 1

authored a paper 7 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Paper • 2507.22607 • Published Jul 30, 2025 • 47

authored 3 papers 8 months ago

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Paper • 2505.13032 • Published May 19, 2025 • 4

CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following

Paper • 2506.12285 • Published Jun 14, 2025 • 54

$μ^2$Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation

Paper • 2507.00316 • Published Jun 30, 2025 • 15

authored a paper 9 months ago

Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Paper • 2504.13816 • Published Apr 18, 2025 • 18