Any-to-Any
Transformers
Safetensors
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
Instructions to use openbmb/MiniCPM-o-2_6 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use openbmb/MiniCPM-o-2_6 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("openbmb/MiniCPM-o-2_6", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
| { | |
| "image_processor_type": "MiniCPMVImageProcessor", | |
| "auto_map": { | |
| "AutoProcessor": "processing_minicpmo.MiniCPMOProcessor", | |
| "AutoImageProcessor": "image_processing_minicpmv.MiniCPMVImageProcessor" | |
| }, | |
| "processor_class": "MiniCPMOProcessor", | |
| "max_slice_nums": 9, | |
| "scale_resolution": 448, | |
| "patch_size": 14, | |
| "use_image_id": true, | |
| "image_feature_size": 64, | |
| "im_start": "<image>", | |
| "im_end": "</image>", | |
| "slice_start": "<slice>", | |
| "slice_end": "</slice>", | |
| "unk": "<unk>", | |
| "im_id_start": "<image_id>", | |
| "im_id_end": "</image_id>", | |
| "slice_mode": true, | |
| "norm_mean": [0.5, 0.5, 0.5], | |
| "norm_std": [0.5, 0.5, 0.5], | |
| "version": 2.6 | |
| } |