Instructions to use microsoft/deberta-v3-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/deberta-v3-base with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="microsoft/deberta-v3-base")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("microsoft/deberta-v3-base", dtype="auto") - Inference
- Notebooks
- Google Colab
- Kaggle
Inference time
#9
by avifaiza - opened
Comparing inference time between deberta-v3-base and say bert-base-cased, I am getting that deberta is significantly slower. This is of course on the same set and same machine. (~50% increase).
model_name_or_path_bert-base-cased/dev_prediction_time.txt, Total prediction time = 73.53624510765076
model_name_or_path_microsoft/deberta-v3-base/dev_prediction_time.txt, Total prediction time = 111.61959910392761
Is that expected, or am I doing something wrong?