Inference time

by avifaiza - opened Apr 20, 2024

Apr 20, 2024

Comparing inference time between deberta-v3-base and say bert-base-cased, I am getting that deberta is significantly slower. This is of course on the same set and same machine. (~50% increase).

model_name_or_path_bert-base-cased/dev_prediction_time.txt, Total prediction time = 73.53624510765076
model_name_or_path_microsoft/deberta-v3-base/dev_prediction_time.txt, Total prediction time = 111.61959910392761

Is that expected, or am I doing something wrong?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment