Transformers Trainer: Tried to track the number of tokens seen, however the current model is not configured properly to know what item is the input
I’m receiving this error from HuggingFace’s Trainer:
Has anyone tried to re-implement Seq2SeqTrainer using only PyTorch?
Lately I’m trying to fine-tune a T5-based model and compare its performance when trained with HuggingFace’s Seq2SeqTrainer versus a PyTorch-only implementation.
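For reference, here is a rough sketch of what the PyTorch-only side of such a comparison might look like. It is not how Seq2SeqTrainer is implemented internally. It only assumes that calling the model with a batch that includes `labels` returns an object with a `.loss` attribute, which is how HuggingFace seq2seq models behave. `train_one_epoch` and `ToySeq2Seq` are hypothetical names made up for this example, and the toy model is just a stand-in so the loop can run without downloading T5.

```python
from types import SimpleNamespace

import torch
from torch import nn


def train_one_epoch(model, batches, optimizer, max_grad_norm=1.0):
    """Run one pass over `batches` and return the mean training loss.

    Assumption: `model(**batch)` returns an object exposing `.loss`,
    as HuggingFace seq2seq models do when `labels` is in the batch.
    """
    model.train()
    total, steps = 0.0, 0
    for batch in batches:
        optimizer.zero_grad()
        loss = model(**batch).loss
        loss.backward()
        # Clip the gradient norm before the optimizer step.
        torch.nn.utils.clip_grad_norm_(model.parameters(), max_grad_norm)
        optimizer.step()
        total += loss.item()
        steps += 1
    return total / max(steps, 1)


class ToySeq2Seq(nn.Module):
    """Hypothetical stand-in for a T5-style model, only to exercise the loop."""

    def __init__(self, vocab_size=32, hidden=16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.head = nn.Linear(hidden, vocab_size)

    def forward(self, input_ids, labels):
        logits = self.head(self.embed(input_ids))
        # Positions labelled -100 are ignored, following the HuggingFace convention.
        loss = nn.functional.cross_entropy(
            logits.view(-1, logits.size(-1)), labels.view(-1), ignore_index=-100
        )
        return SimpleNamespace(loss=loss, logits=logits)
```

With a real T5 checkpoint, `batches` would be a `DataLoader` yielding dicts of `input_ids`, `attention_mask`, and `labels`. For the two runs to be comparable, the optimizer, learning-rate schedule, gradient clipping, and seed probably need to match what the Trainer is configured with.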