Why and find out how to convert mT5 right into a regression metric for numerical prediction

My undergraduate honour’s dissertation was a Pure Language Processing (NLP) analysis undertaking. It centered on multilingual textual content era in under-represented languages. As a result of present metrics carried out very poorly on evaluating outputs of fashions educated on the dataset I used to be utilizing, I wanted to coach a discovered regression metric.
Regression can be helpful for a lot of textual duties, akin to:
Sentiment evaluation: Predict the power of constructive or destructive sentiment as a substitute of straightforward binary classification.Writing high quality estimation: Predict how excessive the standard of an article is.
For my use case, I wanted the mannequin to attain how good one other mannequin’s prediction was for a given process. My dataset’s rows consisted of the textual enter and a label, 0 (unhealthy prediction) or 1 (good prediction).
Enter: TextLabel: 0 or 1The process: Predict a numerical chance between 0 and 1
However transformer-based fashions are normally used for era duties. Why would you employ a pre-trained LM for…