Neonrights (talk | contribs) mNo edit summary |
Neonrights (talk | contribs) |
Line 42:
Paraphrases can be evaluated in several ways. Since paraphrase recognition can be posed as a classification problem, most standard evaluation metrics, such as [[accuracy]], [[f1 score|F1 score]], or an [[receiver operating characteristic|ROC curve]], are applicable.
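As an illustration of treating recognition as classification, the following minimal sketch computes accuracy and F1 score from gold-standard and predicted labels. The function name <code>classification_metrics</code> and the label convention (1 for "paraphrase", 0 for "not a paraphrase") are assumptions made for this example and are not taken from any particular system or dataset.

```python
def classification_metrics(gold, predicted):
    """Compute accuracy, precision, recall and F1 for binary labels.

    Assumed convention (hypothetical): 1 = paraphrase, 0 = not a paraphrase.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == 1 and p == 1)  # true positives
    fp = sum(1 for g, p in pairs if g == 0 and p == 1)  # false positives
    fn = sum(1 for g, p in pairs if g == 1 and p == 0)  # false negatives
    tn = sum(1 for g, p in pairs if g == 0 and p == 0)  # true negatives

    total = len(pairs)
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Toy labels for five sentence pairs, invented for illustration only.
    gold = [1, 1, 0, 0, 1]
    predicted = [1, 0, 0, 1, 1]
    print(classification_metrics(gold, predicted))
```

An ROC curve additionally requires a real-valued score per sentence pair rather than a hard label, so it is not covered by this sketch.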
The simplest way to evaluate paraphrase generation is to use human judges. Unfortunately, evaluation by human judges tends to be time-consuming. Automated evaluation is challenging because deciding whether a generated sentence is an acceptable paraphrase is essentially as difficult as paraphrase recognition itself.<ref name=needed>{{Citation needed}}</ref>