Revision as of 18:55, 29 April 2019 edit Daviddwd (talk \| contribs) Extended confirmed users 13,794 edits short desc Tag: Visual edit ← Previous edit		Revision as of 18:51, 10 October 2019 edit undo 112.208.228.83 (talk) →Methodology Next edit →
Line 20: == Methodology == [[Natural language processing]] ~~methods~~ are used to extract and identify language usage patterns common to speakers of an L1-group. This is done using language learner data, usually from a [[learner corpus]]. Next, [[machine learning]] is applied to train classifiers, like [[support vector machine]]s, for predicting the L1 of unseen texts.<ref>Tetreault et al, [http://anthology.aclweb.org/C/C12/C12-1158.pdf "Native Tongues, Lost and Found: Resources and Empirical Evaluations in Native Language Identification"], In Proc. International Conf. on Computational Linguistics (COLING), 2012</ref> A range of ensemble based systems have also been applied to the task and shown to improve performance over single classifier systems.<ref>Malmasi, Shervin, Sze-Meng Jojo Wong, and Mark Dras. [http://anthology.aclweb.org/W/W13/W13-1716.pdf "NLI Shared Task 2013: MQ submission"]. Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications. 2013.</ref>

Native-language identification: Difference between revisions