Revision as of 19:07, 9 March 2025 edit Pikamander2 (talk \| contribs) Extended confirmed users, Rollbackers 9,298 edits mNo edit summary ← Previous edit		Revision as of 13:57, 25 May 2025 edit undo OAbot (talk \| contribs) Bots 643,717 edits m Open access bot: url-access updated in citation with #oabot. Next edit →
Line 9: CLIR systems have improved so much that the most accurate multi-lingual and cross-lingual [[adhoc information retrieval]] systems today are nearly as effective as monolingual systems.<ref>Oard, Douglas. "Multilingual Information Access." ''Understanding Information Retrieval Systems''(2011): 373-80. Web.</ref> Other related information access tasks, such as [[media monitoring]], [[information filtering]] and routing, [[sentiment analysis]], and [[information extraction]] require more sophisticated models and typically more processing and analysis of the information items of interest. Much of that processing needs to be aware of the specifics of the target languages it is deployed in. Mostly, the various mechanisms of [[Linguistic typology\|variation in human language]] pose coverage challenges for information retrieval systems: texts in a collection may treat a topic of interest but use terms or expressions which do not match the expression of information need given by the user. This can be true even in a mono-lingual case, but this is especially true in cross-lingual information retrieval, where users may know the target language only to some extent. The benefits of CLIR technology for users with poor to moderate competence in the target language has been found to be greater than for those who are fluent.<ref>{{cite journal\|last1=Airio\|first1=Eija\|title=Who benefits from CLIR in web retrieval?\|journal=Journal of Documentation\|date=2008\|volume=64\|issue=5\|pages=760–778\|doi=10.1108/00220410810899754\|url=http://tampub.uta.fi/handle/10024/65923\|url-access=subscription}}</ref> Specific technologies in place for CLIR services include [[Morphology (linguistics)\|morphological analysis]] to handle [[inflection]], decompounding or compound splitting to handle [[Compound (linguistics)\|compound terms]], and translations mechanisms to translate a query from one language to another. The first workshop on CLIR was held in Zürich during the SIGIR-96 conference.<ref>The proceedings of this workshop can be found in the book ''Cross-Language Information Retrieval'' (Grefenstette, ed; Kluwer, 1998) {{ISBN\|0-7923-8122-X}}.</ref> Workshops have been held yearly since 2000 at the meetings of the [[Cross Language Evaluation Forum]] (CLEF). Researchers also convene at the annual [[Text Retrieval Conference]] (TREC) to discuss their findings regarding different systems and methods of information retrieval, and the conference has served as a point of reference for the CLIR subfield.<ref>Olvera-Lobo, María-Dolores. "Cross-Language Information Retrieval on the Web." ''Handbook of Research on Social Dimensions of Semantic Technologies and Web Services''(n.d.): 704-19. Web.</ref> Early CLIR experiments were conducted at TREC-6, held at the [[National Institute of Standards and Technology]] (NIST) on November 19–21, 1997.<ref>{{Cite journal \|last=Vorhees \|first=Ellen M. \|last2=Harman \|first2=Donna \|year=1999 \|title=Overview of the Sixth Text REtrieval Conference (TREC-6) \|url=https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=151399 \|journal=[[Information Processing and Management]]}}</ref>

Cross-language information retrieval: Difference between revisions