Translation events in cross-language information retrieval: Lexical ambiguity, lexical holes, vocabulary mismatch, and correct translations
Date of Award
2003
Degree Type
Dissertation
Degree Name
Doctor of Philosophy (PhD)
Department
Information Science and Technology
Advisor(s)
Elizabeth D. Liddy
Keywords
Translation, Cross-language information retrieval, Lexical ambiguity, Vocabulary mismatch
Subject Categories
Computer Sciences | Library and Information Science | Physical Sciences and Mathematics | Social and Behavioral Sciences
Abstract
Cross-Language Information Retrieval (CLIR) systems enable users to formulate queries in their native language to retrieve documents in foreign languages. Because queries and documents in CLIR do not necessarily share the same language, translation is needed before matching can take place. This translation step tends to cause a reduction in the retrieval performance of CLIR as compared to monolingual information retrieval.
The prevailing CLIR approach and the focus of this study is query translation. The translation of queries is inherently difficult due to the lack of a one-to-one mapping of a lexical item and its meaning, which creates lexical ambiguity. This, and other translation problems, result in translation errors which impact CLIR performance.
To understand the events occurring in cross-language retrieval query translation and the relation of these events to retrieval performance, the study explored the following research questions: (1) What kinds of translation events affect cross-language retrieval? (2) In what way does the presence of certain translation events in query translation affect retrieval performance?
The study followed a two-phase multi-method approach. In phase one, a taxonomy of translation events was created through content analysis of queries and their translations in combination with an examination of the literature. In the second and final phase, a subset of the test queries was coded using the taxonomy resulting from phase one. These queries were then used in information retrieval experimentation to assess the impact of the translation events on retrieval performance.
Access
Surface provides description only. Full text is available to ProQuest subscribers. Ask your Librarian for assistance.
Recommended Citation
Diekema, Anne Roel, "Translation events in cross-language information retrieval: Lexical ambiguity, lexical holes, vocabulary mismatch, and correct translations" (2003). School of Information Studies - Dissertations. 26.
https://surface.syr.edu/it_etd/26
http://libezproxy.syr.edu/login?url=http://proquest.umi.com/pqdweb?did=764749791&sid=1&Fmt=2&clientId=3739&RQT=309&VName=PQD