ابهام و ابهام زدایی در سیستم های بازیابی اطلاعات(مرور نظام مند) (مقاله علمی وزارت علوم)
درجه علمی: نشریه علمی (وزارت علوم)
آرشیو
چکیده
هدف: ابهام زمانی پدید می آید که از کلمه ای، عبارتی و یا جمله ای بیش از یک معنی و مفهوم قابل برداشت باشد و درک این امر توسط سیستم بازیابی اطلاعات ضروری است. از این رو، پژوهش حاضر با هدف مرور بر پژوهش های انجام شده در مورد عوامل ابهام زا و ابهام زدا با رویکرد نظام مند انجام شد.روش : پژوهش حاضر از نوع کاربردی و با رویکرد کیفی است. شیوه گردآوری داده ها از نوع مرور نظام مند (استاندارد پریزما) و جامعه آماری مقالات نشریات، همایش ها و پایان نامه های نمایه شده در پایگاه های اطلاعاتی ایران است. از بین 175 منبع علمی بازیابی شده، پس از غربال گری تعداد 37 منبع مورد بررسی قرارگرفت.یافته ها: بیشترین موضوع غالب پژوهش ها «ابهام معنای کلمات» (53 درصد) و «ابهام زدایی واژگان» (44 درصد) بود. روش پژوهش غالباً با رویکرد «شبکه عصبی» (22 درصد)، بیشترین تعداد منابع مربوط به مقالات همایش ها و کنفرانس های ملی و بین المللی (48 درصد) و رشته های کامپیوتر و زبان شناسی (27 درصد) بیشترین تولیدات علمی را در این رابطه داشتند.نوآوری: پژوهش حاضر از این نظر که به موضوع ابهام و ابهام زدایی در بازیابی اطلاعات با رویکرد مرور نظام مند پرداخته است، نوآورانه محسوب می شود.نتیجه گیری: بیشتر پژوهش های ایرانی متمرکز بر ابهام زدایی بودند که اکثرأ در کنفرانس ها و همایش های ملی و بین المللی ارائه شده بودند. تولیدات علمی بیشتر متعلق به رشته های کامپیوتر و زبان شناسی بود. ابهام معنایی کلمات و ابهام زدایی از واژگان دغدغه غالب در پژوهش های ایرانی بوده است.Ambiguity and Disambiguation in Information Retrieval Systems (Systematic Review)
Objective: Ambiguity arises when more than one meaning and concept can be understood from a word, phrase or sentence. Since it seems necessary to understand this by the information retrieval system in order to increase the accuracy of the information retrieval system and increase the retrieval of related resources, the present research aims to identify the ambiguous and disambiguating factors through a systematic review of studies. It has been done in Iran.Methodology: The research method is applied in terms of purpose, in terms of approach, qualitative and in terms of information gathering method, systematic review using PRISMA standard. The statistical population includes journal articles, conferences, and dissertations indexed in Iranian databases, including: Magiran, Noormags, Shiraz Regional Center for Science and Technology, Civilica, and Academic Jihad Center (SID) and Scientific Information of Iran (treasure). 175 scientific sources and 138 scientific sources were excluded based on the output criteria and 37 scientific sources were selected based on the input criteria. Input criteria: focusing on the studies conducted on the subject of ambiguity and disambiguation, in the fields of linguistics, information technology, artificial intelligence, computer and information science, and epistemology, scientific sources published in Persian language, without publication time limits, review articles, Scientific research, master's and doctorate theses and national and international conference papers held inside Iran. Output criteria: studies carried out except for ambiguity and disambiguation, non-Persian language scientific sources, books, reports, editorials, abstract writings and short articles (less than 5 pages)Findings: The results show that the main topic of interest in this research was "ambiguity of the meaning of words" (53%) and "disambiguation of words" (44%) and the least topics of ambiguity in machine translation (21%) and disambiguation through ontology (5 percent). The most research methods with the "neural network" approach (22 percent) and the least research methods used were content analysis, methodology and machine learning (5 percent), the largest number of sources related to the articles of conferences and national conferences and was international (48 percent) and the least were master theses and doctoral dissertations (5 percent). Computer and linguistics fields (27 percent) have had the most scientific productions in this regard, and information science and epistemology fields (9 percent) have had the lowest productions.Conclusion: The present research is considered innovative in the sense that it deals with the issue of ambiguity and disambiguation in information retrieval with a systematic review approach. Its findings indicate that most Iranian researches were focused on disambiguation, which were mostly presented in national and international conferences and meetings. The topic of ambiguity and disambiguation is more important in the fields of computer science and linguistics, and the fields of information science and epistemology have not dealt with it. The semantic ambiguity of words and, by nature, the disambiguation of words has been the concern of format in Iranian researches. Ontology as a new approach to disambiguation has received less attention.