مطالب مرتبط با کلیدواژه

Relation Extraction


۱.

RePersian - A Fast Relation Extraction Tool in Persian(مقاله علمی وزارت علوم)

کلیدواژه‌ها: Relation Extraction Persian Language Regex POS Tag

تعداد بازدید : ۵۰۷ تعداد دانلود : ۲۸۷
The task of extracting semantic relations from raw data is called relation extraction. One of the most important fields in open information extraction is the automatically extraction of relations in any domain, especially in web mining. There are many works and approaches for relation extraction in English and other languages. Some of these approaches are based on parsing trees. Dependency parsing in the Persian language is difficult and time-consuming, since Persian is a low resource language and has also a dependency grammar and lexical structure, which affects also the speed of relations extraction in Persian. In this paper we will introduce a fast relation extraction method in Persian called RePersian. RePersian is dependent on part-of-speech (POS) tags of a sentence and special relation patterns, which are extracted by analyzing sentence structures in Persian. For finding relation patterns, RePersian searches through POS-tags that are given in regular expression forms. By matching the correct POS pattern to a relation pattern, RePersian extracts the semantic relations in a sentence. We appraise RePersian in two different scenarios on the Dadegan Persian dependency tree dataset. RePersian had on average the precisions 78.05%, 80.4% and 54.85% in finding the first argument on a relation, the second argument and the right relation between them.
۲.

A Distant Supervised Approach for Relation Extraction in Farsi Texts(مقاله علمی وزارت علوم)

کلیدواژه‌ها: Relation Extraction Information Extraction Distant Supervision Persian Wikipedia

تعداد بازدید : ۵۹۶ تعداد دانلود : ۳۱۲
The volume of Farsi information on the Internet has been increasing in recent years. However, most of this information is in the form of unstructured or semi-structured free text. For quick and accurate access to the vast knowledge contained in these texts, the information extraction methods are essential to generate knowledge bases. In recent years, relation extraction as a sub-task of information extraction has received much attention. While many of these systems were developed in English and other well-known languages, the systems for information extraction in Farsi have received less attention from researchers. In this systematic research for semi-automatic relation extraction, Persian Wikipedia articles were presented as reliable and semi-structured sources. In this system, the relation extraction is performed with the assistance of patterns that are automatically obtained with an approach based on distant supervised. In order to apply the distant supervised, the vast knowledge base of Wikidata has been used as a source in perfect synchronization with Wikipedia. The results show that the average precision value for all relations is 76.81%, which indicates an enhancement of precision compared to other methods in Farsi.
۳.

DREaM: Drug-Drug Relation Extraction via Transfer Learning Method(مقاله علمی وزارت علوم)

کلیدواژه‌ها: Relation Extraction Drug-Drug relations Transfer Learning

حوزه‌های تخصصی:
تعداد بازدید : ۱۹ تعداد دانلود : ۱۸
Relation extraction between drugs plays a crucial role in identifying drug–drug interactions and predicting side effects. The advancement of machine learning methods in relation extraction, along with the development of large medical text databases, has enabled the low-cost extraction of such relations compared to other approaches that typically require expert knowledge. However, to the best of our knowledge, there are limited datasets specifically designed for drug–drug relation extraction currently available. Therefore, employing transfer learning becomes necessary to apply machine learning methods in this domain. In this study, we propose DREAM, a method that first employs a trained relation extraction model to discover relations between entities and then applies this model to a corpus of medical texts to construct an ontology of drug relationships. The main contribution of this study is adapting the ACORD model to the medical domain using transfer learning, enabling the extraction of domain-specific drug–drug relations and the construction of an ontology. The extracted relations are subsequently validated using a large language model. Quantitative results indicate that the LLM agreed with 71 % of the relations extracted from a subset of PubMed abstracts. Furthermore, our qualitative analysis indicates that this approach can uncover ambiguities in the medical domain, highlighting the challenges inherent in relation extraction in this field.