Automated web annotator of biomedical entities in spanish language.

 

Authors
Morejón Corrales, Samiel Andrés; Iza Iza, Josselyn Libelia
Format
Article
Status
publishedVersion
Description

In Natural Language Processing (NLP) and supervised machine learning, the scarcity of labeled corpora results in poor performance of machine learning models. In the medical domain, there are also fewer labeled corpora in Spanish than in English. We propose a method to identify biomedical entities in Spanish-language clinical texts, through automatic translation and word alignment, by translating the source text (Spanish) to the target text (English), then labeling the target text with automatic biomedical entity taggers, and finally projecting the biomedical entities from the target text to the corresponding text sections in the source text by means of word alignment generated in the translation process. This is done with the objective of annotating the source text with English language tools (automatic annotators). As a result, an efficient method capable of processing and annotating biomedical entities in the Spanish language with high precision is obtained, since it integrates several automatic annotators in a single web system.
ESPE-L

Publication Year
2022
Language
eng
Topic
SISTEMAS WEB
ALINEACIÓN AUTOMÁTICA
PROGRAMACIÓN NEUROLINGUÍSTICA
Repository
Repositorio Universidad de las Fuerzas Armadas
Get full text
http://repositorio.espe.edu.ec/handle/21000/33602
Rights
openAccess
License