The principal challenge encountered in the realm of Named-Entity Recognition lies in the acquisition of high-caliber annotated data. In certain languages and specialized domains, the availability of substantial datasets suitable for training models via traditional machine learning methodologies can prove to be a formidable obstacle [10]. In an effort to address this issue, we have explored a Policy-based Active Learning approach aimed at meticulously selecting the most advantageous instances generated through a Data Augmentation procedure [3, 6]. This endeavor was undertaken within the context of a few-shot scenario in the biomedical field. Our study has revealed the superiority of this strategy in comparison to active learning techniques relying on fixed metrics or random instance selection, guaranteeing the privacy of patients from whose medical records the source data were obtained and used. However, it is imperative to note that this approach entails heightened computational demands and necessitates a longer execution duration [7].

Few Shot NER on Augmented Unstructured Text from Cardiology Records

Ferraro, Antonino;
2024-01-01

Abstract

The principal challenge encountered in the realm of Named-Entity Recognition lies in the acquisition of high-caliber annotated data. In certain languages and specialized domains, the availability of substantial datasets suitable for training models via traditional machine learning methodologies can prove to be a formidable obstacle [10]. In an effort to address this issue, we have explored a Policy-based Active Learning approach aimed at meticulously selecting the most advantageous instances generated through a Data Augmentation procedure [3, 6]. This endeavor was undertaken within the context of a few-shot scenario in the biomedical field. Our study has revealed the superiority of this strategy in comparison to active learning techniques relying on fixed metrics or random instance selection, guaranteeing the privacy of patients from whose medical records the source data were obtained and used. However, it is imperative to note that this approach entails heightened computational demands and necessitates a longer execution duration [7].
2024
9783031535543
9783031535550
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12607/27963
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact