What do we learn by applying multiple methods in topic detection?
A comparative analysis on a large online dataset about mobility electrification

Alboni, Fabrizio; Russo, Margherita; Pavone, Pasquale

Identifying the topics covered in a corpus is one of the central issues in automatic text analysis. The objective of our paper is to contribute to the comparative analysis of different methods. In particular, we compare the results obtained through the use of the most common methods for topic identification, applied to the same corpus. The analysis is performed on a large original textual database created from an e-mobility newsletter. To compare the results between the methods, we refer to two criteria. First of all, the semantic consistency of the different models is evaluated by applying the UMass score and Pointwise mutual information. Secondly, the degree of association between the topics identified by the different models is processed using a heat-map and Cramer's V.

What do we learn by applying multiple methods in topic detection? A comparative analysis on a large online dataset about mobility electrification

Fabrizio Alboni;Margherita Russo;Pasquale Pavone

2022-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Codice ISBN
	
				9788891932310
			
	Parole chiave
	
				topic detection
text mining
Cramer's V
coherence indexes
electric mobility
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12607/38048

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

What do we learn by applying multiple methods in topic detection? A comparative analysis on a large online dataset about mobility electrification

Fabrizio Alboni;Margherita Russo;Pasquale Pavone

2022-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Attenzione

Citazioni

social impact

What do we learn by applying multiple methods in topic detection? A comparative analysis on a large online dataset about mobility electrification

Fabrizio Alboni;Margherita Russo;Pasquale Pavone

2022-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Attenzione

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)