Text Mining of Open-Ended Questions in Self-Assessment of University Teachers: An LDA Topic Modeling Approach

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/103348
Información del item - Informació de l'item - Item information
Título: Text Mining of Open-Ended Questions in Self-Assessment of University Teachers: An LDA Topic Modeling Approach
Autor/es: Buenaño Fernández, Diego | González, Mario | Gil, David | Luján-Mora, Sergio
Grupo/s de investigación o GITE: Lucentia | Advanced deveLopment and empIrical research on Software (ALISoft)
Centro, Departamento o Servicio: Universidad de Alicante. Departamento de Tecnología Informática y Computación | Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Palabras clave: Latent Dirichlet allocation | Open-ended questions | Teacher self-assessment | Topic modeling | Topic network
Área/s de conocimiento: Arquitectura y Tecnología de Computadores | Lenguajes y Sistemas Informáticos
Fecha de publicación: 28-feb-2020
Editor: IEEE
Cita bibliográfica: IEEE Access. 2020, 8: 35318-35330. doi:10.1109/ACCESS.2020.2974983
Resumen: The large amount of text that is generated daily on the web through comments on social networks, blog posts and open-ended question surveys, among others, demonstrates that text data is used frequently, and therefore; its processing becomes a challenge for researchers. The topic modeling is one of the emerging techniques in text mining; it is based on the discovery of latent data and the search for relationships among text documents. In this paper, the objective of the research is to evaluate a generic methodology based on topic modeling and text network modeling, that allows researchers to gather valuable information from surveys that use open-ended questions. To achieve this, this methodology has been evaluated through the use of a case study in which the responses to a teacher self-assessment survey in an Ecuadorian university have been studied. The main contribution of the article is the inclusion of clustering algorithms in order to complement the results obtained when executing topic modeling. The proposed methodology is based on four phases: (a) Construction of a text database, (b) Text mining and topic modeling, (c) Topic network modeling and (d) The relevance of the identified topics. In previous works, it has been observed that the human interpretative contribution plays an important role in the process, especially in phases (a) and (d). For this reason, the visualization interfaces, such as graphs and dendograms, are of critical importance for researchers in order allow topic to efficiently analyze the results of the topic modeling. As a result of this case study, a compendium of the main strategies that teachers carry out in their classes with the aim of improving student retention is presented. In addition, the proposed methodology can be extended to the analysis of the unstructured textual information found in blogs, social networks, forums, etc.
URI: http://hdl.handle.net/10045/103348
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2020.2974983
Idioma: eng
Tipo: info:eu-repo/semantics/article
Derechos: This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/
Revisión científica: si
Versión del editor: https://doi.org/10.1109/ACCESS.2020.2974983
Aparece en las colecciones:INV - LUCENTIA - Artículos de Revistas
INV - ALISoft - Artículos de Revistas

Archivos en este ítem:
Archivos en este ítem:
Archivo Descripción TamañoFormato 
Thumbnail09003400.pdf2,12 MBAdobe PDFAbrir Vista previa


Todos los documentos en RUA están protegidos por derechos de autor. Algunos derechos reservados.