Text Mining of Open-Ended Questions in Self-Assessment of University Teachers: An LDA Topic Modeling Approach
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10045/103348
Title: | Text Mining of Open-Ended Questions in Self-Assessment of University Teachers: An LDA Topic Modeling Approach |
---|---|
Author(s): | Buenaño Fernández, Diego | González, Mario | Gil, David | Luján-Mora, Sergio |
Research group(s) or GITE: | Lucentia | Advanced deveLopment and empIrical research on Software (ALISoft) |
Center, Department or Service: | Universidad de Alicante. Departamento de Tecnología Informática y Computación | Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos |
Keywords: | Latent Dirichlet allocation | Open-ended questions | Teacher self-assessment | Topic modeling | Topic network |
Knowledge area(s): | Computer Architecture and Technology | Computer Languages and Systems |
Publication date: | 28-Feb-2020 |
Publisher: | IEEE |
Bibliographic citation: | IEEE Access. 2020, 8: 35318-35330. doi:10.1109/ACCESS.2020.2974983 |
Abstract: | The large amount of text generated daily on the web through comments on social networks, blog posts and open-ended question surveys, among others, demonstrates that text data is used frequently, and its processing has therefore become a challenge for researchers. Topic modeling is one of the emerging techniques in text mining; it is based on the discovery of latent data and the search for relationships among text documents. The objective of this research is to evaluate a generic methodology, based on topic modeling and text network modeling, that allows researchers to gather valuable information from surveys that use open-ended questions. To this end, the methodology has been evaluated through a case study of the responses to a teacher self-assessment survey at an Ecuadorian university. The main contribution of the article is the inclusion of clustering algorithms to complement the results obtained from topic modeling. The proposed methodology consists of four phases: (a) construction of a text database, (b) text mining and topic modeling, (c) topic network modeling and (d) the relevance of the identified topics. Previous works have observed that human interpretation plays an important role in the process, especially in phases (a) and (d). For this reason, visualization interfaces, such as graphs and dendrograms, are of critical importance in allowing researchers to efficiently analyze the results of the topic modeling. As a result of this case study, a compendium of the main strategies that teachers carry out in their classes with the aim of improving student retention is presented. In addition, the proposed methodology can be extended to the analysis of unstructured textual information found in blogs, social networks, forums, etc. |
URI: | http://hdl.handle.net/10045/103348 |
ISSN: | 2169-3536 |
DOI: | 10.1109/ACCESS.2020.2974983 |
Language: | eng |
Type: | info:eu-repo/semantics/article |
Rights: | This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/ |
Peer reviewed: | yes |
Publisher's version: | https://doi.org/10.1109/ACCESS.2020.2974983 |
Appears in collections: | INV - LUCENTIA - Artículos de Revistas | INV - ALISoft - Artículos de Revistas |
Files in this item:
File | Description | Size | Format |
---|---|---|---|
09003400.pdf | | 2.12 MB | Adobe PDF |
All documents in RUA are protected by copyright. Some rights reserved.