Empirical Analysis on OpenAPI Topic Exploration and Discovery to Support the Developer Community

Leonardo da Rocha Araujo; Guillermo Rodríguez; Santiago Vidal; Claudia Marcos; Rodrigo Pereira dos Santos

Empirical Analysis on OpenAPI Topic Exploration and Discovery to Support the Developer Community

Autores/as

Leonardo da Rocha Araujo Universidad Nacional del Centro de la provincia de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Argentina
Guillermo Rodríguez Universidad Nacional del Centro de la provincia de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Argentina
Santiago Vidal Universidad Nacional del Centro de la provincia de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Argentina
Claudia Marcos Universidad Nacional del Centro de la provincia de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Argentina
Rodrigo Pereira dos Santos Universidad de Río de Janeiro, Brasil

Palabras clave:

Microservices, OpenAPI, Migration, Legacy Systems, Topic Modeling

Resumen

OpenAPI has become a dominant standard for documentation in the service-oriented software industry. OpenAPI is used in many analysis and reengineering approaches for RESTful service and microservice-based systems. An OpenAPI document has several components that are usually filled by humans using natural language (e.g. description of a certain functionality). Thus, subjectivity may lead to inconsistencies and ambiguities. Understanding what an API does is a challenging question. As a consequence, this issue could hinder developers from identifying the functionality of APIs, after reading all its components. Along this line, we argue that developers will be provided with supportive tools to find those APIs that better suit their needs. In this paper, we propose a step towards creating these kinds of tools by empirically analyzing a set of 2,000 OpenAPI documents with the goal of extracting the main topics of an API using three topic modeling algorithms. To address this issue, we focus on three tasks: i) determine which component of an OpenAPI document provides the most meaningful information, ii) compare three state-of-the-art topic modeling algorithms, and iii) determine the optimal number of topics to represent an API. Our findings show that the best results could be obtained from the Description component by using the Non-negative Matrix Factorization (NMF) or Latent Semantic Indexing (LSI) algorithms. To help developers find services in the OpenAPI directory, we also propose a prototype tool to explore the OpenAPI documents and analyze extracted topics to assess if the APIs meet developer’s needs.

Descargas

Los datos de descarga aún no están disponibles.

Descargas

PDF (Inglés)

Publicado

2022-12-15

Número

Vol. 8 Núm. 3 (2022): ASSE 2022 - Simposio Argentino de Ingeniería de Software

Sección

ASSE - Argentine Symposium on Software Engineering

Licencia

Derechos de autor 2022 Leonardo da Rocha Araujo, Guillermo Rodríguez, Santiago Vidal, Claudia Marcos, Rodrigo Pereira dos Santos

Esta obra está bajo una licencia internacional Creative Commons Atribución-NoComercial-CompartirIgual 4.0.

Acorde a estos términos, el material se puede compartir (copiar y redistribuir en cualquier medio o formato) y adaptar (remezclar, transformar y crear a partir del material otra obra), siempre que a) se cite la autoría y la fuente original de su publicación (revista y URL de la obra), b) no se use para fines comerciales y c) se mantengan los mismos términos de la licencia.

Cómo citar

da Rocha Araujo, L., Rodríguez, G., Vidal, S., Marcos, C., & Pereira dos Santos, R. (2022). Empirical Analysis on OpenAPI Topic Exploration and Discovery to Support the Developer Community. JAIIO, Jornadas Argentinas De Informática, 8(3), 68-69. https://revistas.unlp.edu.ar/JAIIO/article/view/18415