Data mining and development of software with artificial intelligence focused on detecting visa approval profiles for the United States

Authors

  • Leonardo Novoa Maestría
  • Victor Hugo Medina

DOI:

https://doi.org/10.15665/rp.v23i2.3693

Abstract

This article analyzes the profiles of applicants in Colombia who are approved or denied U.S. tourist visas, using data mining under the CRISP-DM methodology together with computational intelligence. Two factors motivate the study. First, the denial rate is approximately 46% [1], which means that Colombians spend nearly two million dollars on rejected visa applications. Second, visa interview appointments have been delayed by two years since the emergence of COVID-19. The analysis identifies patterns and common characteristics among applicants whose tourist visas were approved, such as age, gender, nationality, marital status, and profession, among other fields recorded in the application form (DS-160). It concludes with the development of an AI calculator capable of predicting the probability of approval with an effectiveness of over 85%. The calculator is intended for applicants, who can use it to see which aspects of their profile to improve or to decide to postpone the application until a more favorable moment.
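
The abstract does not state which model powers the calculator. As a rough illustration of what an approval-probability calculator could look like, the sketch below fits a small logistic regression in plain Python. Everything in it is an assumption made for illustration: the three features, the function names, the training settings, and the toy records, which are invented and are not DS-160 data.

```python
import math

# Hypothetical toy records: (age, is_married, has_stable_job) -> 1 approved, 0 denied.
# These rows are invented for illustration only; they are not DS-160 data.
TRAIN = [
    ((25, 0, 0), 0),
    ((30, 0, 1), 1),
    ((45, 1, 1), 1),
    ((22, 0, 0), 0),
    ((38, 1, 1), 1),
    ((27, 0, 0), 0),
    ((50, 1, 1), 1),
    ((33, 1, 0), 0),
]


def scale(features):
    """Bring age into roughly the same range as the binary features."""
    age, married, job = features
    return (age / 100.0, float(married), float(job))


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train(rows, epochs=2000, lr=0.5):
    """Fit logistic regression weights with plain stochastic gradient descent."""
    weights = [0.0, 0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for features, label in rows:
            x = scale(features)
            p = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
            error = p - label  # gradient of the log loss with respect to the logit
            weights = [w - lr * error * xi for w, xi in zip(weights, x)]
            bias -= lr * error
    return weights, bias


def approval_probability(model, features):
    """Return the estimated probability of approval for one applicant profile."""
    weights, bias = model
    x = scale(features)
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


model = train(TRAIN)
```

Calling `approval_probability(model, (40, 1, 1))` returns a value between 0 and 1 for a hypothetical 40-year-old, married, employed applicant. A real calculator would be trained on the actual application fields and validated on held-out data before any accuracy figure is reported.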

References

S. Triche, F. Goncalves, and J. Gama, “A survey on data preprocessing for classification tasks,” Data Mining and Knowledge Discovery, vol. 30, no. 1, pp. 1-36, 2016.

P. Tan, I. Steinwart, and J. Zhu, “A survey of causal inference methods for data mining,” ACM Computing Surveys, vol. 55, no. 2, pp. 1-40, 2022.

Report of the Visa Office 2022. (2023). Travel.gov. Available: https://travel.state.gov/content/travel/en/legal/visa-law0/visa-statistics/annual-reports/report-of-the-visa-office-2022.html [Accessed: Feb. 16, 2023].

A. Prateek and S. Karun, “A survey on data mining techniques for customer churn prediction,” International Journal of Data Mining and Knowledge Management Process, vol. 7, no. 3, pp. 1-25, 2017.

J. Khaterpal, S. Das, and V. Kumar, “A survey on deep learning for data mining,” ACM Computing Surveys, vol. 53, no. 3, pp. 1-43, 2020.

I. H. Witten, E. Frank, M. A. Hall, and C. J. Pal, Data Science for Business. Springer, 2020.

C. P. L. H. Verheijen and P. Adriaans, “The CRISP-DM process: A step-by-step guide,” Springer, 2002.

L. Albarracin, “Visa a Estados Unidos: solicitarla tardaría solo 30 días,” El Tiempo, 9 Aug. 2023. Available: https://www.eltiempo.com/mundo/eeuu-y-canada/visa-a-estados-unidos-solicitarla-tardaria-solo-30-dias-794169 [Accessed: Feb. 7, 2024].

Published

2025-08-22