A computational literature review of football performance analysis through probabilistic topic modeling

Vitor Ayres Principe, Rodrigo Gomes de Souza Vale, Juliana Brandão Pinto de Castro, Luiz Marcelo Carvano, Roberto André Pereira Henriques, Victor José de Almeida e Sousa Lobo, Rodolfo de Alkmim Moreira Nunes

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)


This research aims to illustrate the potential use of concepts, techniques, and mining process tools to improve the systematic review process. Thus, a review was performed on two online databases (Scopus and ISI Web of Science) from 2012 to 2019. A total of 9649 studies were identified, which were analyzed using probabilistic topic modeling procedures within a machine learning approach. The Latent Dirichlet Allocation method, chosen for modeling, required the following stages: 1) data cleansing, and 2) data modeling into topics for coherence and perplexity analysis. All research was conducted according to the standards of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses in a fully computerized way. The computational literature review is an integral part of a broader literature review process. The results presented met three criteria: (1) literature review for a research area, (2) analysis and classification of journals, and (3) analysis and classification of academic and individual research teams. The contribution of the article is to demonstrate how the publication network is formed in this particular field of research, and how the content of abstracts can be automatically analyzed to provide a set of research topics for quick understanding and application in future projects.

Original languageEnglish
Number of pages21
JournalArtificial Intelligence Review
Issue number2
Early online date4 Apr 2021
Publication statusPublished - 1 Feb 2022


  • Computational literature review
  • Football
  • LDA
  • Literature review
  • Performance analysis
  • Topic models


Dive into the research topics of 'A computational literature review of football performance analysis through probabilistic topic modeling'. Together they form a unique fingerprint.

Cite this