Multistage modeling for the classification of numerical and categorical datasets: 2017 IEEE International Conference on Fuzzy Systems, FUZZ 2017

C.M. Salgado, M.P. Fernandes, A. Horta, M. Xavier, J.M.C. Sousa, S.M. Vieira

Research output: Contribution to conferencePaperpeer-review

4 Citations (Scopus)

Abstract

Logistic regression and Takagi-Sugeno fuzzy models are sequentially trained with categorical and numerical data in an ensemble-based multistage scheme. In the first stage, a logistic regression model is used to transform the binary feature space into a numerical feature that is used to train a second stage of models consisting of an ensemble of two Takagi-Sugeno fuzzy models. In the ensemble, one model is trained in the space of numerical features and first stage prediction values. The other model is trained only with samples that were classified with a low degree of confidence by the first stage model, in the space of numerical variables. The final output is given by the average of the ensemble predictions at second stage. This scheme was devised under the hypothesis that separating binary from numerical features in the modeling process would increase the performance of a single model using both types of features together. The proposed multistage approach is used to solve a clinical classification problem in a Portuguese hospital. The problem consists of predicting comanagement signalling based on patient clinical data, including diagnosis, procedures, comorbidities and numerical scores, collected before surgery. The multistage performed better in the comanagement dataset, and in 2 out of 5 benchmark datasets. © 2017 IEEE.
Original languageEnglish
DOIs
Publication statusPublished - 2017

Keywords

  • Bins
  • Classification (of information)
  • Forecasting
  • Fuzzy systems
  • Regression analysis
  • Categorical datasets
  • Clinical classification
  • Ensemble prediction
  • Logistic Regression modeling
  • Logistic regressions
  • Multistage approach
  • Numerical variables
  • Takagi Sugeno fuzzy models
  • Numerical models

Fingerprint

Dive into the research topics of 'Multistage modeling for the classification of numerical and categorical datasets: 2017 IEEE International Conference on Fuzzy Systems, FUZZ 2017'. Together they form a unique fingerprint.

Cite this