1D 13C-NMR data as molecular descriptors in spectra - Structure relationship analysis of oligosaccharides

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Spectra-structure relationships were investigated for estimating the anomeric configuration, residues and type of linkages of linear and branched trisaccharides using 13C-NMR chemical shifts. For this study, 119 pyranosyl trisaccharides were used that are trimers of the α or βanomers of D-glucose, D-galactose, D-mannose, L-fucose or L-rhamnose residues bonded through αor βglycosidic linkages of types 1→2, 1→3, 1→4, or 1→6, as well as methoxylated and/or N-acetylated amino trisaccharides. Machine learning experiments were performed for: (1) classification of the anomeric configuration of the first unit, second unit and reducing end; (2) classification of the type of first and second linkages; (3) classification of the three residues: reducing end, middle and first residue; and (4) classification of the chain type. Our previously model for predicting the structure of disaccharides was incorporated in this new model with an improvement of the predictive power. The best results were achieved using Random Forests with 204 di- and trisaccharides for the training set-it could correctly classify 83%, 90%, 88%, 85%, 85%, 75%, 79%, 68% and 94% of the test set (69 compounds) for the nine tasks, respectively, on the basis of unassigned chemical shifts.

Original languageEnglish
Pages (from-to)3818-3833
Number of pages16
JournalMolecules
Volume17
Issue number4
DOIs
Publication statusPublished - 1 Apr 2012

Keywords

  • C-NMR
  • Classification tree
  • CPGNN
  • Disaccharides
  • Machine learning techniques
  • Oligosaccharides
  • Random Forest
  • Trisaccharides

Fingerprint

Dive into the research topics of '1D <sup>13</sup>C-NMR data as molecular descriptors in spectra - Structure relationship analysis of oligosaccharides'. Together they form a unique fingerprint.

Cite this