Subspace Modeling for Classification of Protein Secondary Structure Elements from Cα Trace

Ali Sekmen, Kamal Al Nasr, Christopher Jones

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents a novel subspace segmentation algorithm that models protein Calpha traces of secondary structure elements (SSEs) as a union of subspaces. For each Calpha, a set of general geometric features are considered. The algorithm first identifies the most relevant features for each SSE using a new matrix rank estimation technique and combinatorics. This is followed by grouping Calpha traces in a sliding-window so that each group represents a data point in a high-dimensional ambient space. Then, a lower dimensional subspace is matched for each SSE. When a group of unknown Calpha traces is presented, the algorithm determines a neighborhood around each Calpha and then uses two approaches to classify the Calpha. In the first approach, the Calpha is represented as a data point in the ambient space and its distance to each subspace is calculated. In the second approach, a local subspace is matched to the Calpha, and the separation of this local subspace from each SSE subspace is computed using geodesic distance on the Grassmannian manifold of the subspaces. The minimum point-to-subspace distance and minimum separation of subspaces are used to classify the Calpha. This geometric and mathematical approach has been applied a large protein dataset and generated 85% classification rate without the need to train a large machine learning system.

Original languageEnglish (US)
Title of host publicationProceedings - 2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021
EditorsYufei Huang, Lukasz Kurgan, Feng Luo, Xiaohua Tony Hu, Yidong Chen, Edward Dougherty, Andrzej Kloczkowski, Yaohang Li
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages72-79
Number of pages8
ISBN (Electronic)9781665401265
DOIs
StatePublished - 2021
Externally publishedYes
Event2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021 - Virtual, Online, United States
Duration: Dec 9 2021Dec 12 2021

Publication series

NameProceedings - 2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021

Conference

Conference2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021
Country/TerritoryUnited States
CityVirtual, Online
Period12/9/2112/12/21

Keywords

  • Cα backbone
  • protein modeling
  • secondary structure classification
  • subspace segmentation

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Biomedical Engineering
  • Health Informatics
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Subspace Modeling for Classification of Protein Secondary Structure Elements from Cα Trace'. Together they form a unique fingerprint.

Cite this