학술논문

Prediction of protein subcellular localization using machine learning with novel use of generic feature set
Document Type
Conference
Source
2020 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE) Electrical and Computer Engineering (WIECON-ECE), 2020 IEEE International Women in Engineering (WIE) Conference on. :1-4 Dec, 2020
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Proteins
Location awareness
Support vector machines
Machine learning algorithms
Machine learning
Prediction algorithms
Classification algorithms
Protein
Subcellular localization
Support Vector Machine
Machine Learning
Feature Selection
Language
Abstract
The method of identifying the location of protein within a cell is called subcellular localization of proteins. This area of research in Bioinformatics is pivotal for protein synthesis and drug discovery of several medical conditions and diseases. This paper introduces a new machine learning approach for subcellular localization of proteins, which used 18 basic and physicochemical features novel for such methods. A model with support vector machine (SVM) was developed at first to learn these properties of proteins from 6 locations inside a cell, and then test the model on another independent set of protein sequences. The proposed multi-class classification algorithm achieved an accuracy of about 94%. The results show superior performance with minimal computations when compared to similar algorithms in the literature.