Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-the-art methods. First, several proteins appear in different subcellular structures simultaneously, whereas current methods only predict one protein sequence in one subcellular structure. Second, most software tools are trained with obsolete data and the latest new databases are missed. We proposed a novel multi-label classification algorithm to solve the first problem and integrated several latest databases to improve prediction performance. Experiments proved the effectiveness of the proposed method. The present study would facilitate research on cellular proteomics.
Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
Xiaoqiang Guo,Fulin Liu,Y. Ju,Zhen Wang,Chunyu Wang
Published 2016 in Scientific Reports
ABSTRACT
PUBLICATION RECORD
- Publication year
2016
- Venue
Scientific Reports
- Publication date
2016-06-21
- Fields of study
Biology, Medicine, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar, PubMed
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-59 of 59 references · Page 1 of 1
CITED BY
Showing 1-54 of 54 citing papers · Page 1 of 1