TY - GEN
T1 - Deep recurrent conditional random field network for protein secondary prediction
AU - Johansen, Alexander Rosenberg
AU - Sønderby, Søren Kaae
AU - Sønderby, Casper Kaae
AU - Winther, Ole
PY - 2017
Y1 - 2017
N2 - Deep learning has become the state-of-the-art method for predicting protein secondary structure from only its amino acid residues and sequence profile. Building upon these results, we propose to combine a bi-directional recurrent neural network (biRNN) with a conditional random field (CRF), which we call the biRNN-CRF. The biRNN-CRF may be seen as an improved alternative to an autoregressive uni-directional RNN where predictions are performed sequentially conditioning on the prediction in the previous timestep. The CRF is instead nearest neighbor-aware and models for the joint distribution of the labels for all time-steps. We condition the CRF on the output of biRNN, which learns a distributed representation based on the entire sequence. The biRNN-CRF is therefore close to ideally suited for the secondary structure task because a high degree of cross-talk between neighboring elements can be expected. We validate the model on several benchmark datasets. For example, on CB513, a model with 1.7 million parameters, achieves a Q8 accuracy of 69.4 for single model and 70.9 for ensemble, which to our knowledge is state-of-the-art. 1
AB - Deep learning has become the state-of-the-art method for predicting protein secondary structure from only its amino acid residues and sequence profile. Building upon these results, we propose to combine a bi-directional recurrent neural network (biRNN) with a conditional random field (CRF), which we call the biRNN-CRF. The biRNN-CRF may be seen as an improved alternative to an autoregressive uni-directional RNN where predictions are performed sequentially conditioning on the prediction in the previous timestep. The CRF is instead nearest neighbor-aware and models for the joint distribution of the labels for all time-steps. We condition the CRF on the output of biRNN, which learns a distributed representation based on the entire sequence. The biRNN-CRF is therefore close to ideally suited for the secondary structure task because a high degree of cross-talk between neighboring elements can be expected. We validate the model on several benchmark datasets. For example, on CB513, a model with 1.7 million parameters, achieves a Q8 accuracy of 69.4 for single model and 70.9 for ensemble, which to our knowledge is state-of-the-art. 1
U2 - 10.1145/3107411.3107489
DO - 10.1145/3107411.3107489
M3 - Article in proceedings
SN - 9781450347228
T3 - Acm-bcb - Proc. Acm Int. Conf. Bioinform., Comput. Biol., Health Informatics
SP - 73
EP - 78
BT - 8th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
PB - Association for Computing Machinery
T2 - 8th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
Y2 - 20 August 2017 through 23 August 2017
ER -