Abstract
An accurate deep learning predictor of enzyme optimal pH is essential to quantitatively describe how pH influences the enzyme catalytic activity. CatOpt, developed in this study, outperformed existing predictors of enzyme optimal pH (RMSE=0.833 and R2=0.479), and could provide good interpretability with informative residue attention weights. The classification of acidic and alkaline enzymes and prediction of enzyme optimal pH shifts caused by point mutations showcased the capability of CatOpt as an effective computational tool for identifying enzyme pH preferences. Furthermore, a single point mutation designed with the guidance of CatOpt successfully enhanced the activity of Pyrococcus horikoshii diacetylchitobiose deacetylase at low pH (pH=4.5/5.5) by approximately 7%, suggesting that CatOpt is a promising in-silico enzyme design tool for pH-dependent enzyme activities.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
A case study section has been added as section 3.4.
Abbreviations
- CNN
- convolutional neural network
- GlcN
- glucosamine
- GlcNAc
- N-acetylglucosamine
- Leaky ReLU
- leaky rectified linear unit
- MAE
- mean absolute error
- MSE
- mean squared error
- PhDac
- Pyrococcus horikoshii diacetylchitobiose deacetylase
- pHopt
- optimal pH
- RD
- residual dense block
- R2, r-squared
- the coefficient of determination
- RMSE
- root mean squared error
- WT
- wild-type.