The BioCatNet database system is a repository of sequence, structure and biocatalytic data on protein families to facilitate protein engineering.
Proteins are assigned to homologous families by their sequence similarity. Homologous families are then grouped hierarchically to superfamilies (similar to the DWARF system by Fischer et al. 2006).
On the sequence level, annotations may include functionally relevant positions and domain boundaries. Given sufficient structural information, a standard numbering scheme is provided for a protein family which allows for the unambiguous identification of functionally and structurally relevant residues, to communicate results on mutations and to systematically analyze sequence-function relationships in protein families (Vogel et al. 2012).
Currently, we extend the BioCatNet data model to store biocatalytic data and to analyze experimental data by selected kinetic models.
The BioCatNet infrastructure currently contains the following family-specific protein databases:
|Abbreviation||Database||No. of sequences||No. of structures|
|CYPED||CYtochrome P450 Engineering Database||52674||595|
|HYED||Hydratase Engineering Database||2046||3|
|IRED||Imine Reductase Engineering Database||1409||8|
|LCCED||LaCCase and multicopper oxidase Engineering Database||14536||138|
|oTAED||ω-Transaminase Engineering Database||114655||234|
|TEED||Thiamine diphosphate-dependent Enzymes Engineering Database||119567||308|
|TEMLACED||TEM LACtamase Engineering Database||483||65|
|TTCED||TriTerpene Cyclase Engineering Database||2794||18|
BioCatNet is maintained by the bioinformatics group at the Institute of Technical Biochemistry at University of Stuttgart, Germany.
We acknowledge your comments to this page. Please contact Jürgen Pleiss for contributions or collaborations.
For technical questions, please use our bug tracking page or contact Patrick Buchholz.