Welcome to the BioCatNet database system

The BioCatNet database system is a repository of sequence, structure and biocatalytic data on protein families to facilitate protein engineering.

The BioCatNet concept

BioCatNet core data model

Proteins are assigned to homologous families based on their sequence similarity. Homologous families are then grouped hierarchically into superfamilies (similar to the DWARF system by Fischer et al. 2006).
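The hierarchy described above (protein, homologous family, superfamily) can be sketched as a small data model. This is only an illustration, not the actual BioCatNet schema: the class names, the `accession` field, the `identity` measure, and the 0.5 threshold are all assumptions made for the example, and real family assignment would rely on a proper sequence comparison method rather than identity over pre-aligned strings.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Protein:
    accession: str  # hypothetical identifier, not a real BioCatNet field
    sequence: str   # pre-aligned sequence; "-" marks a gap


@dataclass
class HomologousFamily:
    name: str
    proteins: List[Protein] = field(default_factory=list)


@dataclass
class Superfamily:
    name: str
    families: List[HomologousFamily] = field(default_factory=list)

    def sequence_count(self) -> int:
        """Total number of proteins across all families."""
        return sum(len(family.proteins) for family in self.families)


def identity(a: str, b: str) -> float:
    """Fraction of identical residues over the ungapped columns of two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return sum(x == y for x, y in pairs) / len(pairs)


def assign_family(
    protein: Protein, superfamily: Superfamily, threshold: float = 0.5
) -> Optional[HomologousFamily]:
    """Add the protein to the most similar family (assumed rule: compare
    against each family's first member); return None if nothing reaches
    the threshold."""
    best, best_score = None, threshold
    for family in superfamily.families:
        if not family.proteins:
            continue
        score = identity(protein.sequence, family.proteins[0].sequence)
        if score >= best_score:
            best, best_score = family, score
    if best is not None:
        best.proteins.append(protein)
    return best
```

Comparing against a single representative per family keeps the sketch short; a production system would more likely compare against a family profile or all members.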

On the sequence level, annotations may include functionally relevant positions and domain boundaries. Given sufficient structural information, a standard numbering scheme is provided for each protein family. This scheme makes it possible to identify functionally and structurally relevant residues unambiguously, to communicate results on mutations, and to systematically analyze sequence-function relationships in protein families (Vogel et al. 2012).
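To illustrate the idea behind a standard numbering scheme, the sketch below maps the residue positions of one family member onto the positions of a reference sequence, given a pairwise alignment of the two. The function name, the `"-"` gap convention, and the choice to map insertions to `None` are assumptions made for this example; the scheme described by Vogel et al. 2012 may treat insertions differently.

```python
from typing import Dict, Optional


def standard_numbering(
    aligned_reference: str, aligned_query: str, reference_start: int = 1
) -> Dict[int, Optional[int]]:
    """Map each 1-based query residue position to the reference position it
    is aligned with. Query residues aligned to a reference gap (insertions)
    map to None in this simplified sketch."""
    if len(aligned_reference) != len(aligned_query):
        raise ValueError("aligned sequences must have the same length")

    mapping: Dict[int, Optional[int]] = {}
    ref_pos = reference_start - 1  # last reference residue seen so far
    query_pos = 0                  # last query residue seen so far

    for ref_char, query_char in zip(aligned_reference, aligned_query):
        if ref_char != "-":
            ref_pos += 1
        if query_char != "-":
            query_pos += 1
            mapping[query_pos] = ref_pos if ref_char != "-" else None
    return mapping


# Toy alignment: the query has an insertion after position 2 and a deletion
# at reference position 4.
numbering = standard_numbering("MK-TAY", "MKLT-Y")
print(numbering)
```

With such a mapping, a mutation reported at query position 4 could be communicated as reference position 3, so that equivalent residues in different family members share one number.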

We are currently extending the BioCatNet data model to store biocatalytic data and to analyze experimental data with selected kinetic models.
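The page does not say which kinetic models are supported. As one possible example, the sketch below assumes the Michaelis-Menten model, v = Vmax * [S] / (Km + [S]), and estimates Vmax and Km from initial-rate data using the Hanes-Woolf linearization ([S]/v plotted against [S]). The function names and data are hypothetical, and nonlinear regression is usually preferred for real measurements because linearization distorts the error structure.

```python
from typing import Sequence, Tuple


def michaelis_menten(s: float, vmax: float, km: float) -> float:
    """Initial rate predicted by the Michaelis-Menten model."""
    return vmax * s / (km + s)


def fit_hanes_woolf(
    substrate: Sequence[float], rates: Sequence[float]
) -> Tuple[float, float]:
    """Estimate (Vmax, Km) by ordinary least squares on the Hanes-Woolf form
    [S]/v = [S]/Vmax + Km/Vmax, i.e. slope = 1/Vmax, intercept = Km/Vmax."""
    if len(substrate) != len(rates) or len(substrate) < 2:
        raise ValueError("need at least two paired measurements")

    x = list(substrate)
    y = [s / v for s, v in zip(substrate, rates)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    km = intercept / slope
    return vmax, km


# Synthetic, noise-free data generated with assumed Vmax = 10 and Km = 2.
concentrations = [0.5, 1.0, 2.0, 5.0, 10.0]
observed = [michaelis_menten(s, 10.0, 2.0) for s in concentrations]
vmax_est, km_est = fit_hanes_woolf(concentrations, observed)
```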

Available databases

The BioCatNet infrastructure currently contains the following family-specific protein databases:

Abbreviation  Database                                                     No. of sequences  No. of structures
TEED          Thiamine diphosphate-dependent Enzymes Engineering Database  119567            308
IRED          Imine Reductase Engineering Database                         1409              8
CYPED         CYtochrome P450 Engineering Database                         52674             595
TEMLACED      TEM LACtamase Engineering Database                           483               65
LCCED         LaCCase and multicopper oxidase Engineering Database         14536             138
TTCED         TriTerpene Cyclase Engineering Database                      2794              18


BioCatNet is maintained by the bioinformatics group at the Institute of Technical Biochemistry at the University of Stuttgart, Germany.
We welcome your comments on this page. Please contact Jürgen Pleiss regarding contributions or collaborations.
For technical questions, please use our bug tracking page or contact Patrick Buchholz.