Abstract
The large and functionally diverse GH5 family is one of the very first Carbohydrate-Active enZYme families classified back in 1991 [3], [4]. The number of confirmed members in CAZy has grown to a staggering 27,000 where only approximately 600 (2%) have been assigned a function experimentally. In spite of early large-scale efforts [1], [2], the eye-watering paucity of characterized GH5 enzymes constitutes a general problem observed in most CAZy families, worsened by an unequal distribution of the characterized GH5 across the sequence space. This project attempts to address this problem by implementing large scale functional screening of approximately 200 carefully chosen GH5 candidates that systematically targets low or unexplored subfamilies in the known sequence space, thereby creating an excellent and large training set for use in developing progressive machine-learning algorithms that potentially can provide reliable and stable functional predictions of enzymes based on sequence information.
Original language | English |
---|---|
Title of host publication | Digitally Driven Biotechnology: 4th DTU Bioengineering symposium |
Number of pages | 1 |
Place of Publication | Kgs. Lyngby, Denmark |
Publisher | DTU Bioengineering |
Publication date | 2023 |
Pages | 33-33 |
Article number | 4 |
Publication status | Published - 2023 |
Event | 4th DTU Bioengineering symposium - Kgs. Lyngby, Denmark Duration: 26 Oct 2023 → 26 Oct 2023 |
Conference
Conference | 4th DTU Bioengineering symposium |
---|---|
Country/Territory | Denmark |
City | Kgs. Lyngby |
Period | 26/10/2023 → 26/10/2023 |