Towards the use of multi-relational data mining (MRDM) in bioprocess models

Research output: Contribution to conferenceConference abstract for conferenceResearchpeer-review

3 Downloads (Pure)


In the era of digitalization, the complexity of bioprocesses stands in the way of the biomanufacturing industry to fully adopt and benefit from the Industry 4.01 standards. Even though several aspects of next generation manufacturing can be adopted from other industries as “of the shelf”, such as the management of data and AI algorithms, making use of the underlying process knowledge and models is key to achieve smart biomanufacturing. So far, a significant investment has been made in developing mechanistic models for bioprocesses, both by academia and industry. However, little thought has been put into how to store and keep on developing and exploiting these models. Hence, even if they contain vast and comprehensive knowledge about the process, this information might be misplaced or even lost. A potential solution is the use of relational databases. They store available information by applying logical connections and consequently allow the creation of rationally interconnected repositories. Furthermore, this structure enables the use of multi-relational data mining (MRDM) as a tool to find inter-table patterns2. MRDM has been successfully applied to other disciplines, such as in bioinformatics3, once it provides a basis for finding new premises and relationships within the information discriminated and disseminated across the multiple tables. Therefore, in order to get a step closer to smart biomanufacturing, this study has a two-fold objective: (i) to design and implement a relational database which aims at collecting mechanistic models for bioprocesses, and related process information; as well as (ii) to explore the potential behind MRDM with regards to biomanufacturing. The relational database, developed in this work, collects knowledge in several interconnected tables. In these tables, the following data is recorded: (I) process conditions (e.g. pH, temperature, mode of operation, etc.); (II) model details (type and characteristics); (III) model parameters; and, (IV) uncertainty ranges associated with the parameters (when available)4. Additionally, the database contains a table of functional problems, based on the operational conditions (e.g. the cooling system is not working), and a table with the corresponding feasible solutions (e.g. adjust the cooling water flow). Subsequently, an inductive logic programming and relational data mining algorithm5 are implemented and applied to the database. This allows, for example, finding connections between new models and the library of operational problems simply based on the implicit characteristics of the model. In order to highlight the applicability and usefulness of the proposed strategy, it is demonstrated on an industrial anaerobic microbial conversion case study.
Original languageEnglish
Publication date2020
Number of pages2
Publication statusPublished - 2020
Event2020 AIChE Annual Meeting - Virtual event
Duration: 16 Nov 202020 Nov 2020


Conference2020 AIChE Annual Meeting
LocationVirtual event

Fingerprint Dive into the research topics of 'Towards the use of multi-relational data mining (MRDM) in bioprocess models'. Together they form a unique fingerprint.

Cite this