What is DD-DeCaF?

The European Commission has awarded 6.3 million Euros to a four-year collaborative project on data-driven design of cells and microbial communities for applications ranging from human health to sustainable production of chemicals. With advances in synthetic biology genomes can now be edited at unprecedented speed allowing making multiple changes in the same genome at the same time. This increases the need for computational tools to design cells and communities of cells analogous to the tools used in Computer Aided Design of cars, buildings and other man-made objects. In biotechnology these design tools need to be able to use existing large-scale databases to discover new parts and place them in the functioning context of the cell. The tools need to be easily accessible and provide an intuitive visual map of the cell to the biotechnologists working in the lab on building better cell factories and communities.

The project, called DD-DeCaF (Bioinformatics Services for Data-Driven Design of Cell Factories and Communities) brings together leading academic partners from five European universities with five innovative European companies to address the challenge of building a comprehensive design tool. The academic partners will develop cutting edge methods for using large scale data to design cell factories and communities for biotechnological applications. Three innovative Small/Medium Enterprise partners will convert these advanced methods to software tools that can be used by non-experts and to build intuitive visualizations of biological networks. These tools will be tested and applied to real world cell factory development projects by end-user partners.

Screencasts and videos

News and Social

3rd workshop in November, ESIB, Graz (November 18, 2019)

The DD-DeCaF 3rd Workshop: Computer-aided design of cell factories will be integrated in the European Summit of Industrial Biotechnology (ESIB), to be held in Graz, Austria.

Read more
2nd periodic review meeting (March 15, 2019)

The 2nd periodic review meeting of the DD-DeCaF projcet was held in Brussels on March 15, 2019 at the CPH EU Office.

Read more
2nd workshop in September, Oeiras/Lisbon (September 18, 2018)

DD-DeCaF 2nd Workshop: data-driven cell factory design

Read more
DD-DeCaF 5th Consortium Meeting in Lausanne (April 24, 2018)

The fifth DD-DeCaF consortium meeting was held at EPFL in Lausanne on April 23-24.

Read more
DD-DeCaF 4th Consortium Meeting in Delft (September 11, 2017)

The fourth DD-DeCaF consortium meeting was held at DSM in Delft on September 11 at the new Rosalind Franklin Biotechnology Center .

Read more
Hands-on workshop in September, Delft (June 29, 2017)

DD-DeCaF 1st Workshop: Hands on introduction to data-driven cell factory and community design

Read more
DD-DeCaF 3rd Consortium Meeting in Copenhagen (May 30, 2017)

The third DD-DeCaF consortium meeting was held in Kongens Lyngby (Copenhagen) on 11-12 May 2017 at the Novo Nordisk Foundation Center for Biosustainability.

Read more
1st edition of the DD-DeCaF newsletter now available! (February 28, 2017)

The first edition of the DD-DeCaF is now available, in this issue it is presented an overview of the project and its resources and a summary including the publications generated in the year 1 of the project.

Read more
Webcast DD-DeCaF platform adds an interactive pathway viewer (January 25, 2017)

Read more
DD-DeCaF 2nd Consortium Meeting in Heidelberg (September 28, 2016)

The second DD-DeCaF consortium meeting was held in Heidelberg on 26-27 September 2016 at EMBL.

Read more
Using big “bio-data” to design better cell factories [Press release] (April 01, 2016)

The EU has granted 6.3 million Euros to the project DD-DeCaF, coordinated by the Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark. The objective is to develop a computer tool that will allow biotech companies to design and engineer cell factories faster than is currently possible today. The tool will accelerate the production of sustainable bio-chemicals and lay the groundwork for design of healthier foodstuff.

Read more
DD-DeCaF Kickoff Meeting in Brussels (March 10, 2016)

The official kick-off meeting of the DD-DeCaF project was held in Brussels on 7-8 March 2016 at creoDK and hosted by the project coordinator the Novo Nordisk Foundation Center for Biosustainability.

Read more

Tweets by @DDDeCaF

Newsletter

Join Messages from DD-DeCaF to receive updates (quarterly) on new releases!

Publications

Huerta-Cepas, Jaime and Forslund, Kristoffer and Szklarczyk, Damian and Jensen, Lars Juhl and von Mering, Christian and Bork, Peer, Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper, bioRxiv, 31961, 10.1101/076331

September 22, 2016

Orthology assignment is ideally suited for functional inference. However, because predicting orthology is computationally intensive at large scale, and most pipelines relatively inaccessible, less precise homology-based functional transfer is still the default for (meta-)genome annotation. We therefore developed eggNOG-mapper, a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from eggNOG. To validate our method, we benchmarked Gene Ontology predictions against two widely used homology-based approaches: BLAST and InterProScan. Compared to BLAST, eggNOG-mapper reduced by 7% the rate of false positive assignments, and increased by 19% the ratio of curated terms recovered over all terms assigned per protein. Compared to InterProScan, eggNOG-mapper achieved similar proteome coverage and precision, while predicting on average 32 more terms per protein and increasing by 26% the rate of curated terms recovered over total term assignments per protein. Through strict orthology assignments, eggNOG-mapper further renders more specific annotations than possible from domain similarity only (e.g. predicting gene family names). eggNOG-mapper runs ~15x than BLAST and at least 2.5x faster than InterProScan. The tool is available standalone or as an online service at http://eggnog-mapper.embl.de.

Read more
Machado, Daniel and Herrgård, Markus J and Rocha, Isabel, Stoichiometric Representation of Gene--Protein--Reaction Associations Leverages Constraint-Based Analysis from Reaction to Gene-Level Phenotype Prediction, PLoS Comput Biol, 12(10) , e1005140, 10.1371/journal.pcbi.1005140

October 06, 2016

Genome-scale metabolic reconstructions are currently available for hundreds of organisms. Constraint-based modeling enables the analysis of the phenotypic landscape of these organisms, predicting the response to genetic and environmental perturbations. However, since constraint-based models can only describe the metabolic phenotype at the reaction level, understanding the mechanistic link between genotype and phenotype is still hampered by the complexity of gene-protein-reaction associations. We implement a model transformation that enables constraint-based methods to be applied at the gene level by explicitly accounting for the individual fluxes of enzymes (and subunits) encoded by each gene. We show how this can be applied to different kinds of constraint-based analysis: flux distribution prediction, gene essentiality analysis, random flux sampling, elementary mode analysis, transcriptomics data integration, and rational strain design. In each case we demonstrate how this approach can lead to improved phenotype predictions and a deeper understanding of the genotype-to-phenotype link. In particular, we show that a large fraction of reaction-based designs obtained by current strain design methods are not actually feasible, and show how our approach allows using the same methods to obtain feasible gene-based designs. We also show, by extensive comparison with experimental 13C-flux data, how simple reformulations of different simulation methods with gene-wise objective functions result in improved prediction accuracy. The model transformation proposed in this work enables existing constraint-based methods to be used at the gene level without modification. This automatically leverages phenotype analysis from reaction to gene level, improving the biological insight that can be obtained from genome-scale models.

Read more
Mende, Daniel R and Letunic, Ivica and Huerta-Cepas, Jaime and Li, Simone S and Forslund, Kristoffer and Sunagawa, Shinichi and Bork, Peer, proGenomes - a resource for consistent functional and taxonomic annotations of prokaryotic genomes., Nucleic Acids Research, gkw989, 10.1093/nar/gkw989

October 24, 2016

The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de.

Read more
Jensen, Kristian and Cardoso, Joao G.R. and Sonnenschein, Nikolaus, Optlang - An algebraic modeling language for mathematical optimization, The Journal of Open Source Software, 10.21105/joss.00139

December 06, 2016

Optlang is a Python package implementing a modeling language for solving mathematical optimization problems, i.e., maximizing or minimizing an objective function over a set of variables subject to a number of constraints. It provides a common native Python interface to a series of optimization tools, so different solver backends can be used and changed in a transparent way. Optlang’s object-oriented API takes advantage of the symbolic math library SymPy (Team 2016) to allow objective functions and constraints to be easily formulated algebraically from symbolic expressions of variables. Optlang targets scientists who can thus focus on formulating optimization problems based on mathematical equations derived from domain knowledge. Solver interfaces can be added by subclassing the four main classes of the optlang API (Variable, Constraint, Objective, and Model) and implementing the relevant API functions.

Read more
Xavier, Joana C and Patil, Kiran Raosaheb and Rocha, Isabel, Integration of Biomass Formulations of Genome-Scale Metabolic Models with Experimental Data Reveals Universally Essential Cofactors in Prokaryotes, Metabolic Engineering, 39, 200-208, 10.1016/j.ymben.2016.12.002

December 27, 2016

The composition of a cell in terms of macromolecular building blocks and other organic molecules underlies the metabolic needs and capabilities of a species. Although some core biomass components such as nucleic acids and proteins are evident for most species, the essentiality of the pool of other organic molecules, especially cofactors and prosthetic groups, is yet unclear. Here we integrate biomass compositions from 71 manually curated genome-scale models, 33 large-scale gene essentiality datasets, enzyme-cofactor association data and a vast array of publications, revealing universally essential cofactors for prokaryotic metabolism and also others that are specific for phylogenetic branches or metabolic modes. Our results revise predictions of essential genes in Klebsiella pneumoniae and identify missing biosynthetic pathways in models of Mycobacterium tuberculosis. This work provides fundamental insights into the essentiality of organic cofactors and has implications for minimal cell studies as well as for modeling genotype-phenotype relations in prokaryotic metabolic networks.

Read more
Jouhten,Paula and Huerta-Cepas, Jaime and Bork,Peer and Patil, Kiran Raosaheb, Metabolic anchor reactions for robust biorefining, Metabolic Engineering, 40, 1-4, 10.1016/j.ymben.2017.02.010

February 27, 2017

Microbial cell factories based on renewable carbon sources are fundamental to a sustainable bio-economy. The economic feasibility of producer cells requires robust performance balancing growth and production. However, the inherent competition between these two objectives often leads to instability and reduces productivity. While algorithms exist to design metabolic network reduction strategies for aligning these objectives, the biochemical basis of the growth-product coupling has remained unresolved. Here, we reveal key reactions in the cellular biochemical repertoire as universal anchor reactions for aligning cell growth and production. A necessary condition for a reaction to be an anchor is that it splits a substrate into two or more molecules. By searching the currently known biochemical reaction space, we identify 62 C‐C cleaving anchor reactions, such as isocitrate lyase (EC 4.1.3.1) and L-tryptophan indole-lyase (EC 4.1.99.1), which are relevant for biorefining. The here identified anchor reactions mark network nodes for basing growth-coupled metabolic engineering and novel pathway designs.

Read more
Hansen, Anne Sofie Lærke and Lennen, Rebecca M. and Sonnenschein, Nikolaus and Herrgård, Markus J., Systems biology solutions for biochemical production challenges, Current Opinion in Biotechnology, 45, 85-91, 10.1016/j.copbio.2016.11.018

March 16, 2017

There is an urgent need to significantly accelerate the development of microbial cell factories to produce fuels and chemicals from renewable feedstocks in order to facilitate the transition to a biobased society. Methods commonly used within the field of systems biology including omics characterization, genome-scale metabolic modeling, and adaptive laboratory evolution can be readily deployed in metabolic engineering projects. However, high performance strains usually carry tens of genetic modifications and need to operate in challenging environmental conditions. This additional complexity compared to basic science research requires pushing systems biology strategies to their limits and often spurs innovative developments that benefit fields outside metabolic engineering. Here we survey recent advanced applications of systems biology methods in engineering microbial production strains for biofuels and -chemicals.

Read more
Ataman, Meric and Gardiol, Daniel F. Hernandez and Fengos, Georgios and Hatzimanikatis, Vassily, redGEM - Systematic reduction and analysis of genome-scale metabolic reconstructions for development of consistent core metabolic models, PLOS Computational Biology, 13(7) , 10.1371/journal.pcbi.1005444

July 20, 2017

Genome-scale metabolic reconstructions have proven to be valuable resources in enhancing our understanding of metabolic networks as they encapsulate all known metabolic capabilities of the organisms from genes to proteins to their functions. However the complexity of these large metabolic networks often hinders their utility in various practical applications. Although reduced models are commonly used for modeling and in integrating experimental data, they are often inconsistent across different studies and laboratories due to different criteria and detail, which can compromise transferability of the findings and also integration of experimental data from different groups. In this study, we have developed a systematic semi-automatic approach to reduce genome-scale models into core models in a consistent and logical manner focusing on the central metabolism or subsystems of interest. The method minimizes the loss of information using an approach that combines graph-based search and optimization methods. The resulting core models are shown to be able to capture key properties of the genome-scale models and preserve consistency in terms of biomass and by-product yields, flux and concentration variability and gene essentiality. The development of these “consistently-reduced” models will help to clarify and facilitate integration of different experimental data to draw new understanding that can be directly extendable to genome-scale models.

Read more
Ataman, Meric and Hatzimanikatis, Vassily, lumpGEM - Systematic generation of subnetworks and elementally balanced lumped reactions for the biosynthesis of target metabolites, PLOS Computational Biology, 13(7) , 10.1371/journal.pcbi.1005513

July 20, 2017

In the post-genomic era, Genome-scale metabolic networks (GEMs) have emerged as invaluable tools to understand metabolic capabilities of organisms. Different parts of these metabolic networks are defined as subsystems/pathways, which are sets of functional roles to implement a specific biological process or structural complex, such as glycolysis and TCA cycle. Subsystem/pathway definition is also employed to delineate the biosynthetic routes that produce biomass building blocks. In databases, such as MetaCyc and SEED, these representations are composed of linear routes from precursors to target biomass building blocks. However, this approach cannot capture the nested, complex nature of GEMs. Here we implemented an algorithm, lumpGEM, which generates biosynthetic subnetworks composed of reactions that can synthesize a target metabolite from a set of defined core precursor metabolites. lumpGEM captures balanced subnetworks, which account for the fate of all metabolites along the synthesis routes, thus encapsulating reactions from various subsystems/pathways to balance these metabolites in the metabolic network. Moreover, lumpGEM collapses these subnetworks into elementally balanced lumped reactions that specify the cost of all precursor metabolites and cofactors. It also generates alternative subnetworks and lumped reactions for the same metabolite, accounting for the flexibility of organisms. lumpGEM is applicable to any GEM and any target metabolite defined in the network. Lumped reactions generated by lumpGEM can be also used to generate properly balanced reduced core metabolic models.

Read more
Maia, Paulo and Rocha, Isabel and Rocha, Miguel, Identification of robust strain designs via tandem pFBA/LMOMA phenotype prediction, GECCO '17 - Proceedings of the Genetic and Evolutionary Computation Conference, 1661-1668, 10.1145/3067695.3082542

July 21, 2017

The past two decades have witnessed great advances in the computational modeling and systems biology fields. Soon after the first models of metabolism were developed, methods for phenotype prediction were put forward, as well as strain optimization methods, within the field of Metabolic Engineering. Evolutionary computation has been on the front line, with the proposal of bilevel metaheuristics, where EC works over phenotype simulation, selecting the most promising solutions for bioengineering tasks. Recently, Schuetz and co-workers proposed that the metabolism of bacteria operates close to the Pareto-optimal surface of a three-dimensional space defined by competing objectives. Albeit multi-objective strain optimization approaches focused on bioengineering objectives have been proposed, none tackles the multiob-jective nature of the cellular objectives. In this work, we propose multi-objective evolutionary algorithms for strain optimization, where objective functions are defined based on distinct phenotype prediction methods, showing that those can lead to more robust designs, allowing to find solutions in more complex scenarios.

Read more
Sánchez, Benjamín J. and Zhang, Cheng and Nilsson, Avlant and Lahtvee, Petri‐Jaan and Kerkhoven, Eduard J. and Nielsen, Jens, Improving the phenotype predictions of a yeast genome‐scale metabolic model by incorporating enzymatic constraints, Molecular Systems Biology, 13(8) , 935, 10.15252/msb.20167411

August 03, 2017

Genome‐scale metabolic models (GEMs) are widely used to calculate metabolic phenotypes. They rely on defining a set of constraints, the most common of which is that the production of metabolites and/or growth are limited by the carbon source uptake rate. However, enzyme abundances and kinetics, which act as limitations on metabolic fluxes, are not taken into account. Here, we present GECKO, a method that enhances a GEM to account for enzymes as part of reactions, thereby ensuring that each metabolic flux does not exceed its maximum capacity, equal to the product of the enzyme’s abundance and turnover number. We applied GECKO to a Saccharomyces cerevisiae GEM and demonstrated that the new model could correctly describe phenotypes that the previous model could not, particularly under high enzymatic pressure conditions, such as yeast growing on different carbon sources in excess, coping with stress, or overexpressing a specific pathway. GECKO also allows to directly integrate quantitative proteomics data; by doing so, we significantly reduced flux variability of the model, in over 60% of metabolic reactions. Additionally, the model gives insight into the distribution of enzyme usage between and within metabolic pathways. The developed method and model are expected to increase the use of model‐based design in metabolic engineering.

Read more
Verwaal, R. and Buiting-Wiessenhaan, N. and Dalhuijsen, S. and Roubos, J. A., CRISPR/Cpf1 enables fast and simple genome editing of Saccharomyces cerevisiae, Yeast, 35(2) , 201-211, 10.1002/yea.3278

September 08, 2017

Cpf1 represents a novel single RNA‐guided CRISPR/Cas endonuclease system suitable for genome editing with distinct features compared with Cas9. We demonstrate the functionality of three Cpf1 orthologues – Acidaminococcus spp. BV3L6 (AsCpf1), Lachnospiraceae bacterium ND2006 (LbCpf1) and Francisella novicida U112 (FnCpf1) – for genome editing of Saccharomyces cerevisiae. These Cpf1‐based systems enable fast and reliable introduction of donor DNA on the genome using a two‐plasmid‐based editing approach together with linear donor DNA. LbCpf1 and FnCpf1 displayed editing efficiencies comparable with the CRISPR/Cas9 system, whereas AsCpf1 editing efficiency was lower. Further characterization showed that AsCpf1 and LbCpf1 displayed a preference for their cognate crRNA, while FnCpf1‐mediated editing with similar efficiencies was observed using non‐cognate crRNAs of AsCpf1 and LbCpf1. In addition, multiplex genome editing using a single LbCpf1 crRNA array is shown to be functional in yeast. This work demonstrates that Cpf1 broadens the genome editing toolbox available for Saccharomyces cerevisiae.

Read more
Ponomarova, O. and Gabrielli, N. and Sévin, D.C. and Mülleder, M. and Zirngibl, K. and Bulyha, K. and Andrejev, S. and Kafkia, E. and Typas, A. and Sauer, U. and Ralser, M. and Patil, KR, Yeast Creates a Niche for Symbiotic Lactic Acid Bacteria through Nitrogen Overflow, Cell Systems, 10.1016/j.cels.2017.09.002

September 27, 2017

Many microorganisms live in communities and depend on metabolites secreted by fellow community members for survival. Yet our knowledge of interspecies metabolic dependencies is limited to few communities with small number of exchanged metabolites, and even less is known about cellular regulation facilitating metabolic exchange. Here we show how yeast enables growth of lactic acid bacteria through endogenous, multi-component, cross-feeding in a readily established community. In nitrogen-rich environments, Saccharomyces cerevisiae adjusts its metabolism by secreting a pool of metabolites, especially amino acids, and thereby enables survival of Lactobacillus plantarum and Lactococcus lactis. Quantity of the available nitrogen sources and the status of nitrogen catabolite repression pathways jointly modulate this niche creation. We demonstrate how nitrogen overflow by yeast benefits L. plantarum in grape juice, and contributes to emergence of mutualism with L. lactis in a medium with lactose. Our results illustrate how metabolic decisions of an individual species can benefit others.

Read more
Cardoso, J. G. R. and Zeidan, A. A. and Jensen, K. and Sonnenschein, N. and Neves, A. R. and Herrgård, M. J., MARSI - metabolite analogues for rational strain improvement, Bioinformatics, 34(13) , 2319-2321, 10.1093/bioinformatics/bty108

February 23, 2018

Summary: Metabolite analogues (MAs) mimic the structure of native metabolites, can competitively inhibit their utilization in enzymatic reactions, and are commonly used as selection tools for isolating desirable mutants of industrial microorganisms. Genome-scale metabolic models representing all biochemical reactions in an organism can be used to predict effects of MAs on cellular phenotypes. Here, we present the metabolite analogues for rational strain improvement (MARSI) framework. MARSI provides a rational approach to strain improvement by searching for metabolites as targets instead of genes or reactions. The designs found by MARSI can be implemented by supplying MAs in the culture media, enabling metabolic rewiring without the use of recombinant DNA technologies that cannot always be used due to regulations. To facilitate experimental implementation, MARSI provides tools to identify candidate MAs to a target metabolite from a database of known drugs and analogues.

Read more
Cardoso, J. G. and Jensen, K. and Lieven, C. and Lærke Hansen, A. S. and Galkina, S. and Beber, M. and Özdemir, E. and Herrgård, M. J. and Redestig, H. and Sonnenschein, N., Cameo - A Python Library for Computer Aided Metabolic Engineering and Optimization of Cell Factories, ACS synthetic biology, 7(4) , 1163-1166, 10.1021/acssynbio.7b00423

March 01, 2018

Computational systems biology methods enable rational design of cell factories on a genome-scale and thus accelerate the engineering of cells for the production of valuable chemicals and proteins. Unfortunately, the majority of these methods’ implementations are either not published, rely on proprietary software, or do not provide documented interfaces, which has precluded their mainstream adoption in the field. In this work we present cameo, a platform-independent software that enables in silico design of cell factories and targets both experienced modelers as well as users new to the field. It is written in Python and implements state-of-the-art methods for enumerating and prioritizing knockout, knock-in, overexpression, and down-regulation strategies and combinations thereof. Cameo is an open source software project and is freely available under the Apache License 2.0. A dedicated Web site including documentation, examples, and installation instructions can be found at http://cameo.bio. Users can also give cameo a try at http://try.cameo.bio.

Read more
Tramontano, M. and Andrejev, S. and Pruteanu, M. and Klünemann, M. and Kuhn, M. and Galardini, M. and Jouhten, P. and Zelezniak, A. and Zeller, G. and Bork, P. and Typas, A. and Patil, K. R., Nutritional preferences of human gut bacteria reveal their metabolic idiosyncrasies, Nature microbiology, 3(4) , 514, 10.1038/s41564-018-0123-9

March 19, 2018

Bacterial metabolism plays a fundamental role in gut microbiota ecology and host–microbiome interactions. Yet the metabolic capabilities of most gut bacteria have remained unknown. Here we report growth characteristics of 96 phylogenetically diverse gut bacterial strains across 4 rich and 15 defined media. The vast majority of strains (76) grow in at least one defined medium, enabling accurate assessment of their biosynthetic capabilities. These do not necessarily match phylogenetic similarity, thus indicating a complex evolution of nutritional preferences. We identify mucin utilizers and species inhibited by amino acids and short-chain fatty acids. Our analysis also uncovers media for in vitro studies wherein growth capacity correlates well with in vivo abundance. Further value of the underlying resource is demonstrated by correcting pathway gaps in available genome-scale metabolic models of gut microorganisms. Together, the media resource and the extracted knowledge on growth abilities widen experimental and computational access to the gut microbiota.

Read more
Coelho, L. P. and Kultima, J. R. and Costea, P. I. and Fournier, C. and Pan, Y. and Czarnecki-Maulden, G. and Hayward, M. R. and Forslund, S. K. and Schmidt, T. S. B. and Descombes, P. and Jackson, J. R. and Li, Q. and Bork, P., Similarity of the dog and human gut microbiomes in gene content and response to diet, Microbiome, 6(1) , 72, 10.1186/s40168-018-0450-3

April 19, 2018

Background Gut microbes influence their hosts in many ways, in particular by modulating the impact of diet. These effects have been studied most extensively in humans and mice. In this work, we used whole genome metagenomics to investigate the relationship between the gut metagenomes of dogs, humans, mice, and pigs.

Read more
McCloskey, D. and Xu, J. and Schrübbers, L. and Christensen, H. B. and Herrgård, M. J., RapidRIP quantifies the intracellular metabolome of 7 industrial strains of E. coli, Metabolic Engineering, 47(1) , 383-392, 10.1016/j.ymben.2018.04.009

May 01, 2018

Fast metabolite quantification methods are required for high throughput screening of microbial strains obtained by combinatorial or evolutionary engineering approaches. In this study, a rapid RIP-LC-MS/MS (RapidRIP) method for high-throughput quantitative metabolomics was developed and validated that was capable of quantifying 102 metabolites from central, amino acid, energy, nucleotide, and cofactor metabolism in less than 5 minutes. The method was shown to have comparable sensitivity and resolving capability as compared to a full length RIP-LC-MS/MS method (FullRIP). The RapidRIP method was used to quantify the metabolome of seven industrial strains of E. coli revealing significant differences in glycolytic, pentose phosphate, TCA cycle, amino acid, and energy and cofactor metabolites were found. These differences translated to statistically and biologically significant differences in thermodynamics of biochemical reactions between strains that could have implications when choosing a host for bioprocessing.

Read more
Shepelin, D. and Hansen, A. S. L. and Lennen, R. and Luo, H. and Herrgård, M. J., Selecting the best - evolutionary engineering of chemical production in microbes, Genes, 9(5) , 249, 10.3390/genes9050249

May 11, 2018

Microbial cell factories have proven to be an economical means of production for many bulk, specialty, and fine chemical products. However, we still lack both a holistic understanding of organism physiology and the ability to predictively tune enzyme activities in vivo, thus slowing down rational engineering of industrially relevant strains. An alternative concept to rational engineering is to use evolution as the driving force to select for desired changes, an approach often described as evolutionary engineering. In evolutionary engineering, in vivo selections for a desired phenotype are combined with either generation of spontaneous mutations or some form of targeted or random mutagenesis. Evolutionary engineering has been used to successfully engineer easily selectable phenotypes, such as utilization of a suboptimal nutrient source or tolerance to inhibitory substrates or products. In this review, we focus primarily on a more challenging problem—the use of evolutionary engineering for improving the production of chemicals in microbes directly. We describe recent developments in evolutionary engineering strategies, in general, and discuss, in detail, case studies where production of a chemical has been successfully achieved through evolutionary engineering by coupling production to cellular growth.

Read more
Salvy, P. and Fengos, G. and Ataman, M. and Pathier, T. and Soh, K. C. and Hatzimanikatis, V., pyTFA and matTFA - A Python package and a Matlab toolbox for Thermodynamics-based Flux Analysis, Bioinformatics, 1(3) , 10.1093/bioinformatics/bty499

July 02, 2018

Summary: pyTFA and matTFA are the first published implementations of the original TFA paper. Specifically, they include explicit formulation of Gibbs energies and metabolite concentrations, which enables straightforward integration of metabolite concentration measurements.

Read more
Bahram, M. and Hildebrand, F. and Forslund, S. K. and Anderson, J. L. and Soudzilovskaia, N. A. and Bodegom, P. M. and Bengtsson-Palme, J. and Anslan, S. and Coelho, L. P. and Harend, H. and Huerta-Cepas, J. and Medema, M. H. and Maltz, M. R. and Mundra, S. and Olsson, P. A. and Pent, M. and Põlme, S. and Sunagawa, S. and Ryberg, M. and Tedersoo, L. and Bork, P., Structure and function of the global topsoil microbiome, Nature, 560(7717) , 233, 10.1038/s41586-018-0386-6

August 01, 2018

Soils harbour some of the most diverse microbiomes on Earth and are essential for both nutrient cycling and carbon storage. To understand soil functioning, it is necessary to model the global distribution patterns and functional gene repertoires of soil microorganisms, as well as the biotic and environmental associations between the diversity and structure of both bacterial and fungal soil communities1–4. Here we show, by leveraging metagenomics and metabarcoding of global topsoil samples (189 sites, 7,560 subsamples), that bacterial, but not fungal, genetic diversity is highest in temperate habitats and that microbial gene composition varies more strongly with environmental variables than with geographic distance. We demonstrate that fungi and bacteria show global niche differentiation that is associated with contrasting diversity responses to precipitation and soil pH. Furthermore, we provide evidence for strong bacterial–fungal antagonism, inferred from antibiotic-resistance genes, in topsoil and ocean habitats, indicating the substantial role of biotic interactions in shaping microbial communities. Our results suggest that both competition and environmental filtering affect the abundance, composition and encoded gene functions of bacterial and fungal communities, indicating that the relative contributions of these microorganisms to global nutrient cycling varies spatially.

Read more
Kim, O. D. and Rocha, M. and Maia, P., A review of dynamic modeling approaches and their application in computational strain optimization for metabolic engineering, Frontiers in microbiology, 10.3389/fmicb.2018.01690

August 01, 2018

Mathematical modeling is a key process to describe the behavior of biological networks. One of the most difficult challenges is to build models that allow quantitative predictions of the cells’ states along time. Recently, this issue started to be tackled through novel in silico approaches, such as the reconstruction of dynamic models, the use of phenotype prediction methods, and pathway design via efficient strain optimization algorithms. The use of dynamic models, which include detailed kinetic information of the biological systems, potentially increases the scope of the applications and the accuracy of the phenotype predictions. New efforts in metabolic engineering aim at bridging the gap between this approach and other different paradigms of mathematical modeling, as constraint-based approaches. These strategies take advantage of the best features of each method, and deal with the most remarkable limitation—the lack of available experimental information—which affects the accuracy and feasibility of solutions. Parameter estimation helps to solve this problem, but adding more computational cost to the overall process. Moreover, the existing approaches include limitations such as their scalability, flexibility, convergence time of the simulations, among others. The aim is to establish a trade-off between the size of the model and the level of accuracy of the solutions. In this work, we review the state of the art of dynamic modeling and related methods used for metabolic engineering applications, including approaches based on hybrid modeling. We describe approaches developed to undertake issues regarding the mathematical formulation and the underlying optimization algorithms, and that address the phenotype prediction by including available kinetic rate laws of metabolic processes. Then, we discuss how these have been used and combined as the basis to build computational strain optimization methods for metabolic engineering purposes, how they lead to bi-level schemes that can be used in the industry, including a consideration of their limitations.

Read more
Machado, D. and Andrejev, S. and Tramontano, M. and Patil, K. R., Fast automated reconstruction of genome-scale metabolic models for microbial species and communities, Nucleic Acids Research, 46(15) , 7542–7553, 10.1093/nar/gky537

September 06, 2018

Genome-scale metabolic models are instrumental in uncovering operating principles of cellular metabolism, for model-guided re-engineering, and unraveling cross-feeding in microbial communities. Yet, the application of genome-scale models, especially to microbial communities, is lagging behind the availability of sequenced genomes. This is largely due to the time-consuming steps of manual curation required to obtain good quality models. Here, we present an automated tool, CarveMe, for reconstruction of species and community level metabolic models. We introduce the concept of a universal model, which is manually curated and simulation ready. Starting with this universal model and annotated genome sequences, CarveMe uses a top-down approach to build single-species and community models in a fast and scalable manner. We show that CarveMe models perform closely to manually curated models in reproducing experimental phenotypes (substrate utilization and gene essentiality). Additionally, we build a collection of 74 models for human gut bacteria and test their ability to reproduce growth on a set of experimentally defined media. Finally, we create a database of 5587 bacterial models and demonstrate its potential for fast generation of microbial community models. Overall, CarveMe provides an open-source and user-friendly tool towards broadening the use of metabolic modeling in studying microbial species and communities.

Read more
Acevedo-Rocha, C. G. and Gronenberg, L. S. and Mack, M. and Commichau, F. M. and Genee, H. J., Microbial cell factories for the sustainable manufacturing of B vitamins, Current opinion in biotechnology, 10.1016/j.copbio.2018.07.006

September 13, 2018

Vitamins are essential compounds in human and animal diets. Their demand is increasing globally in food, feed, cosmetics, chemical and pharmaceutical industries. Most current production methods are unsustainable because they use non-renewable sources and often generate hazardous waste. Many microorganisms produce vitamins naturally, but their corresponding metabolic pathways are tightly regulated since vitamins are needed only in catalytic amounts. Metabolic engineering is accelerating the development of microbial cell factories for vitamins that could compete with chemical methods that have been optimized over decades, but scientific hurdles remain. Additional technological and regulatory issues need to be overcome for innovative bioprocesses to reach the market. Here, we review the current state of development and challenges for fermentative processes for the B vitamin group.

Read more
Massaiu, I. and Pasotti, L. and Sonnenschein, N. and Rama, E. and Cavaletti, M. and Magni, P. and Calvio, C. and Herrgård, M.J., Integration of enzymatic data in Bacillus subtilis genome-scale metabolic model improves phenotype predictions and enables in silico design of poly-γ-glutamic acid production strains, Microbial Cell Factories, 18(1) , 3, 10.1186/s12934-018-1052-2

January 09, 2019

Background Genome-scale metabolic models (GEMs) allow predicting metabolic phenotypes from limited data on uptake and secretion fluxes by defining the space of all the feasible solutions and excluding physio-chemically and biologically unfeasible behaviors. The integration of additional biological information in genome-scale models, e.g., transcriptomic or proteomic profiles, has the potential to improve phenotype prediction accuracy. This is particularly important for metabolic engineering applications where more accurate model predictions can translate to more reliable model-based strain design. Results Here we present a GEM with Enzymatic Constraints using Kinetic and Omics data (GECKO) model of Bacillus subtilis, which uses publicly available proteomic data and enzyme kinetic parameters for central carbon (CC) metabolic reactions to constrain the flux solution space. This model allows more accurate prediction of the flux distribution and growth rate of wild-type and single-gene/operon deletion strains compared to a standard genome-scale metabolic model. The flux prediction error decreased by 43% and 36% for wild-type and mutants respectively. The model additionally increased the number of correctly predicted essential genes in CC pathways by 2.5-fold and significantly decreased flux variability in more than 80% of the reactions with variable flux. Finally, the model was used to find new gene deletion targets to optimize the flux toward the biosynthesis of poly-γ-glutamic acid (γ-PGA) polymer in engineered B. subtilis. We implemented the single-reaction deletion targets identified by the model experimentally and showed that the new strains have a twofold higher γ-PGA concentration and production rate compared to the ancestral strain. Conclusions This work confirms that integration of enzyme constraints is a powerful tool to improve existing genome-scale models, and demonstrates the successful use of enzyme-constrained models in B. subtilis metabolic engineering. We expect that the new model can be used to guide future metabolic engineering efforts in the important industrial production host B. subtilis.

Read more
Sanchez, B.J. and Li, F. and Kerkhoven, E.J. and Nielsen, J., SLIMEr - probing flexibility of lipid metabolism in yeast with an improved constraint-based modeling framework, BMC Systems Biology, 13(1) , 4, 10.1186/s12918-018-0673-8

January 11, 2019

Background A recurrent problem in genome-scale metabolic models (GEMs) is to correctly represent lipids as biomass requirements, due to the numerous of possible combinations of individual lipid species and the corresponding lack of fully detailed data. In this study we present SLIMEr, a formalism for correctly representing lipid requirements in GEMs using commonly available experimental data. Results SLIMEr enhances a GEM with mathematical constructs where we Split Lipids Into Measurable Entities (SLIME reactions), in addition to constraints on both the lipid classes and the acyl chain distribution. By implementing SLIMEr on the consensus GEM of Saccharomyces cerevisiae, we can represent accurate amounts of lipid species, analyze the flexibility of the resulting distribution, and compute the energy costs of moving from one metabolic state to another. Conclusions The approach shows potential for better understanding lipid metabolism in yeast under different conditions. SLIMEr is freely available at https://github.com/SysBioChalmers/SLIMEr.

Read more
Hildebrand, F. and Moitinho-Silva, L. and Blasche, S. and Jahn, M.T. and Gossmann, T.I. and Huerta-Cepas, J. and Hercog, R. and Luetge, M. and Bahram, M. and Pryszlak, A. and Alves, R.J. and Waszak, S.M. and Zhu, A. and Ye, L. and Costea, P.I. and Aalvink, S. and Belzer, C. and Forslund, S.K. and Sunagawa, S. and Hentschel, U. and Merten, C. and Patil, K.R. and Benes, V. and Bork, P., Antibiotics-induced monodominance of a novel gut bacterial order, Gut, 10.1136/gutjnl-2018-317715

January 18, 2019

Objective The composition of the healthy human adult gut microbiome is relatively stable over prolonged periods, and representatives of the most highly abundant and prevalent species have been cultured and described. However, microbial abundances can change on perturbations, such as antibiotics intake, enabling the identification and characterisation of otherwise low abundant species. Design Analysing gut microbial time-series data, we used shotgun metagenomics to create strain level taxonomic and functional profiles. Community dynamics were modelled postintervention with a focus on conditionally rare taxa and previously unknown bacteria. Results In response to a commonly prescribed cephalosporin (ceftriaxone), we observe a strong compositional shift in one subject, in which a previously unknown species, UBorkfalki ceftriaxensis, was identified, blooming to 92% relative abundance. The genome assembly reveals that this species (1) belongs to a so far undescribed order of Firmicutes, (2) is ubiquitously present at low abundances in at least one third of adults, (3) is opportunistically growing, being ecologically similar to typical probiotic species and (4) is stably associated to healthy hosts as determined by single nucleotide variation analysis. It was the first coloniser after the antibiotic intervention that led to a long-lasting microbial community shift and likely permanent loss of nine commensals. Conclusion The bloom of UB. ceftriaxensis and a subsequent one of Parabacteroides distasonis demonstrate the existence of monodominance community states in the gut. Our study points to an undiscovered wealth of low abundant but common taxa in the human gut and calls for more highly resolved longitudinal studies, in particular on ecosystem perturbations.

Read more
Jensen, K. and Broeken, V. and Hansen, A.S.L. and Sonnenschein, N. and Herrgard, M.J., OptCouple - Joint simulation of gene knockouts, insertions and medium modifications for prediction of growth-coupled strain designs, Metabolic Engineering Communications, 8(1) , 10.1016/j.mec.2019.e00087

March 16, 2019

Biological production of chemicals is an attractive alternative to petrochemical-based production, due to advantages in environmental impact and the spectrum of feasible targets. However, engineering microbial strains to overproduce a compound of interest can be a long, costly and painstaking process. If production can be coupled to cell growth it is possible to use adaptive laboratory evolution to increase the production rate. Strategies for coupling production to growth, however, are often not trivial to find. Here we present OptCouple, a constraint-based modeling algorithm to simultaneously identify combinations of gene knockouts, insertions and medium supplements that lead to growth-coupled production of a target compound. We validated the algorithm by showing that it can find novel strategies that are growth-coupled in silico for a compound that has not been coupled to growth previously, as well as reproduce known growth-coupled strain designs for two different target compounds. Furthermore, we used OptCouple to construct an alternative design with potential for higher production. We provide an efficient and easy-to-use implementation of the OptCouple algorithm in the cameo Python package for computational strain design.

Read more
Buerger, J. and Gronenberg, L.S. and Genee, H.J. and Sommer, M., Wiring cell growth to product formation, Current opinion in biotechnology, 59(1) , 85-92, 10.1016/j.copbio.2019.02.014

March 28, 2019

Microbial cell factories offer new and sustainable production routes for high-value chemicals. However, identification of high producers within a library of clones remains a challenge. When product formation is coupled to growth, millions of metabolic variants can be effectively interrogated by growth selection, dramatically increasing the throughput of strain evaluation. While growth-coupled selections for cell factories have a long history of success based on metabolite auxotrophies and toxic antimetabolites, such methods are generally restricted to molecules native to their host metabolism. New synthetic biology tools offer the opportunity to rewire cellular metabolism to depend on specific and non-native products for growth.

Read more
Pereira, R. and Vilaça, P. and Maia, P. and Nielsen, J. and Rocha, I., Turnover Dependent Phenotypic Simulation - A Quantitative Constraint-Based Simulation Method That Accommodates All Main Strain Design Strategies, ACS synthetic biology, 8(5) , 976-988, 10.1021/acssynbio.8b00248

March 29, 2019

The uncertain relationship between genotype and phenotype can make strain engineering an arduous trial and error process. To identify promising gene targets faster, constraint-based modeling methodologies are often used, although they remain limited in their predictive power. Even though the search for gene knockouts is fairly established in constraint-based modeling, most strain design methods still model gene up/down-regulations by forcing the corresponding flux values to fixed levels without taking in consideration the availability of resources. Here, we present a constraint-based algorithm, the turnover dependent phenotypic simulation (TDPS) that quantitatively simulates phenotypes in a resource conscious manner. Unlike other available algorithms, TDPS does not force flux values and considers resource availability, using metabolite production turnovers as an indicator of metabolite abundance. TDPS can simulate up-regulation of metabolic reactions as well as the introduction of heterologous genes, alongside gene deletion and down-regulation scenarios. TDPS simulations were validated using engineered Saccharomyces cerevisiae strains available in the literature by comparing the simulated and experimental production yields of the target metabolite. For many of the strains evaluated, the experimental production yields were within the simulated intervals and the relative strain performance could be predicted with TDPS. However, the algorithm failed to predict some of the production changes observed experimentally, suggesting that further improvements are necessary. The results also showed that TDPS may be helpful in finding metabolic bottlenecks, but further experiments would be required to confirm these findings.

Read more
Ciurkot, K. and Vonk, B. and Gorochowski, T. and Roubos, J. and Verwaal, R., CRISPR/Cas12a Multiplex Genome Editing of Saccharomyces cerevisiae and the Creation of Yeast Pixel Art, JoVE (Journal of Visualized Experiments)(147) , e59350, 10.3791/59350

May 28, 2019

The CRISPR/Cas12a system in combination with a single crRNA array enables efficient multiplex editing of the S. cerevisiae genome at multiple loci simultaneously. This is demonstrated by constructing carotenoid producing yeast strains which are subsequently used to create yeast pixel art.

Read more
Coelho, L.P. and Alves, R. and Monteiro, P. and Huerta-Cepas, J. and Freitas, A.T. and Bork, P., NG-meta-profiler - fast processing of metagenomes using NGLess, a domain-specific language, Microbiome, 7(1) , 84, 10.1186/s40168-019-0684-8

June 03, 2019

Background Shotgun metagenomes contain a sample of all the genomic material in an environment, allowing for the characterization of a microbial community. In order to understand these communities, bioinformatics methods are crucial. A common first step in processing metagenomes is to compute abundance estimates of different taxonomic or functional groups from the raw sequencing data. Given the breadth of the field, computational solutions need to be flexible and extensible, enabling the combination of different tools into a larger pipeline. Results We present NGLess and NG-meta-profiler. NGLess is a domain specific language for describing next-generation sequence processing pipelines. It was developed with the goal of enabling user-friendly computational reproducibility. It provides built-in support for many common operations on sequencing data and is extensible with external tools with configuration files. Using this framework, we developed NG-meta-profiler, a fast profiler for metagenomes which performs sequence preprocessing, mapping to bundled databases, filtering of the mapping results, and profiling (taxonomic and functional). It is significantly faster than either MOCAT2 or htseq-count and (as it builds on NGLess) its results are perfectly reproducible. Conclusions NG-meta-profiler is a high-performance solution for metagenomics processing built on NGLess. It can be used as-is to execute standard analyses or serve as the starting point for customization in a perfectly reproducible fashion. NGLess and NG-meta-profiler are open source software (under the liberal MIT license) and can be downloaded from https://ngless.embl.de or installed through bioconda.

Read more
Vieira, V. and Maia, P. and Rocha, M. and Rocha, I., Comparison of pathway analysis and constraint-based methods for cell factory design, BMC Bioinformatics, 20(1) , 350, 10.1186/s12859-019-2934-y

June 20, 2019

Background Computational strain optimisation methods (CSOMs) have been successfully used to exploit genome-scale metabolic models, yielding strategies useful for allowing compound overproduction in metabolic cell factories. Minimal cut sets are particularly interesting since their definition allows searching for intervention strategies that impose strong growth-coupling phenotypes, and are not subject to optimality bias when compared with simulation-based CSOMs. However, since both types of methods have different underlying principles, they also imply different ways to formulate metabolic engineering problems, posing an obstacle when comparing their outputs. Results In this work, we perform an in-depth analysis of potential strategies that can be obtained with both methods, providing a critical comparison of performance, robustness, predicted phenotypes as well as strategy structure and size. To this end, we devised a pipeline including enumeration of strategies from evolutionary algorithms (EA) and minimal cut sets (MCS), filtering and flux analysis of predicted mutants to optimize the production of succinic acid in Saccharomyces cerevisiae. We additionally attempt to generalize problem formulations for MCS enumeration within the context of growth-coupled product synthesis. Strategies from evolutionary algorithms show the best compromise between acceptable growth rates and compound overproduction. However, constrained MCSs lead to a larger variety of phenotypes with several degrees of growth-coupling with production flux. The latter have proven useful in revealing the importance, in silico, of the gamma-aminobutyric acid shunt and manipulation of cofactor pools in growth-coupled designs for succinate production, mechanisms which have also been touted as potentially useful for metabolic engineering. Conclusions The two main groups of CSOMs are valuable for finding growth-coupled mutants. Despite the limitations in maximum growth rates and large strategy sizes, MCSs help uncover novel mechanisms for compound overproduction and thus, analyzing outputs from both methods provides a richer overview on strategies that can be potentially carried over in vivo.

Read more
Fernandes, B. and Dias, O. and Costa, G. and Neto, A and Resende, T. and Oliveira, J. and Riaño-Pachón, D. and Zaiat, M. and Pradella, J. and Rocha, I., Genome-wide sequencing and metabolic annotation of Pythium irregulare CBS 494.86 - understanding Eicosapentaenoic acid production, BMC Biotechnology, 19(1) , 41, 10.1186/s12896-019-0529-3

June 28, 2019

Background Pythium irregulare is an oleaginous Oomycete able to accumulate large amounts of lipids, including Eicosapentaenoic acid (EPA). EPA is an important and expensive dietary supplement with a promising and very competitive market, which is dependent on fish-oil extraction. This has prompted several research groups to study biotechnological routes to obtain specific fatty acids rather than a mixture of various lipids. Moreover, microorganisms can use low cost carbon sources for lipid production, thus reducing production costs. Previous studies have highlighted the production of EPA by P. irregulare, exploiting diverse low cost carbon sources that are produced in large amounts, such as vinasse, glycerol, and food wastewater. However, there is still a lack of knowledge about its biosynthetic pathways, because no functional annotation of any Pythium sp. exists yet. The goal of this work was to identify key genes and pathways related to EPA biosynthesis, in P. irregulare CBS 494.86, by sequencing and performing an unprecedented annotation of its genome, considering the possibility of using wastewater as a carbon source. Results Genome sequencing provided 17,727 candidate genes, with 3809 of them associated with enzyme code and 945 with membrane transporter proteins. The functional annotation was compared with curated information of oleaginous organisms, understanding amino acids and fatty acids production, and consumption of carbon and nitrogen sources, present in the wastewater. The main features include the presence of genes related to the consumption of several sugars and candidate genes of unsaturated fatty acids production. Conclusions The whole metabolic genome presented, which is an unprecedented reconstruction of P. irregulare CBS 494.86, shows its potential to produce value-added products, in special EPA, for food and pharmaceutical industries, moreover it infers metabolic capabilities of the microorganism by incorporating information obtained from literature and genomic data, supplying information of great importance to future work.

Read more
Vieira, V. and Rocha, M., CoBAMP - a Python framework for metabolic pathway analysis in constraint-based models, Bioinformatics, btz598, 10.1093/bioinformatics/btz598

July 29, 2019

Summary CoBAMP is a modular framework for the enumeration of pathway analysis concepts, such as elementary flux modes (EFM) and minimal cut sets in genome-scale constraint-based models (CBMs) of metabolism. It currently includes the K-shortest EFM algorithm and facilitates integration with other frameworks involving reading, manipulation and analysis of CBMs. Availability and implementation The software is implemented in Python 3, supported on most operating systems and requires a mixed-integer linear programming optimizer supported by the optlang framework. Source-code is available at https://github.com/BioSystemsUM/cobamp.

Read more
Cruz, F. and Lagoa, D. and Mendes, J. and Rocha, I. and Ferreira, E.C. and Rocha, M. and Dias, O., SamPler - a novel method for selecting parameters for gene functional annotation routines, BMC Bioinformatics, 20(1) , 1-11, 10.1186/s12859-019-3038-4

August 08, 2019

Background As genome sequencing projects grow rapidly, the diversity of organisms with recently assembled genome sequences peaks at an unprecedented scale, thereby highlighting the need to make gene functional annotations fast and efficient. However, the (high) quality of such annotations must be guaranteed, as this is the first indicator of the genomic potential of every organism. Automatic procedures help accelerating the annotation process, though decreasing the confidence and reliability of the outcomes. Manually curating a genome-wide annotation of genes, enzymes and transporter proteins function is a highly time-consuming, tedious and impractical task, even for the most proficient curator. Hence, a semi-automated procedure, which balances the two approaches, will increase the reliability of the annotation, while speeding up the process. In fact, a prior analysis of the annotation algorithm may leverage its performance, by manipulating its parameters, hastening the downstream processing and the manual curation of assigning functions to genes encoding proteins. Results Here SamPler, a novel strategy to select parameters for gene functional annotation routines is presented. This semi-automated method is based on the manual curation of a randomly selected set of genes/proteins. Then, in a multi-dimensional array, this sample is used to assess the automatic annotations for all possible combinations of the algorithm’s parameters. These assessments allow creating an array of confusion matrices, for which several metrics are calculated (accuracy, precision and negative predictive value) and used to reach optimal values for the parameters. Conclusions The potential of this methodology is demonstrated with four genome functional annotations performed in merlin, an in-house user-friendly computational framework for genome-scale metabolic annotation and model reconstruction. For that, SamPler was implemented as a new plugin for the merlin tool.

Read more
Lu, H. and Li, F. and Sánchez, B. and Zhu, Z. and Li, G. and Domenzain, I. and Marcišauskas, S. and Anton, P. and Lappa, D. and Lieven, C. and others, A consensus S. cerevisiae metabolic model Yeast8 and its ecosystem for comprehensively probing cellular metabolism, Nature communications, 10(1) , 1-13, 10.1038/s41467-019-11581-3

September 05, 2019

Genome-scale metabolic models (GEMs) represent extensive knowledgebases that provide a platform for model simulations and integrative analysis of omics data. This study introduces Yeast8 and an associated ecosystem of models that represent a comprehensive computational resource for performing simulations of the metabolism of Saccharomyces cerevisiae - an important model organism and widely used cell-factory. Yeast8 tracks community development with version control, setting a standard for how GEMs can be continuously updated in a simple and reproducible way. We use Yeast8 to develop the derived models panYeast8 and coreYeast8, which in turn enable the reconstruction of GEMs for 1,011 different yeast strains. Through integration with enzyme constraints (ecYeast8) and protein 3D structures (proYeast8DB), Yeast8 further facilitates the exploration of yeast metabolism at a multi-scale level, enabling prediction of how single nucleotide variations translate to phenotypic traits.

Read more

Consortium

Scientific partners

Technical University of Denmark

The sequencing, informatics, and modeling (SIM) group led by Drs. Markus Herrgard and Nikolaus Sonnenschein at the Novo Nordisk Foundation Center for Biosustainability at DTU focuses on developing and applying tools for microbial cell factory design. SIM group members have developed widely- used modeling platforms such as the COBRA (opencobra.github.io/) and MASS (opencobra.github.io/MASS-Toolbox/) toolboxes. Additionally, Dr. Herrgard has extensive experience in developing web-based software for genomic and metagenomic data mining and synthetic biology design from a leading synthetic biology company (Synthetic Genomics, Inc.).

Personnel involved:

	Markus Herrgård, Professor in Data-driven cell factory engineering at the Novo Nordisk Foundation Center for Biosustainability (CFB) at the Technical University of Denmark. He is also the Director of the iLoop Translational Core Unit at the CFB focusing on development of commercialization ready microbial cell factories. Markus has a Ph.D. in Bioengineering from the University of California, San Diego and M.Sc./B.Sc. degrees in Engineering Physics and Mathematics from Aalto University in Finland. From 2006 to 2008, he was a project leader at the University of California, San Diego focusing on systems biology of the yeast S. cerevisiae. From 2008 to 2012 Markus was a senior scientist and group leader at Synthetic Genomics, Inc. in La Jolla, CA leading a group focused on genome mining, synthetic biology design and modeling. He has been at the CFB since 2012 and has been the Director of the iLoop Unit since 2014. Markus is co-author of over 50 peer-reviewed publications with more than 5000 citations and is a co-inventor of several patents and patent applications.
	Nikolaus Sonnenschein is an in silico strain engineer and group leader at The Novo Nordisk Foundation Center for Biosustainability at the Technical University of Denmark. He is involved in the iLoop ² project that brings together people from genome engineering, informatics, modeling, robotics, fermentation, screening, and analytics to develop an accelerated strain engineering process. Before moving to Denmark, he worked as a postdoc in the Systems Biology Research Group at University of California, San Diego. Before going to the US, he received a PhD in Bioinformatics from Jacobs University Bremen in Germany.
	Ricarda Lohmann, has a degree in Political Science, Communication and Law (Technical University Dresden, Lund University and FSU Jena). She was a project manager at the European Project Center of TU Dresden and is now working as project assistant at the Center for Biosustainability (Technical University of Denmark). Her work focusses on the management of EU-projects (H2020, Tempus, LLP) and impact assessment tools.
	Elisabeth Beck Knudsen, has a degree in Social Sciences and Global Development (Roskilde University and Copenhagen University). She has worked as a project manager on several EU-projects at The National Space Institute (DTU Space) and is now working as a project assistant at the Center for Biosustainability (Technical University of Denmark). Elisabeth is the maternity cover for Ricarda Lohman.
	Svetlana Kutuzova joined the project at May 2016 in a role of software developer with main focus on designing and developing fast, reliable and easy-to-use web platform which will provided access to advanced bioinformatics algorithms. She has Masters degree in Applied Mathematics and Computer Science from Lomonosov Moscow State University and has previously worked as software developer in different domains, including e-commerce and weather forecasting.
	Moritz Beber joined the DD-DeCaF team as a post-doc in January 2017. He holds a PhD in bioinformatics and has previously researched the topology of metabolic networks, large scale dynamics of transcriptional regulation in E. coli, comparisons of manufacturing and metabolic systems, and extracting features from meta studies of toxicogenomics data. Moritz is passionate about delivering useful, well designed, and enjoyable tools to experimentalists and other developers alike. He will ensure that the DD-DeCaF platform adequately represents biological phenomena and that the individual services can be employed for bioinformatics use.
	Christian Lieven joined DD-DeCaF as a post-doctoral researcher in March 2018. He received his PhD working on the Environmentally Friendly Proein Production (EFPro2) project, for which he reconstructed a genome-scale metabolic model for the methanotroph Methylococcus capsulatus. Furthermore, by engaging the COBRA community, Christian and colleagues developed `memote`, a software for quality control of genome-scale metabolic models inspired by common software development practises. Christian is interested in emerging technologies to harness the potential of C1 feedstocks, software development, and the standardization and maintenance of genome-scale metabolic reconstructions. He seeks to create meaningful tools that enhance rather than impede the work of the biotech community through an enjoyable and self-explanatory interface.
	Ali Kaafarani joined the project in December 2017 as a software developer and operations engineer. He focuses on correctness, robustness and resilience in the DD-DeCaF platform, delivering a stable, secure and trustworthy experience. Ali has a B.E. in Computer Science from The Arctic University of Norway, where his final year project involved processing and presentation of satellite data from NASAs Direct Readout Laboratory. Before coming to Denmark, he worked as a systems developer at The Norwegian Trekking Association, promoting mountain hiking and outdoors activities.
	Henning Redestig. Henning has contributed to the project until December 2017. We wish him good luck for his future endeavors.
	Danny Dannaher. Danny has contributed to the project until September 2017. We wish him good luck for his future endeavors.

Partner website

European Molecular Biology Laboratory

PIs, P. Bork and K. Patil, have complementary research expertise and have together considerable experience in bioinformatics, specifically in function prediction, metagenomics and modeling. They have successfully delivered in >10 European consortia in FP6 and FP7 including consortia focusing on analysis of metagenomics data relevant for human health and industrial applications. They have also developed several widely-used bioinformatics tools such as String and iPath.

Personnel involved:

	Kiran Patil. M. tech. (Chemical engineering) 2002, Indian Institute of Technology, Bombay. PhD (Systems biology) 2006, then Assistant Professor, 2006–2010, Technical University of Denmark. Group leader at EMBL since 2010.
	Peer Bork is senior group leader and joint head of the Structural and Computational Biology unit at EMBL, a European research organization with headquarters in Heidelberg where he also serves as strategic head of bioinformatics. In addition, he holds an appointment at the Max-Delbrück-Center for Molecular Medicine in Berlin. Dr. Bork received his PhD in Biochemistry (1990) and his Habilitation in Theoretical Biophysics (1995). He works in various areas of computational and systems biology with a focus on function prediction, comparative analysis and data integration. He has published more than 500 research articles in international, peer-reviewed journals, among them more than 60 in Nature, Science or Cell. According to ISI (analyzing 10 years spans), Dr. Bork was for many years the most cited European researcher in Molecular Biology and Genetics and is among the top 5 worldwide in Biochemistry and Biology. He is on the editorial board of a number of journals including Science and functions as senior editor of the journal Molecular Systems Biology. Dr. Bork co-founded five successful biotech companies, two of which went public. More than 35 of his former associates now hold professorships or other group leader positions in prominent institutions all over the world. He received the "Nature award for creative mentoring" for his achievements in nurturing and stimulating young scientists and was recipient of the prestigious "Royal Society and Académie des Sciences Microsoft award" for the advancement of science using computational methods. Dr. Bork obtained two competitive ERC advanced investigator grants and is elected member of both the German national academy of sciences (Leopoldina) and the European molecular biology organization (EMBO).
	Jaime Huerta-Cepas has a degree in Biological Sciences from the Universidad Complutense of Madrid and a PhD on Molecular Biology and Evolution (CNIO and CIPF; Madrid, Valencia, 2004-2008). He worked as a postdoc at CRG (Barcelona, 2009-2014), and currently holds a staff research scientist position at EMBL (Heidelberg). His past and current research lines include i) studying the phylogenetic variability among gene families and its impact on the species taxonomy and the reconstruction of the Tree of Life ii) studying the role of gene duplication on the acquisition of novel gene functions through the use of phylogenetic and phylostratigraphic techniques iii) predicting orthology relationships between molecular sequences iv) studying horizontal gene transfer events in prokaryotes and other microbial organisims v) characterizing the genetic variability within and between microbial communities in environmental and gut metagenomic samples vi) discovering and characterizing novel gene functions and unknown organisms out of massive metagenomic data, with a focus on the discovery of novel enzymes (responsible for DD-DeCaF WP2).
	Daniel Machado has degree in Mathematics and Computer Science (University of Minho, 2006) and a PhD in Bioengineering (MIT-Portugal program, 2012). He was a post-doc at the Novo Nordisk Foundation Center for Biosustainability (Denmark), and is currently a post-doc at EMBL (Germany). His work focuses on the development of computational approaches for analysis and simulation of metabolism. His current interests include the study of microbial communities and the development of software tools for generation and simulation of microbial community models from large-scale multi-omics datasets.
	Sergej Andrejev is a scientific programmer in EMBL, Heidelberg. In his research he applies metabolic modeling, statistical analysis and other bioinformatics technics to study microbial population ecology dynamics. Recently he co-authored a paper elucidating the role of bacterial metabolic cooperation in bacterial ecology. The study demonstrated that competition is not the only major force shaping bacterial communities and metabolic cooperation is driving force behind many naturally occurring communities. His current area of interest includes metabolic interactions and population dynamics in soil, gut, ocean and others bacterial communities as well as single cells interactions in multicellular organisms.
	Luis Pedro Coelho has BS and MS degrees in Computer Science (IST, University of Lisbon, 2004 and 2006) and a PhD in computational biology from Carnegie Mellon University (2011). He is currently a postdoc at EMBL (Germany) work in the group of Peer Bork. His current work is on large-scale microbial ecology, with a focus on the use of sequencing technologies, in particular developing computational tools capable of integrating very large metagenomics datasets
	Paula Jouhten, Paula has contributed to the project until December 2017. We wish her good luck for her future endeavors.

Partner website

Chalmers University of Technology

The J. Nielsen’s group at Chalmers is one of the world leaders on genome-scale modeling of microorganisms, and statistical and model-based analysis of omics data for cell factory and human health applications. The group has reconstructed >20 models and has developed software tools for performing semi-automatic reconstruction. The group has developed several tools for simulation and integrative analysis of omics data in the context of genome-scale models (GEMs) as well as general omics analysis tools ().

Personnel involved:

	Hongzhong Lu, PhD Bioengineering in 2016 (East China University of Science and Technology), has one year of work experience in DSM biotechnology center (Shanghai) after graduation. In his PhD phase, he mainly focused on reconstruction of Aspergillus niger genome metabolic model, 13C labelled flux analysis and multi-omics analysis based on model simulation. He is interested in the big data mining based on genome scale metabolic model. He also has great interest in the reconstruction of advanced metabolic models and their innovative applications, like the ME, GEMs-PRO, etc. He is currently involved in the DD-DeCaF project to develop yeast genome metabolic model of high quality.
	Yu Chen, PhD student and will receive his PhD in Bioengineering in 2017 (East China University of Science and Technology). His research interest is in systems biology of metabolism. In his previous research, he applied constraint-based methods, especially flux balance analysis, to study physiology and metabolic regulation of industrial microorganisms. Besides, he focused on integrative analysis of omics data using genome-scale metabolic models. He is currently involved in the DD-DeCaF project to develop integrated models of metabolism and protein synthesis.
	Benjamín Sánchez, PhD student and computational biologist with an MSc degree in Biotechnology Engineering from the Pontifical Catholic University of Chile. He has interests in systems biology and mathematical modeling of biological processes, and as of 2014 he is a PhD student in the Division of Systems and Synthetic Biology in Chalmers. For his thesis, he is working on genome scale modeling of yeast, developing new tools for integration of models with omic data and for computational prediction of metabolic engineering strategies. In the DD-DeCaF project he is developing metabolic models that account for protein abundances as constraints.
	Feiran Li, PhD student with a MSc degree in Biochemical Engineering (Tianjin University, China). From 2017, she has been a PhD student in the Division of Systems and Synthetic Biology in Chalmers. She is working on genome scale modelling of yeast, with a focus on modelling of industrial proteins production and secretion through a combination of metabolic models and protein secretory models. She is currently involved in the DD-DeCaF project to develop integrated models of metabolism and protein secretion.

Partner website

École polytechnique fédérale de Lausanne

The Hatzimanikatis group has 20 years of experience in academic and industrial research for the design of metabolic pathways and whole-cell industrial biocatalysts. Dr. Ljubisa Miskovic is a senior research scientist with extensive experience with modeling and analysis of metabolic pathways. The group has developed large-scale nonlinear models for S. cerevisiae, human cells, E. coli and P. putida, which account for the thermodynamic and kinetic properties of metabolic pathways and can be used for the identification of rate-limiting steps in the absence of complete kinetic information on the pathway enzymes.

Personnel involved:

	Vassily Hatzimanikatis, has been Associate Professor of Chemical Engineering and Bioengineering at the École Polytechnique Fédérale de Lausanne (EPFL) since 2006. He received his Diploma in Chemical Engineering from the University of Patras (Greece) in 1991. In 1997, he graduated from the California Institute of Technology (Caltech) with a master (M.S.) and a doctoral (Ph.D.) degree in Chemical Engineering under the supervision of Prof. Jay Bailey. His work focused on the mathematical analysis of metabolic reaction networks and their design to achieve desired phenotypes. Professor Hatzimanikatis continued his career as a research associate in the laboratory of Prof. Jay Bailey in the Swiss Federal Institute of Technology (ETH) in Zurich (Switzerland). Between 1997 and 2000, he worked on the development of biocatalysts for the production of industrial chemicals in DuPont, Cargill, and Cargill Dow. In 2000, he became Assistant Professor of Chemical Engineering at Northwestern University in Evanston, Illinois. He held this position until he joined the EPFL. Professor Hatzimanikatis’ research interests are in the areas of systems biotechnology, bioinformatics, and metabolic engineering. He is editor in chief of Metabolic Engineering Communications and associate editor of Biotechnology and Bioengineering, and Metabolic Engineering. He has published over 100 technical articles and he is co-inventor in three patents.
	Georgios Fengos is a Post-Doc researcher at EPFL and joined the DD-Decaf project in 2017. He holds a PhD in Systems Biology from ETH Zurich, and a Diploma in Chemical Engineering from University of Patras in Greece. His research interests are in the area of Systems Biotechnology with focus on the generation and analysis of large-scale kinetic models of metabolic networks for the design of metabolic engineering strategies.
	Liliana Angeles is a Post-Doc researcher at EPFL. She holds a PhD in Chemical Engineering from the University of Manchester (UK), and an MSc in Bioprocesses from the National Polytechnic Institute (Mexico). Her research interests lie in the field of systems biology and bioengineering, with focus on the modelling of metabolic networks and complex reaction-diffusion systems.
	Zhaleh Hosseini, PhD student with an MSc degree in Biotechnology from University of Tehran. In her MSc thesis, she worked on gap filling of metabolic network models using gene expression data. As of June 2016, she is a PhD student in the Laboratory of Computational Systems Biotechnology in EPFL. She is working on development of optimization methods for analysis of thermodynamically constrained metabolic network models.
	Daniel Robert Weilandt, PhD student with an MSc degree in Microsystem Engineering from the Albert-Ludwigs University of Freiburg. In his PhD thesis he was working on dissipative particle models for cell membranes. He started in May 2016 as a PhD student in the Laboratory of Computational Systems Biotechnology in EPFL. He now works on methods to integrate enzyme data into large scale metabolic models.
	Robin Denhardt-Eriksson, PhD at student at the Laboratory of Computational Systems Biotechnology (LCSB) since 2017. He holds an MsC in Chemistry and Chemical Engineering from EPFL. His research interests are in the area of analysis and understanding of large-scale kinetic models.
	Meric Ataman. Meric has contributed to the project until November 2017. We wish him good luck for his future endeavors.

Partner website

University of Minho

The Biosystems – Systems Biology and Metabolic Engineering research group, led by Prof. Isabel Rocha has currently over 20 members working in the field of Systems Biology and Metabolic Engineering, with emphasis on the development of algorithms for strain design. Moreover, the team has expertise and experience in developing open-source user-friendly software tools such as Merlin (www.merlin-sysbio.org) and OptFlux (www.optflux.org).

Personnel involved:

	Isabel Rocha, teaches Bioinformatics and Systems Biology at Minho University where she is the PI of several projects in Systems Biology applied to Industrial Biotechnology and Life Sciences, with more than 100 papers published in international journals, books and international conferences. She did her PhD in Chemical and Biological Engineering at Minho University in Portugal and a Post-Doc in Systems Biology at the Technical University of Denmark. She spent one semester at MIT in Boston and she is part of the MIT-Portugal Program Faculty, being the head of a National program in Teaching innovation and entrepreneurship. She is co-founder and Scientific Director of both SilicoLife and Biotempo. Biotempo is focused on Bioprocess Development for the food industry while SilicoLife develops improved microbial strains for industrial applications using in silico methods.
	Miguel Rocha, is an Associate Professor at the Department of Informatics of the University of Minho, being the Director of the Master course in Bioinformatics, and teaching contents related to Bioinformatics and Data Sciences/ Machine Learning. He is also a senior researcher at the Centre of Biological Engineering, performing research in the themes Bioinformatics and Systems Biology. He has (co-)authored over 150 peer-reviewed articles, and participated in over 20 funded research projects (5 as a PI). He also coordinates several open-source software development projects including OptFlux and @Note.
	Óscar Dias, is an Invited Assistant Researcher and Professor at the Centre of Biological Engineering at the University of Minho. He completed a Doctoral Program in Chemical and Biological Engineering by the University of Minho in 2013. Before, he completed a Master's degree in Informatics and a Licentiate's degree in Biological Engineering. He (co-)authored over 20 peer-reviewed publications and participated in several funded research projects. He is the main developer of the open-source model's reconstruction platforms merlin and triage.
	Sara Correia, post-doc researcher at the Centre of Biological Engineering (CEB- University of Minho). PhD in Informatics (University of Minho, 2016), M.Sc in Bioinformatics (University of Minho, 2011) and BSc in Mathematics and Computer Science (University of Minho, 2001). She worked as software developer between 2001 and 2005 and as a high school teacher in the following five years. During the PhD, she worked in the development of computational tools for the reconstruction of tissue-specific metabolic models. She joined the DD-DeCaF project, in September 2016, to develop optimization algorithms for the design of optimal strains and microbial communities.
	Sophia Santos, is currently a PhD candidate in the MIT-Portugal doctoral program in Bioengineering at Centre of Biological Engineering (CEB - University of Minho). M.Sc in Bioinformatics (University of Minho, 2013), B.Sc in Biochemistry (University of Porto, 2011) and B.Sc in Mathematics (University of Minho, 2007). She has worked on the development of computational and experimental methods for measuring biomass composition and evaluating its impact in genome-scale models predictions and also in the integration of yeast genome-scale metabolic models. She joined the DD-DeCaF project, in July 2016, to develop optimization tools for the design of microbial communities and the to apply those tools to some case studies.

Partner website

Industrial partners

SilicoLife

SilicoLife creates computational biology solutions for the fast growing Industrial Biotechnology applications, allowing our clients to design optimized microbial strains for the cost-effective production of specific target compounds such as biofuels, chemicals, food ingredients or biopolymers. Leading chemical, materials and synthetic biology companies have chosen Silicolife as development partner to maintain their market edge in increasingly competitive markets. SilicoLife is considered a highly innovative company, including as one of the 40 Hottest Small Companies in the Advanced Bioeconomy by BiofuelsDigest 2014/15 and Startup of the week by Wired UK.

Personnel involved:

	Simão Soares, CEO SilicoLife. MSc Bioinformatics (University of Minho), postgraduate in management from NOVA School of Business and Economics and trained in Blue Ocean Strategy at INSEAD. Was visiting researcher at the Technical University of Denmark, in the integration of omics data with genome scale models. Collaborated with the UMinho in the fields of Bioinformatics and Systems Biology, with published work in computational biology top ranked journals. Member of the board of P-BIO, Portuguese Bioindustry Association and consultant for international organizations in industrial biotechnology topics.
	Paulo Vilaça, COO SilicoLife. MSc in Bioinformatics and is currently developing a PhD in Computer Science. He has a strong background in software development and engineering. Paulo is responsible for the daily management of a development team with more than 15 people working in systems biology and bioinformatics applied to industrial biotechnology problems.
	Paulo Maia, Head of Development. PhD in Bioengineering (MIT Portugal, U.Minho), MSc in Bioinformatics and BSc in Computer Science (U.Minho). His work during the past 9 years ranged from the development of machine learning approaches for feature extraction in biological datasets to the development of novel phenotype prediction and computational strain optimization methods. He is an experienced software engineer, co-developing the OptFlux workbench and several other open-source applications and co-authored various publications in the metabolic engineering arena.
	Sónia Carneiro, Head of Development. PhD in Chemical and Biological Engineering from the University of Minho, Portugal and a post-doc in Cancer Systems Biology at the Computational Genomics Laboratory in IGC and University of Minho, Portugal. Her work during the past 10 years is focused on the application of Systems Biology approaches for the characterization of metabolic bottlenecks in cellular systems. She is a co-author of several publications in research topics like recombinant bioprocesses, metabolomics and metabolic modelling.

Partner website

Genialis d.o.o.

Genialis builds data analytics software that facilitates secure and compliant management, rigorous and reproducible pipelines, and intuitive, interactive visualizations. These components are seamlessly integrated into a single system, Genialis Platform, that streamlines the collaboration between end-user and data expert. The existing software focuses on bioinformatics (RNA-seq, microarray, ChIP-seq, whole genome sequencing), however the dataflow engine can accommodate any type of data or algorithm. The Platform leverages cutting-edge web and database technologies, and its modular architecture allows for continuous evolution of sophisticated data analysis methods, including data mining. Genialis d.o.o. was founded in February 2013 in Ljubljana, Slovenia.

Personnel involved:

	Luka Ausec, bioinformatician and project manager. PhD in Biotechnology by the University of Ljubljana (2014) with a focus on metagenomic analyses and bioinformatics. Luka joined Genialis in 2015 as a leader of BioBash Workshops programme and designed various courses in bioinformatics and computing for biologists. Currently, he is overseeing the Genialis’ project management team.
	Eva Zupan Horaček, designer. Has been a designer for almost 10 years, freelancing for various companies and marketing agencies. She was also worked as junior lecturer at the Faculty of Design (University of Primorska) and as teaching assistant at the Department for landscape architecture at Biotechnical faculty (University of Ljubljana). At Genialis leads the graphic and UX design processes.
	Miha Štajdohar, CTO, developer and bioinformatician. Phd in Computer and Information Science by the University of Ljubljana (2012) focusing on visualizations and data mining, and Postdoctoral associate Baylor College of Medicine since 2014. Miha has 15 years of experience in software development and data science. After his PhD, Miha co-founded Genialis as Chief Technology Officer, where he leads the software development process.
	Nejc Škoberne, CEO. PhD computer and Information Science by the University of Ljubljana (2014). Nejc Škoberne has 16 years of experience in the IT industry. He started his career as a system and computer network administrator at Infrax d.o.o. and later continued as an information security expert and auditor at Viris d.o.o.. Nejc finished his PhD in computer science in 2013 and co-founded Genialis, where he is the CEO.
	Mátyás Fodor, obtained his BSc in Molecular Bionics at the Pázmány Péter Catholic University in Hungary. He changed career to become a web developer and gained some industrial experience. He's responsible for maintaining and developing the front end web application of the DD-DeCaf project.
	João Pita Costa. João has contributed to the project until March 2017. We wish him good luck for his future endeavors.
	Nace Kranjc. Nace has contributed to the project until March 2017. We wish him good luck for his future endeavors.

Partner website

Software Tools

OptFlux is an open-source and modular software to support in silico metabolic engineering tasks aimed at being the reference computational application in the field.

Cameo is a high-level python library developed to aid the strain design process in metabolic engineering projects.

iPath2: interactive Pathways Explorer is a web-based tool for the visualization, analysis and customization of various pathways maps.

The Transport Reactions Annotation and Generation (Triage) tool identifies the metabolites transported by each transmembrane protein and its transporter family.

@Note is a Biomedical Text Mining platform that copes with major Information Retrieval and Information Extraction tasks and promotes multi-disciplinary research.

NGLess is a domain-specific language for NGS (next-generation sequencing data) processing with a focus on metagenomics processing.

The Mass-Action Stoichiometric Simulation (MASS) toolbox is a modeling software package that focuses on the construction and analysis of kinetic and constraint-based models of biochemical reactions systems.

iTOL: interactive Tree Of Life is an online tool for the display, annotation and management of phylogenetic trees. Trees can be annotated with 14 different dataset types, and exported into various graphical formats.

The Metabolic Models Reconstruction Using Genome-Scale Information (merlin) tool is an user-friendly Java application that performs the reconstruction of genome-scale metabolic models for any organism that has its genome sequenced.

Optlang is a Python package implementing a modeling language for solving mathematical optimization problems, i.e. maximizing or minimizing an objective function over a set of variables subject to a number of constraints. Optlang provides a common interface to a series of optimization tools, so different solver backends can be changed in a transparent way.

The GECKO toolbox is a Matlab/Python package for enhancing a Genome-scale model to account for Enzyme Constraints, using Kinetics and Omics.

DD-DeCaF

What is DD-DeCaF?

Screencasts and videos

News and Social

Publications

Consortium

Scientific partners

Technical University of Denmark

Personnel involved:

European Molecular Biology Laboratory

Personnel involved:

Chalmers University of Technology

Personnel involved:

École polytechnique fédérale de Lausanne

Personnel involved:

University of Minho

Personnel involved:

Industrial partners

SilicoLife

Personnel involved:

Genialis d.o.o.

Personnel involved:

biobyte solutions GmbH

Personnel involved:

Biosyntia ApS

Personnel involved:

DSM

Personnel involved:

Software Tools