Addressing unknown constants and metabolic network behaviors through petascale computing: understanding H2 production in green algae

2007 
The Genomics Revolution has resulted in a massive and growing quantity of whole-genome DNA sequences, which encode the metabolic catalysts necessary for life. However, gene annotations are rarely complete, and measurement of the kinetic constants associated with the encoded enzymes cannot possibly keep pace, necessitating the use of careful modeling to explore plausible network behaviors. Key challenges are (1) quantitatively formulating kinetic laws governing each transformation in a fixed model network; (2) characterizing the stable solution (if any) of the associated ordinary differential equations (ODEs); (3) fitting the latter to metabolomics data as they become available; and (4) optimizing a model output against the possible space of kinetic parameters, with respect to properties such as robustness of network response, or maximum consumption/production. This SciDAC-2 project addresses this large-scale uncertainty in the genome-scale metabolic network of the water-splitting, H2-producing green alga Chlamydomonas reinhardtii. Each metabolic transformation is formulated as an irreversible steady-state process, such that the vast literature on known enzyme mechanisms may be incorporated directly. To start, glycolysis, the tricarboxylic acid cycle, and basic fermentation pathways have been encoded in Systems Biology Markup Language (SBML) with careful annotation and consistency with the KEGG database, yielding a model with 3 compartments, 95 species, 38 reactions, and 109 kinetic constants. To study and optimize such models with a view toward larger models, we have developed a system that takes an SBML model as input and automatically produces C code that, when compiled and executed, optimizes the model's kinetic parameters according to test criteria. We describe the system and present numerical results. 
Further development, including the overlay of a parallel multistart algorithm, will allow optimization of thousands of parameters on high-performance systems ranging from distributed grids to unified petascale architectures.
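
As a rough illustration of challenges (1) through (4), the following minimal sketch fits the kinetic constants of a toy two-step chain S -> I -> P, with each step written as an irreversible Michaelis-Menten rate law. Everything in it is an assumption made for illustration: the toy network, the parameter values and bounds, the function names, and the serial multistart with a random local search standing in for whatever optimizer the project actually uses. It is written in Python for brevity, whereas the system described above emits C, and it does not read SBML.

```python
import random

# Hypothetical search bounds for (Vmax1, Km1, Vmax2, Km2); not from the paper.
BOUNDS = [(0.1, 3.0), (0.1, 2.0), (0.1, 3.0), (0.1, 2.0)]
# Hypothetical "true" constants, used only to generate synthetic data.
TRUE_PARAMS = [1.5, 0.5, 1.0, 0.8]


def mm_rate(vmax, km, s):
    """Irreversible Michaelis-Menten rate v = Vmax * s / (Km + s)."""
    s = max(s, 0.0)  # guard against tiny negative concentrations
    return vmax * s / (km + s)


def simulate(params, s0=1.0, t_end=5.0, dt=0.02):
    """Integrate S -> I -> P with classical RK4; return P at t = 1, 2, ..., t_end."""
    vmax1, km1, vmax2, km2 = params

    def deriv(y):
        s, i, _p = y
        v1 = mm_rate(vmax1, km1, s)
        v2 = mm_rate(vmax2, km2, i)
        return (-v1, v1 - v2, v2)

    def shifted(y, k, h):
        return tuple(a + h * b for a, b in zip(y, k))

    y = (s0, 0.0, 0.0)
    per_unit = round(1.0 / dt)  # integration steps per unit of time
    samples = []
    for n in range(1, round(t_end / dt) + 1):
        k1 = deriv(y)
        k2 = deriv(shifted(y, k1, 0.5 * dt))
        k3 = deriv(shifted(y, k2, 0.5 * dt))
        k4 = deriv(shifted(y, k3, dt))
        y = tuple(a + dt / 6.0 * (b + 2 * c + 2 * d + e)
                  for a, b, c, d, e in zip(y, k1, k2, k3, k4))
        if n % per_unit == 0:
            samples.append(y[2])
    return samples


def sse(params, data):
    """Sum of squared errors between simulated and observed product levels."""
    return sum((m - d) ** 2 for m, d in zip(simulate(params), data))


def local_search(start, data, rng, iters=40, step=0.3):
    """Random multiplicative perturbations; keep a trial only if it lowers the error."""
    best, best_err = list(start), sse(start, data)
    for _ in range(iters):
        trial = [min(hi, max(lo, x * (1.0 + rng.uniform(-step, step))))
                 for x, (lo, hi) in zip(best, BOUNDS)]
        err = sse(trial, data)
        if err < best_err:
            best, best_err = trial, err
    return best, best_err


def multistart(data, n_starts=6, seed=0):
    """Run local_search from several random starts and return the best (params, error)."""
    rng = random.Random(seed)
    results = []
    for _ in range(n_starts):
        start = [rng.uniform(lo, hi) for lo, hi in BOUNDS]
        results.append(local_search(start, data, rng))
    return min(results, key=lambda r: r[1])


if __name__ == "__main__":
    observed = simulate(TRUE_PARAMS)  # synthetic stand-in for metabolomics data
    fitted, error = multistart(observed)
    print("fitted constants:", fitted)
    print("sum of squared errors:", error)
```

Because the starts share no state beyond the random number generator, each one could in principle be handed to a separate process or node, which is the property a parallel multistart scheme relies on. Several different parameter sets may fit the synthetic data about equally well, so the fitted constants should not be expected to match TRUE_PARAMS.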