To systematically evaluate the relationship between enzyme expression and flux, we required a dataset that (i) has both flux and enzyme expression data acquired from the same samples; (ii) provides accurate flux values spread across a metabolic network, i.e., not just confined to core carbon metabolism; and (iii) involves multiple conditions to ensure a statistically meaningful analysis. It is difficult to obtain data that covers a broad range of reactions as traditional methods, such as determining flux through complete isotope balancing, are typically limited to core carbon metabolic pathways (Gopalakrishnan and Maranas,
2015; Kharchenko et al,
2005; Zamboni et al,
2009). Alternatively, network-level flux distributions can be predicted using Flux Balance Analysis (FBA), which incorporates various measured rates as constraints (Antoniewicz,
2015; Hackett et al,
2016). However, the capability to determine flux is frequently restricted by the comprehensiveness of the measured rates. Consequently, only a few studies to date have achieved nearly comprehensive network-level flux determination (Hackett et al,
2016; Kochanowski et al,
2021; Lahtvee et al,
2017). Our literature survey led us to conclude that only a yeast dataset (Hackett et al,
2016) fulfilled the three requirements (Table
EV1). In this dataset, flux estimates are available for 232 metabolic reactions, with 156 associated measurements of enzyme levels, across 25 conditions in which different nutrients (glucose, leucine, uracil, phosphate, nitrogen) were limited and the growth rate was titrated (Datasets EV1-EV3).
An enhanced flux potential analysis algorithm that translates relative enzyme expression to relative flux
The observation of pathway-level association between flux and enzyme levels suggests that flux predictions may be more effective when focused on pathways in the network neighborhood of a ROI. We reasoned that the most relevant pathways in this neighborhood are likely to extend along reactions that have a linear connection to the ROI, where fluxes are strictly coupled. The linear connections are reflected in the network architecture and can be quantified by the network connectivity degree of metabolites (i.e., the total number of sources and sinks for the metabolite, Fig.
2A). A degree of two indicates strict linearity, and low degrees may generally suggest higher likelihood of flux coupling. To test whether metabolite degree is indicative of flux-expression association, we inspected the ‘pairwise cross-informing rate’, which defines how likely the flux or expression of one reaction is correlated with the flux or expression of a connected reaction (FDR < 0.05), given the degree of the bridging metabolite that links the two reactions. Indeed, reactions connected by metabolites with a low degree are more likely to cross-inform each other for both flux and expression as well as expression-flux correlation (Fig.
2B). Although the clarity of this association diminishes with higher degrees, i.e., becoming notably noisier beyond a degree of about 8 (Fig.
2B), these findings support the use of metabolite degree as a heuristic approach to define pathways related to each ROI. Such heuristics can then be incorporated into our FPA algorithm for enhancing network integration.
FPA calculates the relative flux potential (rFP) of an ROI based on relative enzyme mRNA levels across conditions (Yilmaz et al,
2020) (Appendix Text
S2). FPA is formulated as an FBA problem whose objective function is the maximal flux of an ROI and whose maximized objective value is named
flux potential. The key concept of FPA is to convert relative enzyme levels into weight coefficients such that the flux through a reaction is penalized when corresponding enzyme levels are low (Fig.
EV2A, Methods) (Yilmaz et al,
2020). The resulting flux potential value is further normalized by its theoretical maximum, i.e., the flux potential in a hypothetical condition that expresses all enzymes at the highest observed level, to obtain a dimensionless rFP that ranges between 0 and 1. FPA uses a distance decay function with which the influence of enzyme levels decreases with distance from the ROI. To fine-tune FPA, we incorporated metabolite degree to utilize flux routes that are both flux-balanced and low in metabolite degree, which can be conceptualized as the related pathways of the ROI (Appendix Text
S1) (Fig.
EV2B). In addition, we formulated a new distance decay function with a
distance boundary parameter to precisely control the maximum length of the related pathways where the expression data is integrated, which we hereafter call
integrated pathways (Figs.
2C and
EV2C, Appendix Text
S1). We refer to this algorithm as enhanced FPA (eFPA).
Using eFPA, we dissected the association between changes in flux and enzyme expression at various scales of integrated pathways (Fig.
2C). We set each of the 232 reactions for which flux was measured (Hackett et al,
2016) as ROI, and titrated distance boundary from zero to the maximum reaction distance in the network, thus varying pathway definition from the sole ROI to the entire network. For most reactions, we found that the correlation between rFP and flux varied with distance boundary (Figs.
2D and
EV3). We refer to the boundary where the correlation is maximal as the
optimal distance boundary and to the corresponding predictions as the results of
optimal-boundary eFPA. Overall, the flux of 44% (101/232) of the reactions could be significantly correlated with rFP (Appendix Fig.
S2A). Importantly, with an optimal boundary, eFPA improved the correlation for most reactions, further confirming that flux changes are associated with enzyme expression in related pathways (Fig.
2E). Most reactions showed high correlation when the distance boundary was set between 2 and 6 (Fig.
EV3), and the total number of significantly correlated reactions peaked at a distance boundary of 4 (Fig.
2F, yellow curve). As expected, the correlation with flux diminished when gene expression was solely considered for the ROI (Fig.
2F, blue and purple curve). These observations together indicate that where flux and expression levels are associated, this relationship is mostly localized to short pathways.
Ideally, different distance boundaries need to be considered for each ROI. However, this is not practical. Therefore, we developed a robust variant of eFPA that is not sensitive to the choice of distance boundary for different reactions, and thus can be broadly used for different expression datasets. We optimized the distance decay function of eFPA and found that an exponential decay (Fig.
EV2C) helped to predict relative flux for a broader range of distance boundaries (Fig.
2F, red curve). We refer to the use of exponential decay function with a distance boundary of 6 as “standard eFPA”, which is independent of the fitted optimal distance boundaries. To test the predictive power of this tool more directly, we asked if robust flux predictions can be obtained even when enzyme levels of the ROI were ignored. Indeed, only using levels of enzymes catalyzing reactions other than the ROI predicted reaction fluxes equally well as when ROI’s enzyme levels were included (Fig.
2G). In addition, we evaluated the significance of eFPA predictions by a randomization that shuffled the reaction labels in the input expression data. This analysis clearly showed that pathway-level flux-expression association is an intrinsic property of the flux-expression relationship and not a result of overfitting (Fig.
2H; Appendix Fig.
S2B). Overall, eFPA correctly predicted the relative flux of a large proportion of metabolic reactions, including ROIs for which enzyme level measurements were not available (
gap-filling predictions) and ROIs for which their own enzyme levels did not correlate with flux (
overriding predictions) (Fig.
2I). The remaining 130 reactions likely represent cases where fluxes are not controlled or correlated with enzyme expression. Instead, these fluxes may be driven by metabolite levels or regulated through mechanisms such as allostery, which are not directly influenced by changes in enzyme expression.
We evaluated the predictive power of standard eFPA through a benchmark analysis against alternative methods (Fig.
3A). Standard eFPA outperformed three recent algorithms, ΔFBA (Ravi and Gunawan,
2021), Compass (Wagner et al,
2021) and REMI (Pandey et al,
2019), all of which predict flux changes based on variations in enzyme expression. A critical factor in this performance increase is the selective integration of ROI-proximate network neighborhoods: removing the distance decay from FPA diminished its power, whereas incorporating a decay function into Compass boosted its predictivity (Fig.
3A). In addition, adjusting the distance order parameter in FPA, corresponding to the distance boundary in eFPA (Fig.
EV2C), was insufficient to match the predictive power achieved by eFPA (Fig.
3B), further emphasizing the significance of the enhancements in our approach (Yilmaz et al,
2020). This difference is largely driven by the weighted distance (Fig.
3B), which incorporates the metabolite degree heuristics (Fig.
EV2B). For instance, FPA predictions showed a poor correlation with flux changes for r_0939 in tyrosine biosynthesis, a reaction without available enzyme level data, whereas eFPA achieved significantly improved accuracy (Fig.
3C). Analysis of the integration process revealed that FPA predictions were influenced by protein expressions from glycolysis (e.g., r_0892) and the electron transport chain (r_0770), which were unrelated to the flux of the ROI (r_0939). These reactions appeared proximal in the network under the standard distance metric, compromising the original FPA predictions. However, eFPA effectively corrected for this issue by using weighted distance, which assigned larger values to these reactions due to high-degree metabolites (e.g., pyruvate) in the pathways connecting them to the ROI. Consistent with these observations, incorporating a weighted distance into FPA nearly restored the correlation, demonstrating its critical role in improving predictive accuracy (Fig.
3C).
To further illustrate how eFPA predicts flux, we used the proline synthesis pathway as an example (Fig.
3D). This pathway includes four reactions with only the first reaction, r_0468, showing statistically significant flux-enzyme correlation (Fig.
3D). However, eFPA predictions significantly correlated with relative fluxes in all reactions of this pathway, including r_1887, which lacks associated enzymes and therefore can only be predicted by data integration along the pathway. Although the correlation between flux and rFP values was weakened by conditions with inconsistent expression levels and flux (e.g., red data points, Fig.
3D), it remained significant due to conditions with consistently high expression levels corresponding to high flux (dark blue data points, Fig.
3D). Notably, the enzyme levels for the last pathway reaction, r_0957, showed minimal variation across the 25 conditions and no meaningful correlation with flux (Fig.
3D), suggesting that this reaction is not regulated by enzyme expression. Thus, eFPA can not only predict flux for reactions lacking associated enzymes but also for those not regulated by their enzyme levels. Overall, eFPA demonstrated optimal performance for this pathway: while reaction expression gave the best correlation for a single reaction, eFPA achieved significant correlations across all reactions despite compromises from integrating data from poorly correlated reactions. The original FPA algorithm also captured some flux-expression relationships but performed suboptimally (Fig.
3D). Taken together, these results suggest that metabolic flux changes correlate more efficiently with coordinated enzyme expression changes in pathways, a principle computationally best realized in the eFPA algorithm.
eFPA enables metabolic analysis at single-cell level
Single-cell RNA-seq (scRNA-seq) is an emerging method that delineates the transcriptomes of individual cells (Kolodziejczyk et al,
2015). However, this method often has low sensitivity, which can result in numerous zero counts of gene expression. These zero counts are frequently caused by dropout events, where an mRNA is present in the cell but fails to be detected due to technical limitations, rather than its true absence (Luecken and Theis,
2019). Since standard eFPA integrates not only the expression of genes associated with a ROI, but also the expression levels from enzymes catalyzing surrounding reactions, we wondered whether it could significantly enhance the metabolic analysis of scRNA-seq data beyond what was achievable with previous approaches. We extracted a diverse dataset from the Tabula Sapiens database (Tabula Sapiens et al,
2022) that includes 2264 cells across 23 cell types from 13 organs, and integrated this data with the Human 1 model using each of the 3002 cytosolic reactions as ROI (Fig.
5A). We compared the results with two other metrics: solely the expression levels of enzymes associated with each ROI, and Compass (Wagner et al,
2021), which integrates network reactions but does not incorporate a distance decay. The original Compass method offers a partial lumping of multiple cells (i.e., information sharing) to mitigate the sensitivity issue. To test the integrations at the actual single-cell level, we skipped this approach in our implementation and named it Compass-.
We hypothesized that eFPA achieves an optimal integration balance between reaction expression and Compass-. Reaction expression has limited analytical depth as it does not consider information beyond the ROI. In contrast, Compass- does incorporate network-wide gene expression information, but this can potentially dilute local signals by over-emphasizing distant gene expression. Given that eFPA is designed to selectively integrate local, pathway-level gene expression data, we reasoned that it would be the most effective approach for generating biologically meaningful predictions. Initial tests using Principal Components Analysis (PCA) supported this: eFPA predictions showed better clustering and retained higher explained variance in the top principal components compared to ROI expression alone (Fig.
5B,C). Although Compass- explained an even broader variance due to its comprehensive network approach, it tended to blur the distinctions among cell types in the PCA plot (Fig.
5B, e.g., orange cells became less distinguishable from blue cells). Thus, eFPA strikes a balance, capturing enough variance for clear clustering without over-generalizing, which can obscure finer distinctions among cell types.
We next counted the number of cell-type-enriched metabolic functions predicted by the three approaches. We conservatively defined such predictions as reaction fluxes that are significantly higher in a particular cell type compared to the main population, with a Fold Change (FC) greater than 1.2 and a
p-value less than 1E−10 (Fig.
5D). This approach resulted in numerous reactions that are enriched in many cell types, with liver hepatocytes, heart muscle cells, and bone marrow erythroid progenitors having the highest predicted metabolic activity (Fig.
5E). In all cell types, eFPA, either alone or in agreement with one or both of the other two methods, covered the majority of predictions. Reaction expression alone also produced a significant number of cell-type-enriched predictions, but Compass- alone typically yielded a much smaller number, except for cell types of high general metabolic activity. Importantly, there were only few predictions shared by Compass- and reaction expression but not by eFPA (Fig.
5E, brown).
We reasoned that predictions solely made by reaction expression are less reliable because it means that expression levels from enzymes that catalyze surrounding reactions do not support the ROI expression. In other words, if the expression changes of surrounding reactions were coherent with that of a ROI, the flux potential of that ROI would then also have been captured by eFPA and/or Compass-. Given that eFPA recalled vast majority of other predictions, we next focused on the predictions that were not always in agreement with the other two methods and identified those predictions that could also be verified with existing knowledge (Table
1). We studied these instances in detail to understand why eFPA recalled them as true positives, while one or both of the other methods did not. We determined several expected scenarios that are particularly challenging for reaction expression (Fig.
5F–I) or Compass- (Fig.
5J,K).
For example, reaction expression often fails to detect active ROIs when the associated enzyme appears unexpressed likely due to dropout events (Fig.
5F). This approach also struggles when an enzyme functions in multiple pathways, causing all associated ROIs to seem equally active (Fig.
5G). Complications also arise with multiple isozymes, including paralogs with uncertain annotation and isozymes active in different cell types, which can obscure the true activity signal of the ROI (Fig.
5H). Moreover, reaction expression cannot account for ROIs without associated enzymes, such as non-enzymatic reactions or those catalyzed by unidentified enzymes (Fig.
5I). Both eFPA and Compass- address these issues by integrating gene expression from other network reactions alongside the ROI. However, Compass- might dilute these local signals since it considers the entire network (Fig.
5J). Further, even when few genes are integrated, the involvement of hub metabolites, which participate in numerous processes, can introduce irrelevant gene expression data (Fig.
5K). Several specific examples illustrate how these issues manifest and how eFPA effectively resolves the introduced complexities (Fig.
6).
The first example focuses on two ROIs in glycogen metabolism (Fig.
6A), one producing (MAR05398) and the other converting (MAR01380) phosphorylase-limit dextrin (dxtrn). Both reactions were predicted by eFPA to have high flux potential in all three muscle cell types, a prediction not as strongly supported by the other two methods (Fig.
6B), but consistent with general knowledge (Cusso et al,
2003; Testoni et al,
2017). To understand the mechanism of prediction, we focused on two fast muscle cells, which have opposite levels of flux potentials for MAR05398 according to eFPA and reaction expression (Fig.
6A,B; the two cells are represented by a square and a circle). For Cell #1 (square), eFPA predicted high flux in MAR05398 even though the reaction expression was low (Fig.
6B). Indeed, we found that this cell exhibits high expression (low penalty) throughout the pathway, except for two reactions including this ROI (Fig.
6A). These two reactions share the same gene-protein-reaction (GPR) association, and we reasoned that the lack of their expression may be due to technical dropouts (Fig.
5F). Thus, the eFPA prediction reflects the integration of the relevant pathway and avoids potential misinterpretation from noise. In contrast, Cell #2 shows low flux potential in MAR05398, but high expression (Fig.
6B, circle). However, this high expression likely reflects the activity of the other reaction that shares the same GPR with MAR05398, but is located in the dxtrn degradation branch of the pathway (Figs.
5G and
6A). Indeed, reactions along the degradation branch are well expressed, while the branch that produces dxtrn is poorly expressed in Cell #2 (Fig.
6A). Consistently, the flux potential of MAR01380 on the degradation branch is predicted to be high by eFPA in this cell (Fig.
6B).
Meanwhile, the potential for both reactions in most skeletal muscle cells was masked in Compass- integration due to strong negative signals (i.e., low expression) from network reactions such as diphosphatase that acts as a source of phosphate (pi) (Figs.
5K and
6A). As with the original FPA (Yilmaz et al,
2020), eFPA does not assume proximity of reactions through side metabolites such as pi and associates reactions connected by these to the ROI with a high distance, thus filtering out their influences. Overall, the eFPA algorithm perfectly captured the human interpretation of glycogen metabolism at the pathway-level for specific ROIs and was the only resource to provide reliable predictions for all muscle cells.
The second example (Fig.
6C) involves two ROIs that produce (MAR02551) and modify (MAR02553) leukotriene B5 (LT-B5). These reactions were consistently, and likely correctly (Strasser et al,
1985; von Schacky et al,
1990), predicted to be most active in neutrophils solely by eFPA (Fig.
6D). The flux potential of MAR02553 was underestimated by both reaction expression and Compass- for distinct reasons. For reaction expression, we found that the prediction was complicated by the involvement of multiple cytochrome p450 genes, each linked to dozens of other reactions across the metabolic network (Fig.
6E). Only one of these genes is significantly expressed in neutrophils (Fig.
6E, CYP4F3). Consequently, this example shows a severe case for both ambiguous GPR with multiple genes (Fig.
5H) and parallel reactions with similar GPR (Fig.
5G), where the interpretation of reaction expression is often confounded and the effective assessment of the likelihood of activity requires contextual analysis within the network, as done by eFPA and Compass-. However, Compass- failed to recognize this activity due to a network constraint that allows the product of MAR02553, 20OH LT-B5, to drain only if various other arachidonates are simultaneously produced, as there is a single reaction which ties them together (Fig.
6C, MAM10003). This requirement creates an extreme case for signal dilution (Fig.
5J) in Compass- integration, which uniformly processes all network reactions.
Both Compass- and reaction expression performed reasonably well in predicting the activity of the other reaction, MAR02551 (Fig.
6C,D), aided by a clear transport system for unmodified LT-B5 and a GPR involving a single gene (LTA4H). Nevertheless, only eFPA consistently ranked neutrophils highest, with a robust and realistic distribution of predicted activities across this cell population (Fig.
6D). Overall, this example highlights eFPA as a robust predictor, demonstrating better performance not only in the presence of GPR ambiguities and network complexities but also in more straightforward scenarios.
The third example examines the two-step conversion of IMP to AMP through adenosyl succinate (dcamp) by the ROIs MAR04042 and MAR04412 (Fig.
6F). This conversion is known to occur in muscle cells as part of the purine nucleotide cycle (Lowenstein and Goodman,
1978), which is clear for the first reaction from reaction expression (Fig.
6G). However, reaction expression failed to recognize the high activity of muscle cells for the second reaction (MAR04412) and Compass- failed for both reactions (Fig.
6G). The prediction of the second reaction is complicated due to the involvement of genes shared with another reaction in purine biosynthesis, MAR04812 (Figs.
5G,
6F,H). These shared genes are more prominently expressed in erythroid progenitors, likely reflecting the significant role of MAR04812 in their purine biosynthesis (Maynard et al,
2024), which was successfully predicted by all three approaches. However, reaction expression inaccurately suggests that erythroid progenitors as the primary site of MAR04412 activity because it cannot differentiate between concurrent pathways (Fig.
5G). Meanwhile, Compass- failed to predict both MAR04042 and MAR04412 because it emphasizes the general involvement of hub metabolites (Fig.
5K), specifically aspartate and AMP in this context (Fig.
6F), which skewed the integration. By contextualizing these reactions within a pathway-integration framework, eFPA once again outperforms other methods, providing accurate predictions at both the cell and cell-type levels that the other methods only partially capture (Fig.
6G,I, Table
1).
Multiple other examples (Table
1) demonstrate that many metabolic processes in expected cell types were overlooked or failed to be properly differentiated by either reaction expression (Fig.
EV4A–E), or Compass- (Fig.
EV4F–K), but not by eFPA. Additionally, numerous reactions lack associated genes (Fig.
5I), with their flux potentials thus only predictable by eFPA and Compass- (Fig.
EV4L–O). In these instances, eFPA was typically better at discriminating the correct cell types from others, as highlighted by the example of eumelanin production in melanocytes (Fig.
EV4l). Finally, there were multiple metabolic processes assigned to the correct cell type by all three methods, including glycolysis and creatine conversions in muscle cells, cardiolipin-coA formation in heart, and kynurenine metabolism in liver (data not shown). Collectively, our scRNA-seq analysis shows the robustness and effectiveness of eFPA, confirming its ability to accurately identify and predict metabolic activities in single cells.
Finally, to validate the general applicability of the improvements that were made to our integration algorithm based on yeast data, we conducted a repeat analysis using the original FPA on single-cell datasets (Fig.
EV5, Table
EV2). Although the PCA outcomes were similar (Figs.
EV5A and
5B), likely reflecting the focus of data integration around the ROIs, FPA covered fewer cell-type-specific predictions than eFPA (Figs.
EV5B and
5E). Furthermore, FPA failed to predict many metabolic activities successfully identified by eFPA, with no instances where FPA significantly outperformed eFPA (Fig.
EV5C,D). For example, the activity of LT-B5 metabolism reactions MAR02553 and MAR02551 in blood neutrophils was overlooked by FPA (Figs.
5C,D and
6C,D), and predictions of dextrin synthesis (MAR05398) and degradation (MAR01380) in fast muscle cells were not as precise (Fig.
5C,D and
6A,B). These findings underscore eFPA as the most effective tool for integrating single-cell data to predict metabolic activities.