GapMind for catabolism of small carbon sources

 

Potential Gaps in catabolism of small carbon sources in Methanosarcina soligelidi SMA-21

Found 182 low-confidence and 27 medium-confidence steps on the best paths for 62 pathways.

Pathway Step Best candidate 2nd candidate
2-oxoglutarate kgtP: 2-oxoglutarate:H+ symporter KgtP
4-hydroxybenzoate adh: acetaldehyde dehydrogenase (not acylating) M886_RS14685 M886_RS09200
4-hydroxybenzoate mhpD: 2-hydroxypentadienoate hydratase
4-hydroxybenzoate mhpE: 4-hydroxy-2-oxovalerate aldolase M886_RS00515
4-hydroxybenzoate pcaK: 4-hydroxybenzoate transporter pcaK
4-hydroxybenzoate pobA: 4-hydroxybenzoate 3-monooxygenase
4-hydroxybenzoate praA: protocatechuate 2,3-dioxygenase
4-hydroxybenzoate xylF: 2-hydroxymuconate semialdehyde hydrolase
arabinose araA: L-arabinose isomerase
arabinose araB: ribulokinase
arabinose araD: L-ribulose-5-phosphate epimerase
arabinose araE: L-arabinose:H+ symporter
arginine gabT: gamma-aminobutyrate transaminase M886_RS04440 M886_RS14680
arginine patA: putrescine aminotransferase (PatA/SpuC) M886_RS04440 M886_RS14680
arginine patD: gamma-aminobutyraldehyde dehydrogenase M886_RS14685 M886_RS01390
arginine rocE: L-arginine permease
asparagine ans: asparaginase M886_RS09020
cellobiose bgl: cellobiase
cellobiose ptsG-crr: glucose PTS, enzyme II (CBA components, PtsG)
citrate icd: isocitrate dehydrogenase M886_RS00365 M886_RS02350
citrate SLC13A5: citrate:Na+ symporter
citrulline AO353_03040: ABC transporter for L-Citrulline, ATPase component M886_RS07115 M886_RS14640
citrulline AO353_03045: ABC transporter for L-Citrulline, permease component 2
citrulline AO353_03050: ABC transporter for L-Citrulline, permease component 1
citrulline AO353_03055: ABC transporter for L-Citrulline, periplasmic substrate-binding component M886_RS07105
citrulline arcC: carbamate kinase
citrulline rocA: 1-pyrroline-5-carboxylate dehydrogenase M886_RS14685 M886_RS01390
D-alanine cycA: D-alanine:H+ symporter CycA
D-alanine dadA: D-alanine dehydrogenase
D-lactate D-LDH: D-lactate dehydrogenase M886_RS03825 M886_RS06205
D-lactate lctP: D-lactate:H+ symporter LctP or LidP
D-serine cycA: D-serine:H+ symporter CycA
D-serine dsdA: D-serine ammonia-lyase
deoxyinosine adh: acetaldehyde dehydrogenase (not acylating) M886_RS14685 M886_RS09200
deoxyinosine deoB: phosphopentomutase M886_RS16195 M886_RS05030
deoxyinosine deoC: deoxyribose-5-phosphate aldolase
deoxyinosine nupC: deoxyinosine:H+ symporter NupC
deoxyribonate deoxyribonate-dehyd: 2-deoxy-D-ribonate 3-dehydrogenase
deoxyribonate deoxyribonate-transport: 2-deoxy-D-ribonate transporter
deoxyribonate garK: glycerate 2-kinase
deoxyribonate ketodeoxyribonate-cleavage: 2-deoxy-3-keto-D-ribonate cleavage enzyme
deoxyribose adh: acetaldehyde dehydrogenase (not acylating) M886_RS14685 M886_RS09200
deoxyribose deoC: deoxyribose-5-phosphate aldolase
deoxyribose deoK: deoxyribokinase
deoxyribose deoP: deoxyribose transporter
ethanol adh: acetaldehyde dehydrogenase (not acylating) M886_RS14685 M886_RS09200
ethanol etoh-dh-nad: ethanol dehydrogenase (NAD(P)) M886_RS11280 M886_RS00920
fructose 1pfk: 1-phosphofructokinase
fructose fruII-ABC: fructose-specific PTS system (fructose 1-phosphate forming), EII-ABC components
fucose aldA: lactaldehyde dehydrogenase M886_RS14685 M886_RS01390
fucose fucA: L-fuculose-phosphate aldolase FucA M886_RS05100
fucose fucI: L-fucose isomerase FucI
fucose fucK: L-fuculose kinase FucK
fucose fucP: L-fucose:H+ symporter FucP
fucose fucU: L-fucose mutarotase FucU
fumarate dctA: fumarate:H+ symporter DctA M886_RS14085
galactose galK: galactokinase (-1-phosphate forming) M886_RS06255
galactose galP: galactose:H+ symporter GalP
galactose galT: UDP-glucose:alpha-D-galactose-1-phosphate uridylyltransferase M886_RS00085
galacturonate exuT: D-galacturonate transporter ExuT
galacturonate gci: D-galactarolactone cycloisomerase
galacturonate gli: D-galactarolactone isomerase
galacturonate kdgD: 5-dehydro-4-deoxyglucarate dehydratase M886_RS03355
galacturonate udh: D-galacturonate dehydrogenase
gluconate gnd: 6-phosphogluconate dehydrogenase, decarboxylating
gluconate gntK: D-gluconate kinase
gluconate gntT: gluconate:H+ symporter GntT
glucosamine gamP: glucosamine PTS system, EII-CBA components (GamP/NagE)
glucosamine nagB: glucosamine 6-phosphate deaminase (isomerizing) M886_RS16190
glucose ptsG-crr: glucose PTS, enzyme II (CBA components, PtsG)
glucose-6-P uhpT: glucose-6-phosphate:phosphate antiporter
glucuronate exuT: D-glucuronate:H+ symporter ExuT
glucuronate gci: D-glucaro-1,4-lactone cycloisomerase
glucuronate kdgD: 5-dehydro-4-deoxyglucarate dehydratase M886_RS03355
glucuronate udh: D-glucuronate dehydrogenase
glutamate gdhA: glutamate dehydrogenase, NAD-dependent M886_RS16490
glycerol glpD: glycerol 3-phosphate dehydrogenase (monomeric)
glycerol glpF: glycerol facilitator glpF
glycerol glpK: glycerol kinase
histidine hutG: N-formiminoglutamate formiminohydrolase
histidine hutH: histidine ammonia-lyase
histidine hutI: imidazole-5-propionate hydrolase
histidine hutU: urocanase
histidine permease: L-histidine permease
isoleucine acdH: (2S)-2-methylbutanoyl-CoA dehydrogenase
isoleucine Bap2: L-isoleucine permease Bap2
isoleucine ech: 2-methyl-3-hydroxybutyryl-CoA hydro-lyase
isoleucine fadA: 2-methylacetoacetyl-CoA thiolase M886_RS01560
isoleucine ivdG: 3-hydroxy-2-methylbutyryl-CoA dehydrogenase
isoleucine prpB: 2-methylisocitrate lyase
isoleucine prpC: 2-methylcitrate synthase M886_RS05060
isoleucine prpD: 2-methylcitrate dehydratase
L-lactate L-LDH: L-lactate dehydrogenase M886_RS07235
L-lactate lctP: L-lactate:H+ symporter LctP or LidP
L-malate sdlC: L-malate:Na+ symporter SdlC
lactose galK: galactokinase (-1-phosphate forming) M886_RS06255
lactose galT: UDP-glucose:alpha-D-galactose-1-phosphate uridylyltransferase M886_RS00085
lactose glk: glucokinase
lactose lacP: lactose permease LacP
lactose lacZ: lactase (homomeric)
leucine leuT: L-leucine:Na+ symporter LeuT
leucine liuA: isovaleryl-CoA dehydrogenase
leucine liuB: 3-methylcrotonyl-CoA carboxylase, alpha (biotin-containing) subunit M886_RS06565
leucine liuC: 3-methylglutaconyl-CoA hydratase
leucine liuD: 3-methylcrotonyl-CoA carboxylase, beta subunit
leucine liuE: hydroxymethylglutaryl-CoA lyase
lysine amaB: L-2-aminoadipate semialdehyde dehydrogenase (AmaB/Pcd) M886_RS14685 M886_RS01390
lysine hglS: D-2-hydroxyglutarate synthase
lysine lysN: 2-aminoadipate transaminase M886_RS15865 M886_RS09335
lysine lysP: L-lysine:H+ symporter LysP
lysine ydiJ: (R)-2-hydroxyglutarate dehydrogenase M886_RS03825
maltose ptsG-crr: glucose PTS, enzyme II (CBA components, PtsG)
maltose susB: alpha-glucosidase (maltase)
mannitol mtlA: mannitol phosphotransferase system, EII-CBA components
mannitol mtlD: mannitol-1-phosphate 5-dehydrogenase
mannose manP: mannose PTS system, EII-CBA components
myoinositol iolB: 5-deoxy-D-glucuronate isomerase
myoinositol iolC: 5-dehydro-2-deoxy-D-gluconate kinase
myoinositol iolD: 3D-(3,5/4)-trihydroxycyclohexane-1,2-dione hydrolase M886_RS00510
myoinositol iolE: scyllo-inosose 2-dehydratase
myoinositol iolG: myo-inositol 2-dehydrogenase
myoinositol iolJ: 5-dehydro-2-deoxyphosphogluconate aldolase
myoinositol iolT: myo-inositol:H+ symporter
myoinositol mmsA: malonate-semialdehyde dehydrogenase M886_RS14685 M886_RS01390
NAG nagA: N-acetylglucosamine 6-phosphate deacetylase
NAG nagB: glucosamine 6-phosphate deaminase (isomerizing) M886_RS16190
NAG nagEcba: N-acetylglucosamine phosphotransferase system, EII-CBA components
phenylacetate paaA: phenylacetyl-CoA 1,2-epoxidase, subunit A
phenylacetate paaB: phenylacetyl-CoA 1,2-epoxidase, subunit B
phenylacetate paaC: phenylacetyl-CoA 1,2-epoxidase, subunit C
phenylacetate paaE: phenylacetyl-CoA 1,2-epoxidase, subunit E
phenylacetate paaF: 2,3-dehydroadipyl-CoA hydratase
phenylacetate paaG: 1,2-epoxyphenylacetyl-CoA isomerase / 2-(oxepinyl)acetyl-CoA isomerase / didehydroadipyl-CoA isomerase
phenylacetate paaH: 3-hydroxyadipyl-CoA dehydrogenase
phenylacetate paaJ1: 3-oxo-5,6-dehydrosuberyl-CoA thiolase
phenylacetate paaJ2: 3-oxoadipyl-CoA thiolase
phenylacetate paaT: phenylacetate transporter Paa
phenylacetate paaZ1: oxepin-CoA hydrolase
phenylacetate paaZ2: 3-oxo-5,6-didehydrosuberyl-CoA semialdehyde dehydrogenase
phenylalanine aroP: L-phenylalanine:H+ symporter AroP
phenylalanine fahA: fumarylacetoacetate hydrolase
phenylalanine hmgA: homogentisate dioxygenase
phenylalanine HPD: 4-hydroxyphenylpyruvate dioxygenase
phenylalanine maiA: maleylacetoacetate isomerase
phenylalanine PAH: phenylalanine 4-monooxygenase
phenylalanine PCBD: pterin-4-alpha-carbinolamine dehydratase
phenylalanine QDPR: 6,7-dihydropteridine reductase
proline proY: proline:H+ symporter
proline put1: proline dehydrogenase
proline putA: L-glutamate 5-semialdehyde dehydrogenase M886_RS14685 M886_RS01390
propionate prpB: 2-methylisocitrate lyase
propionate prpC: 2-methylcitrate synthase M886_RS05060
propionate prpD: 2-methylcitrate dehydratase
propionate prpE: propionyl-CoA synthetase M886_RS11665 M886_RS09460
propionate putP: propionate transporter; proline:Na+ symporter
putrescine gabT: gamma-aminobutyrate transaminase M886_RS04440 M886_RS14680
putrescine patA: putrescine aminotransferase (PatA/SpuC) M886_RS04440 M886_RS14680
putrescine patD: gamma-aminobutyraldehyde dehydrogenase M886_RS14685 M886_RS01390
putrescine puuP: putrescine:H+ symporter PuuP/PlaP
pyruvate SLC5A8: sodium-coupled pyruvate transporter
rhamnose aldA: lactaldehyde dehydrogenase M886_RS14685 M886_RS01390
rhamnose rhaA: L-rhamnose isomerase
rhamnose rhaB: L-rhamnulokinase
rhamnose rhaD: rhamnulose 1-phosphate aldolase
rhamnose rhaM: L-rhamnose mutarotase
rhamnose rhaT: L-rhamnose:H+ symporter RhaT
ribose rbsK: ribokinase
ribose rbsU: probable D-ribose transporter RbsU
serine sdaB: L-serine ammonia-lyase
serine serP: L-serine permease SerP
sorbitol mtlA: PTS system for polyols, EII-CBA components
sorbitol srlD: sorbitol 6-phosphate 2-dehydrogenase M886_RS13680
sucrose scrK: fructokinase M886_RS11895
sucrose SUS: sucrose synthase
sucrose sut: sucrose:proton symporter SUT/SUC
threonine aldA: lactaldehyde dehydrogenase M886_RS14685 M886_RS01390
threonine L-LDH: L-lactate dehydrogenase M886_RS07235
threonine tdcC: L-threonine:H+ symporter TdcC
threonine tdh: L-threonine 3-dehydrogenase M886_RS11280 M886_RS00920
threonine tynA: aminoacetone oxidase
threonine yvgN: methylglyoxal reductase (NADPH-dependent)
thymidine adh: acetaldehyde dehydrogenase (not acylating) M886_RS14685 M886_RS09200
thymidine deoA: thymidine phosphorylase DeoA M886_RS14870
thymidine deoB: phosphopentomutase M886_RS16195 M886_RS05030
thymidine deoC: deoxyribose-5-phosphate aldolase
thymidine nupG: thymidine permease NupG/XapB
trehalose ptsG-crr: glucose PTS, enzyme II (CBA components, PtsG)
trehalose treF: trehalase
tryptophan aroP: tryptophan:H+ symporter AroP
tryptophan tnaA: tryptophanase
tyrosine aroP: L-tyrosine transporter (AroP/FywP)
tyrosine fahA: fumarylacetoacetate hydrolase
tyrosine hmgA: homogentisate dioxygenase
tyrosine HPD: 4-hydroxyphenylpyruvate dioxygenase
tyrosine maiA: maleylacetoacetate isomerase
valine acdH: isobutyryl-CoA dehydrogenase
valine Bap2: L-valine permease Bap2
valine bch: 3-hydroxyisobutyryl-CoA hydrolase
valine ech: (S)-3-hydroxybutanoyl-CoA hydro-lyase
valine mmsA: methylmalonate-semialdehyde dehydrogenase M886_RS14685 M886_RS01390
valine mmsB: 3-hydroxyisobutyrate dehydrogenase
valine prpB: 2-methylisocitrate lyase
valine prpC: 2-methylcitrate synthase M886_RS05060
valine prpD: 2-methylcitrate dehydratase
xylitol fruI: xylitol PTS, enzyme IIABC (FruI)
xylitol x5p-reductase: D-xylulose-5-phosphate 2-reductase
xylose xylA: xylose isomerase
xylose xylB: xylulokinase
xylose xylT: D-xylose transporter
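
The rows above follow a regular layout: pathway, step name, a colon, the step description, and then zero, one, or two candidate locus tags. A minimal sketch of parsing that layout is shown below, assuming the plain-text form of the table as it appears here. The function names and the locus-tag pattern (a prefix followed by `_RS` and digits) are illustrative assumptions and are not part of GapMind. The confidence coloring of each candidate is not recoverable from this text, so the sketch only separates steps that list a candidate from steps that list none.

```python
import re
from collections import defaultdict

# Assumed locus-tag shape, e.g. M886_RS14685 (prefix, "_RS", digits).
LOCUS_TAG = re.compile(r"^[A-Za-z0-9]+_RS\d+$")


def parse_row(line):
    """Split one table row into pathway, step, description, and candidates."""
    # The step name ends at the first colon followed by a space; later colons
    # (as in "2-oxoglutarate:H+ symporter") belong to the description.
    head, description = line.split(": ", 1)
    pathway, step = head.split(" ", 1)
    words = description.split()
    candidates = []
    # Peel locus tags off the end of the description, preserving their order.
    while words and LOCUS_TAG.match(words[-1]):
        candidates.insert(0, words.pop())
    return {
        "pathway": pathway,
        "step": step,
        "description": " ".join(words),
        "candidates": candidates,
    }


def steps_without_candidates(rows):
    """Group the steps that list no candidate at all, keyed by pathway."""
    missing = defaultdict(list)
    for row in rows:
        if not row["candidates"]:
            missing[row["pathway"]].append(row["step"])
    return dict(missing)


sample = [
    "2-oxoglutarate kgtP: 2-oxoglutarate:H+ symporter KgtP",
    "4-hydroxybenzoate adh: acetaldehyde dehydrogenase (not acylating) M886_RS14685 M886_RS09200",
    "4-hydroxybenzoate mhpD: 2-hydroxypentadienoate hydratase",
]
rows = [parse_row(line) for line in sample]
print(steps_without_candidates(rows))
```

Applied to the full table, the same two functions would list, per pathway, the steps for which no candidate protein was reported.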

Confidence levels: high confidence, medium confidence, low confidence

This GapMind analysis is from Sep 24 2021. The underlying query database was built on Sep 17 2021.

About GapMind

Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.

A candidate for a step is "high confidence" if either:

where "other" refers to the best ublast hit to a sequence that is not annotated as performing this step (and is not "ignored").

Otherwise, a candidate is "medium confidence" if either:

Other blast hits with at least 50% coverage are "low confidence."
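
The tiering described above, together with the gap rule stated in the next paragraph, can be sketched as a small decision function. This is only an illustration: the specific high- and medium-confidence criteria are not listed in this text, so the sketch takes them as booleans supplied by the caller. The only threshold taken from the text is the 50% coverage cutoff for low confidence. All names here are hypothetical.

```python
from typing import Iterable, Optional

# From the text: other blast hits with at least 50% coverage are low confidence.
LOW_CONFIDENCE_MIN_COVERAGE = 0.50


def confidence_tier(meets_high: bool, meets_medium: bool,
                    coverage: float) -> Optional[str]:
    """Return the tier for one candidate, or None if it does not qualify.

    meets_high and meets_medium are assumed to be computed elsewhere,
    because their criteria are not spelled out in this text.
    """
    if meets_high:
        return "high"
    if meets_medium:
        return "medium"
    if coverage >= LOW_CONFIDENCE_MIN_COVERAGE:
        return "low"
    return None


def is_gap(tiers: Iterable[Optional[str]]) -> bool:
    """A step is a gap if no candidate is high or medium confidence."""
    return not any(tier in ("high", "medium") for tier in tiers)


print(confidence_tier(False, False, 0.62))  # a hit that only passes the coverage cutoff
print(is_gap(["low", None]))                # no high- or medium-confidence candidate
```

Under this reading, a step with only low-confidence candidates still counts as a gap, which matches the table above, where steps appear both with and without candidates.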

Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:

GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
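
For readers unfamiliar with the term, a six-frame translation translates a DNA sequence in all three reading frames on each strand, so that a protein missed by gene prediction can still be found. The sketch below illustrates the idea with the standard codon table. It is not the code used by GapMind or Curated BLAST, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translation(dna):
    """Translate three frames on the forward strand and three on the reverse."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


for name, protein in six_frame_translation("ATGGCCTAA").items():
    print(name, protein)
```

Stop codons are kept as `*` so that open reading frames can be located within each translated frame.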

If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know.

by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory