GapMind for catabolism of small carbon sources

 

Potential Gaps in catabolism of small carbon sources in Thiomicrospira cyclica ALM1

Found 198 low-confidence and 21 medium-confidence steps on the best paths for 62 pathways.

Columns: Pathway, Step (name: description), Best candidate, 2nd candidate
2-oxoglutarate kgtP: 2-oxoglutarate:H+ symporter KgtP
4-hydroxybenzoate adh: acetaldehyde dehydrogenase (not acylating) THICY_RS01540
4-hydroxybenzoate mhpD: 2-hydroxypentadienoate hydratase
4-hydroxybenzoate mhpE: 4-hydroxy-2-oxovalerate aldolase THICY_RS04720 THICY_RS04725
4-hydroxybenzoate pcaK: 4-hydroxybenzoate transporter pcaK
4-hydroxybenzoate pobA: 4-hydroxybenzoate 3-monooxygenase
4-hydroxybenzoate praA: protocatechuate 2,3-dioxygenase
4-hydroxybenzoate xylF: 2-hydroxymuconate semialdehyde hydrolase
acetate actP: cation/acetate symporter ActP THICY_RS04480
alanine TRIC: TRIC-type L-alanine transporter THICY_RS05500
arabinose araA: L-arabinose isomerase
arabinose araB: ribulokinase
arabinose araD: L-ribulose-5-phosphate epimerase
arabinose araE: L-arabinose:H+ symporter
arginine rocA: 1-pyrroline-5-carboxylate dehydrogenase
arginine rocD: ornithine aminotransferase THICY_RS06630 THICY_RS03115
arginine rocE: L-arginine permease
arginine rocF: arginase
asparagine ans: asparaginase
cellobiose cbp: cellobiose phosphorylase
cellobiose cdt: cellobiose transporter cdt-1/cdt-2
cellobiose glk: glucokinase THICY_RS05350 THICY_RS06685
citrate SLC13A5: citrate:Na+ symporter
citrulline AO353_03040: ABC transporter for L-Citrulline, ATPase component THICY_RS04390 THICY_RS05440
citrulline AO353_03045: ABC transporter for L-Citrulline, permease component 2
citrulline AO353_03050: ABC transporter for L-Citrulline, permease component 1
citrulline AO353_03055: ABC transporter for L-Citrulline, periplasmic substrate-binding component
citrulline arcC: carbamate kinase
citrulline rocA: 1-pyrroline-5-carboxylate dehydrogenase
citrulline rocD: ornithine aminotransferase THICY_RS06630 THICY_RS03115
D-alanine cycA: D-alanine:H+ symporter CycA
D-alanine dadA: D-alanine dehydrogenase
D-serine cycA: D-serine:H+ symporter CycA
D-serine dsdA: D-serine ammonia-lyase THICY_RS05450
deoxyinosine adh: acetaldehyde dehydrogenase (not acylating) THICY_RS01540
deoxyinosine deoB: phosphopentomutase
deoxyinosine deoC: deoxyribose-5-phosphate aldolase
deoxyinosine deoD: deoxyinosine phosphorylase
deoxyinosine nupC: deoxyinosine:H+ symporter NupC
deoxyribonate aacS: acetoacetyl-CoA synthetase
deoxyribonate atoB: acetyl-CoA C-acetyltransferase
deoxyribonate deoxyribonate-dehyd: 2-deoxy-D-ribonate 3-dehydrogenase
deoxyribonate deoxyribonate-transport: 2-deoxy-D-ribonate transporter
deoxyribonate ketodeoxyribonate-cleavage: 2-deoxy-3-keto-D-ribonate cleavage enzyme
deoxyribose adh: acetaldehyde dehydrogenase (not acylating) THICY_RS01540
deoxyribose deoC: deoxyribose-5-phosphate aldolase
deoxyribose deoK: deoxyribokinase
deoxyribose deoP: deoxyribose transporter
ethanol adh: acetaldehyde dehydrogenase (not acylating) THICY_RS01540
ethanol etoh-dh-nad: ethanol dehydrogenase (NAD(P)) THICY_RS05820 THICY_RS00105
fructose 1pfk: 1-phosphofructokinase
fructose fruII-ABC: fructose-specific PTS system (fructose 1-phosphate forming), EII-ABC components
fucose aldA: lactaldehyde dehydrogenase THICY_RS01540
fucose fucA: L-fuculose-phosphate aldolase FucA
fucose fucI: L-fucose isomerase FucI
fucose fucK: L-fuculose kinase FucK
fucose fucP: L-fucose:H+ symporter FucP
fucose fucU: L-fucose mutarotase FucU
galactose galK: galactokinase (-1-phosphate forming)
galactose galP: galactose:H+ symporter GalP
galactose galT: UDP-glucose:alpha-D-galactose-1-phosphate uridylyltransferase
galacturonate eda: 2-keto-3-deoxygluconate 6-phosphate aldolase
galacturonate exuT: D-galacturonate transporter ExuT
galacturonate kdgK: 2-keto-3-deoxygluconate kinase
galacturonate uxaA: D-altronate dehydratase
galacturonate uxaB: tagaturonate reductase
galacturonate uxaC: D-galacturonate isomerase
gluconate gntK: D-gluconate kinase
gluconate gntT: gluconate:H+ symporter GntT
glucosamine gamP: glucosamine PTS system, EII-CBA components (GamP/NagE)
glucosamine nagB: glucosamine 6-phosphate deaminase (isomerizing) THICY_RS00025
glucose ptsG-crr: glucose PTS, enzyme II (CBA components, PtsG)
glucose-6-P uhpT: glucose-6-phosphate:phosphate antiporter
glucuronate exuT: D-glucuronate:H+ symporter ExuT
glucuronate garL: 5-dehydro-4-deoxy-D-glucarate aldolase THICY_RS04720
glucuronate garR: tartronate semialdehyde reductase
glucuronate gci: D-glucaro-1,4-lactone cycloisomerase
glucuronate udh: D-glucuronate dehydrogenase
glutamate gdhA: glutamate dehydrogenase, NAD-dependent THICY_RS01295
glycerol glpD: glycerol 3-phosphate dehydrogenase (monomeric)
glycerol glpF: glycerol facilitator glpF
glycerol glpK: glycerol kinase
histidine hutG: N-formiminoglutamate formiminohydrolase
histidine hutH: histidine ammonia-lyase
histidine hutI: imidazole-5-propionate hydrolase
histidine hutU: urocanase
histidine permease: L-histidine permease
isoleucine acdH: (2S)-2-methylbutanoyl-CoA dehydrogenase
isoleucine Bap2: L-isoleucine permease Bap2
isoleucine ech: 2-methyl-3-hydroxybutyryl-CoA hydro-lyase
isoleucine fadA: 2-methylacetoacetyl-CoA thiolase
isoleucine ivdG: 3-hydroxy-2-methylbutyryl-CoA dehydrogenase THICY_RS05820
isoleucine ofo: branched-chain alpha-ketoacid:ferredoxin oxidoreductase, fused
isoleucine prpB: 2-methylisocitrate lyase
isoleucine prpC: 2-methylcitrate synthase THICY_RS00915
isoleucine prpD: 2-methylcitrate dehydratase
L-lactate lctO: L-lactate oxidase or 2-monooxygenase
L-malate sdlC: L-malate:Na+ symporter SdlC
lactose galK: galactokinase (-1-phosphate forming)
lactose galT: UDP-glucose:alpha-D-galactose-1-phosphate uridylyltransferase
lactose glk: glucokinase THICY_RS05350 THICY_RS06685
lactose lacP: lactose permease LacP
lactose lacZ: lactase (homomeric)
leucine aacS: acetoacetyl-CoA synthetase
leucine atoB: acetyl-CoA C-acetyltransferase
leucine leuT: L-leucine:Na+ symporter LeuT
leucine liuA: isovaleryl-CoA dehydrogenase
leucine liuB: 3-methylcrotonyl-CoA carboxylase, alpha (biotin-containing) subunit THICY_RS02025
leucine liuC: 3-methylglutaconyl-CoA hydratase
leucine liuD: 3-methylcrotonyl-CoA carboxylase, beta subunit
leucine liuE: hydroxymethylglutaryl-CoA lyase THICY_RS04725
leucine ofo: branched-chain alpha-ketoacid:ferredoxin oxidoreductase, fused
lysine amaB: L-2-aminoadipate semialdehyde dehydrogenase (AmaB/Pcd)
lysine hglS: D-2-hydroxyglutarate synthase
lysine lat: L-lysine 6-aminotransferase THICY_RS06630
lysine lysN: 2-aminoadipate transaminase THICY_RS07300 THICY_RS06630
lysine lysP: L-lysine:H+ symporter LysP
lysine ydiJ: (R)-2-hydroxyglutarate dehydrogenase THICY_RS06590
maltose ptsG-crr: glucose PTS, enzyme II (CBA components, PtsG)
maltose susB: alpha-glucosidase (maltase)
mannitol mtlA: mannitol phosphotransferase system, EII-CBA components
mannitol mtlD: mannitol-1-phosphate 5-dehydrogenase
mannose manA: mannose-6-phosphate isomerase
mannose manP: mannose PTS system, EII-CBA components
myoinositol iolB: 5-deoxy-D-glucuronate isomerase
myoinositol iolC: 5-dehydro-2-deoxy-D-gluconate kinase
myoinositol iolD: 3D-(3,5/4)-trihydroxycyclohexane-1,2-dione hydrolase
myoinositol iolE: scyllo-inosose 2-dehydratase
myoinositol iolG: myo-inositol 2-dehydrogenase
myoinositol iolJ: 5-dehydro-2-deoxyphosphogluconate aldolase THICY_RS00470
myoinositol iolT: myo-inositol:H+ symporter
myoinositol mmsA: malonate-semialdehyde dehydrogenase
NAG nagA: N-acetylglucosamine 6-phosphate deacetylase
NAG nagB: glucosamine 6-phosphate deaminase (isomerizing) THICY_RS00025
NAG nagEcba: N-acetylglucosamine phosphotransferase system, EII-CBA components
phenylacetate paaA: phenylacetyl-CoA 1,2-epoxidase, subunit A
phenylacetate paaB: phenylacetyl-CoA 1,2-epoxidase, subunit B
phenylacetate paaC: phenylacetyl-CoA 1,2-epoxidase, subunit C
phenylacetate paaE: phenylacetyl-CoA 1,2-epoxidase, subunit E
phenylacetate paaF: 2,3-dehydroadipyl-CoA hydratase
phenylacetate paaG: 1,2-epoxyphenylacetyl-CoA isomerase / 2-(oxepinyl)acetyl-CoA isomerase / didehydroadipyl-CoA isomerase
phenylacetate paaH: 3-hydroxyadipyl-CoA dehydrogenase
phenylacetate paaJ1: 3-oxo-5,6-dehydrosuberyl-CoA thiolase
phenylacetate paaJ2: 3-oxoadipyl-CoA thiolase
phenylacetate paaK: phenylacetate-CoA ligase
phenylacetate paaT: phenylacetate transporter Paa
phenylacetate paaZ1: oxepin-CoA hydrolase
phenylacetate paaZ2: 3-oxo-5,6-didehydrosuberyl-CoA semialdehyde dehydrogenase
phenylalanine aacS: acetoacetyl-CoA synthetase
phenylalanine aroP: L-phenylalanine:H+ symporter AroP
phenylalanine atoB: acetyl-CoA C-acetyltransferase
phenylalanine fahA: fumarylacetoacetate hydrolase
phenylalanine hmgA: homogentisate dioxygenase
phenylalanine HPD: 4-hydroxyphenylpyruvate dioxygenase
phenylalanine maiA: maleylacetoacetate isomerase THICY_RS07195
phenylalanine PAH: phenylalanine 4-monooxygenase
phenylalanine PCBD: pterin-4-alpha-carbinolamine dehydratase THICY_RS01255
proline ectP: proline transporter EctP THICY_RS00890 THICY_RS08430
proline put1: proline dehydrogenase
proline putA: L-glutamate 5-semialdehyde dehydrogenase
propionate prpB: 2-methylisocitrate lyase
propionate prpC: 2-methylcitrate synthase THICY_RS00915
propionate prpD: 2-methylcitrate dehydratase
putrescine gabT: gamma-aminobutyrate transaminase THICY_RS06630 THICY_RS03115
putrescine patA: putrescine aminotransferase (PatA/SpuC) THICY_RS06630
putrescine patD: gamma-aminobutyraldehyde dehydrogenase THICY_RS01540
putrescine puuP: putrescine:H+ symporter PuuP/PlaP
pyruvate dctM: pyruvate TRAP transporter, large permease component THICY_RS07360
pyruvate dctQ: pyruvate TRAP transporter, small permease component THICY_RS07355
rhamnose aldA: lactaldehyde dehydrogenase THICY_RS01540
rhamnose LRA1: L-rhamnofuranose dehydrogenase THICY_RS05820 THICY_RS00105
rhamnose LRA2: L-rhamnono-gamma-lactonase
rhamnose LRA3: L-rhamnonate dehydratase
rhamnose LRA4: 2-keto-3-deoxy-L-rhamnonate aldolase THICY_RS04720
rhamnose rhaT: L-rhamnose:H+ symporter RhaT
ribose rbsK: ribokinase
ribose rbsU: probable D-ribose transporter RbsU
serine sdaB: L-serine ammonia-lyase THICY_RS05450
serine serP: L-serine permease SerP
sorbitol mtlA: PTS system for polyols, EII-CBA components
sorbitol srlD: sorbitol 6-phosphate 2-dehydrogenase THICY_RS05820
sucrose scrK: fructokinase THICY_RS05350
sucrose SUS: sucrose synthase
sucrose sut: sucrose:proton symporter SUT/SUC
threonine tdcC: L-threonine:H+ symporter TdcC
threonine tdh: L-threonine 3-dehydrogenase
threonine tynA: aminoacetone oxidase
thymidine adh: acetaldehyde dehydrogenase (not acylating) THICY_RS01540
thymidine deoA: thymidine phosphorylase DeoA THICY_RS02255
thymidine deoB: phosphopentomutase
thymidine deoC: deoxyribose-5-phosphate aldolase
thymidine nupG: thymidine permease NupG/XapB
trehalose glk: glucokinase THICY_RS05350 THICY_RS06685
trehalose PsTP: trehalose phosphorylase
trehalose TRET1: facilitated trehalose transporter Tret1
tryptophan aroP: tryptophan:H+ symporter AroP
tryptophan tnaA: tryptophanase
tyrosine aacS: acetoacetyl-CoA synthetase
tyrosine aroP: L-tyrosine transporter (AroP/FywP)
tyrosine atoB: acetyl-CoA C-acetyltransferase
tyrosine fahA: fumarylacetoacetate hydrolase
tyrosine hmgA: homogentisate dioxygenase
tyrosine HPD: 4-hydroxyphenylpyruvate dioxygenase
tyrosine maiA: maleylacetoacetate isomerase THICY_RS07195
valine acdH: isobutyryl-CoA dehydrogenase
valine Bap2: L-valine permease Bap2
valine bch: 3-hydroxyisobutyryl-CoA hydrolase
valine ech: (S)-3-hydroxybutanoyl-CoA hydro-lyase
valine mmsA: methylmalonate-semialdehyde dehydrogenase
valine mmsB: 3-hydroxyisobutyrate dehydrogenase
valine ofo: branched-chain alpha-ketoacid:ferredoxin oxidoreductase, fused
valine prpB: 2-methylisocitrate lyase
valine prpC: 2-methylcitrate synthase THICY_RS00915
valine prpD: 2-methylcitrate dehydratase
xylitol fruI: xylitol PTS, enzyme IIABC (FruI)
xylitol x5p-reductase: D-xylulose-5-phosphate 2-reductase
xylose xylA: xylose isomerase
xylose xylB: xylulokinase
xylose xylT: D-xylose transporter
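
Each row above follows a regular shape: a one-word pathway, a step name ending in a colon, a free-text description, and up to two locus tags. The following is a minimal Python sketch for splitting such a row into fields. The names `StepRow`, `parse_row`, and `LOCUS_TAG` are hypothetical, and the locus-tag pattern is inferred from the rows shown here rather than taken from any GapMind specification.

```python
import re
from typing import List, NamedTuple


class StepRow(NamedTuple):
    pathway: str
    step: str
    description: str
    candidates: List[str]  # best candidate first, then 2nd candidate (if any)


# Assumption: candidate locus tags in this table look like THICY_RS01540.
LOCUS_TAG = re.compile(r"^THICY_RS\d+$")


def parse_row(line: str) -> StepRow:
    """Split one flattened table row into pathway, step, description, candidates."""
    # The pathway is the first space-delimited token (e.g. "glucose-6-P").
    pathway, rest = line.strip().split(" ", 1)
    # The step name ends at the first ": "; colons inside descriptions such as
    # "2-oxoglutarate:H+" are not followed by a space, so they are left alone.
    step, rest = rest.split(": ", 1)
    tokens = rest.split()
    candidates: List[str] = []
    # Trailing tokens that look like locus tags are the candidate columns.
    while tokens and LOCUS_TAG.match(tokens[-1]):
        candidates.insert(0, tokens.pop())
    return StepRow(pathway, step, " ".join(tokens), candidates)


row = parse_row(
    "4-hydroxybenzoate mhpE: 4-hydroxy-2-oxovalerate aldolase "
    "THICY_RS04720 THICY_RS04725"
)
print(row.step, row.candidates)
```

Rows with an empty candidate list are steps for which no candidate is listed in either column.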

This GapMind analysis is from Apr 09 2024. The underlying query database was built on Sep 17 2021.

About GapMind

Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.

A candidate for a step is "high confidence" if either:

where "other" refers to the best ublast hit to a sequence that is not annotated as performing this step (and is not "ignored").

Otherwise, a candidate is "medium confidence" if either:

Other blast hits with at least 50% coverage are "low confidence."

Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:

GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
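
The gap rule stated above (a step with no high- or medium-confidence candidate may be considered a gap) can be illustrated with a short Python sketch. The function name `find_gaps`, the input layout, and the step names in the example are hypothetical and are not part of GapMind's own code or data; the sketch assumes each candidate has already been assigned a confidence label.

```python
from typing import Dict, List


def find_gaps(step_confidence: Dict[str, List[str]]) -> List[str]:
    """Return the steps that have no high- or medium-confidence candidate.

    step_confidence maps a step name to the confidence labels ("high",
    "medium", or "low") of its candidates; an empty list means no candidates.
    """
    return [
        step
        for step, levels in step_confidence.items()
        if not any(level in ("high", "medium") for level in levels)
    ]


# Hypothetical example data, not taken from the table on this page.
example = {
    "stepA": ["low"],            # only a low-confidence candidate: a gap
    "stepB": ["medium", "low"],  # has a medium-confidence candidate: not a gap
    "stepC": [],                 # no candidates at all: a gap
}
print(find_gaps(example))  # ['stepA', 'stepC']
```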

For more information, see:

If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know.

by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory