GapMind for catabolism of small carbon sources

 

Definition of L-tryptophan catabolism

As text, or see rules and steps

# Tryptophan degradation in GapMind is based on MetaCyc degradation pathways
# I via anthranilate (metacyc:TRPCAT-PWY),
# II via pyruvate (metacyc:TRYPDEG-PWY),
# or IX via 3-hydroxyanthranilate (metacyc:PWY-5655).
# Pathway XII (metacyc:PWY-6505) overlaps with pathway I and is also represented.
# The other MetaCyc pathways do not yield fixed carbon or are not reported in
# prokaryotes, and are not included.
# For example, pathway IV yields indole-3-lactate, which could potentially be oxidized to
# indole-3-acetate, which has a known catabolic pathway,
# but no prokaryotes are known to consume tryptophan
# this way.
# Pathway VIII yields tryptophol (also known as indole-3-ethanol),
# which could potentially be oxidized to indole-3-acetate and consumed.
# Pathways X and XIII yield indole-3-propionate, which may
# spontaneously oxidize to kynurate, but kynurate catabolism is not reported.

# Four-part transporter (bioMNY-type) with two ATPase (A1/A2) components, a transmembrane (T) component,
# and a substrate-specific (S) component.
# The A1A2T components are not specific to tryptophan.
trpP	energy-coupling factor transporter, tryptophan-specific (S) component TrpP	curated:SwissProt::O07515

# Orthologs may be annotated as ecfA1 or cbiO; these have the same function
# (but might not be part of a tryptophan transporting system).
ecfA1	energy-coupling factor transporter, ATPase 1 (A1) component	uniprot:P40735	ignore:TCDB::Q839D5	ignore:SwissProt::Q035B2	ignore:SwissProt::A2RI01	ignore:SwissProt::Q5M243	ignore:TCDB::Q03PY5	ignore:metacyc::G12WB-158-MONOMER	ignore:SwissProt::Q03ZL6	

# Orthologs may be annotated as ecfA2 or cbiO2; these have the same function
ecfA2	energy-coupling factor transporter, ATPase 2 (A2) component	uniprot:P70970	ignore:TCDB::Q839D4	ignore:SwissProt::Q5M244	ignore:SwissProt::A2RI02	ignore:SwissProt::Q035B3	ignore:TCDB::Q03PY6	ignore:metacyc::G12WB-159-MONOMER	ignore:SwissProt::Q03ZL5

# Ignore EcfT proteins from other bacteria
ecfT	energy-coupling factor transporter, transmembrane (T) component	curated:SwissProt::P70972	ignore:TCDB::I7BV49	ignore:SwissProt::Q5M245	ignore:SwissProt::A2RI03	ignore:SwissProt::Q035B4	ignore:metacyc::G12WB-160-MONOMER	ignore:TCDB::Q03PY7	ignore:SwissProt::Q03ZL4

# Transporters were identified using
# query: transporter:tryptophan:L-tryptophan:trp
tryptophan-transport: trpP ecfA1 ecfA2 ecfT

# AO353_05930 (A0A0N9WG97) from Pseudomonas fluorescens FW300-N2E3 is related to aroP and
#   is specifically improtant for tryptophan utilization (although more so if Trp is the nitrogen source).
# PfGW456L13_4291 (A0A293QSB2) from P. fluorescens GW4560-L13 is related to aroP and
#   is specifically important for tryptophan utilization.
#	Unfortunately A0A293QSB2 is no longer in UniProt; the closest remaining sequence is
#	Q4KIP0 (only 87% identical).
# FywP (A2RMP5) may well be a tryptophan transporter as well, so ignore.
aroP	tryptophan:H+ symporter AroP	curated:TCDB::P15993	curated:TCDB::F2HN33	curated:TCDB::Q2VQZ4	curated:TCDB::Q46065	uniprot:A0A0N9WG97	uniprot:Q4KIP0	ignore:SwissProt::A2RMP5	ignore:reanno::pseudo5_N2C3_1:AO356_18530
tryptophan-transport: aroP

tnaB	tryptophan:H+ symporter TnaB	curated:CharProtDB::CH_000783	curated:SwissProt::Q02DS7	curated:TCDB::P0AAD2
tryptophan-transport: tnaB

TAT	tryptophan permease	curated:CharProtDB::CH_091156	curated:CharProtDB::CH_091631
tryptophan-transport: TAT

tnaT	tryptophan:Na+ symporter TnaT	curated:TCDB::O50649
tryptophan-transport: tnaT

# Serine/threonine exchangers and non-specific eukaryotic amino acid transporters were ignored

# Tryptophan catabolism often involves anthranilate (2-aminobenzoate),
# 3-hydroxyanthranilate, or catechol (1,2-dihydroxybenzene) as
# intermediates

# acetaldehyde is also an intermediate
import ethanol.steps:acetaldehyde-degradation

# 2-hydroxypenta-2,4-dienoate-degradation (HPD), 2-hydroxymuconate 6-semialdehyde,
# and 3-oxoadipate are intermediates in catechol degradation.
# pcaD is 3-oxoadipate enol-lactone hydrolase
import 4-hydroxybenzoate.steps:2-hydroxypenta-2,4-dienoate-degradation 2-hydroxymuconate-6-semialdehyde-degradation 3-oxoadipate-degradation pcaD

# Catechol degradation

# BRENDA misannotates P23103 as the P. putida xylE (that should be, P06622)
xylE	catechol 2,3-dioxygenase	EC:1.13.11.2	ignore:BRENDA::P23103

# In MetaCyc pathway catechol degradation to HPD I (meta-cleavage, metacyc:P183-PWY),
# dioxygenase xylE converts catechol to
# (2Z,4E)-2-hydroxy-6-oxohexa-2,4-dienoate (also known as
# 2-hydroxymuconate 6-semialdehyde).
# (Catechol degradation to HPD II also involves xylE and HPD, metacyc:PWY-5419.) 
catechol-degradation: xylE 2-hydroxymuconate-6-semialdehyde-degradation

# Ignore two protein fragments for catA
catA	catechol 1,2-dioxygenase	EC:1.13.11.1	ignore:CharProtDB::CH_012040	ignore:SwissProt::P83715
catB	muconate cycloisomerase	EC:5.5.1.1
catC	muconolactone isomerase	EC:5.3.3.4

# In MetaCyc pathway catechol degradation III (ortho-cleavage, metacyc:PWY-5417),
# the 1,2-dioxygenase catA forms cis,cis-muconate, a cycloisomerase forms
# (+)-muconolactone, an isomerase converts this to
# (4,5-dihydro-5-oxofuran-2-yl)-acetate (also known as 3-oxoadipate
# enol lactone), and a hydrolase cleaves this to 3-oxoadipate.
catechol-degradation: catA catB catC pcaD 3-oxoadipate-degradation

# 3-hydroxyanthranilate (2-amino-3-hydroxybenzoate) is an intermediate
# in anthranilate or tryptophan degradation

nbaC	3-hydroxyanthranilate 3,4-dioxygenase	EC:1.13.11.6
nbaD	2-amino-3-carboxymuconate-6-semialdehyde decarboxylase	EC:4.1.1.45
nbaE	2-aminomuconate 6-semialdehyde dehydrogenase	EC:1.2.1.32
nbaF	2-aminomuconate deaminase	EC:3.5.99.5
nbaG	2-oxo-3-hexenedioate decarboxylase	EC:4.1.1.77

# 3-hydroxyanthranilate degradation is part of
# L-tryptophan degradation pathway XII (metacyc:PWY-6505).
# Dioxygenase NbaC cleaves the aromatic ring,  yielding 2-amino-3-carboxymuconate
# 6-semialdehyde, a decarboxylase forms (2Z,4E)-2-aminomuconate
# semialdehyde, a dehydrogenase forms (2Z,4E)-2-aminomuconate, a
# deaminase forms (3E)-2-oxo-3-hexenedioate (also known as
# 2-oxalocrotonate), and a decarboxylase forms
# (2Z)-2-hydroxypenta-2,4-dienoate (HPD).
3-hydroxyanthranilate-degradation: nbaC nbaD nbaE nbaF nbaG 2-hydroxypenta-2,4-dienoate-degradation

antA	anthranilate 1,2-dioxygenase (deaminating, decarboxylating), large subunit AntA	curated:SwissProt::O85673	curated:reanno::WCS417:GFF4629	curated:reanno::pseudo13_GW456_L13:PfGW456L13_2740	curated:reanno::pseudo3_N2E3:AO353_05950	ignore_other:1.14.12.1

antB	anthranilate 1,2-dioxygenase (deaminating, decarboxylating), small subunit AntB	curated:SwissProt::O85674	curated:reanno::WCS417:GFF4630	curated:reanno::pseudo13_GW456_L13:PfGW456L13_2739	curated:reanno::pseudo3_N2E3:AO353_05955	ignore_other:1.14.12.1

antC	anthranilate 1,2-dioxygenase (deaminating, decarboxylating), electron transfer component AntC	curated:SwissProt::O85675	curated:reanno::WCS417:GFF4631	curated:reanno::pseudo13_GW456_L13:PfGW456L13_2738	ignore_other:1.14.12.1

# There are two forms of anthranilate dioxygenase, 3-subunit antABC or 4-subunit andAabcd.
anthranilate-dioxygenase: antA antB antC

andAa	anthranilate 1,2-dioxygenase (deaminating, decarboxylating), ferredoxin--NAD(+) reductase component AndAa	curated:SwissProt::Q84BZ0	ignore_other:1.14.12.1

andAb	anthranilate 1,2-dioxygenase (deaminating, decarboxylating), ferredoxin subunit AndAb	curated:SwissProt::Q84BZ1	ignore_other:1.14.12.1

andAc	anthranilate 1,2-dioxygenase (deaminating, decarboxylating), large subunit AndAc	curated:SwissProt::Q84BZ3	ignore_other:1.14.12.1

andAd	athranilate 1,2-dioxygenase (deaminating, decarboxylating), small subunit AndAd	curated:SwissProt::Q84BZ2	ignore_other:1.14.12.1

anthranilate-dioxygenase: andAa andAb andAc andAd

# In MetaCyc pathway anthranilate degradation I (metacyc:PWY-6079),
# a dioxygenase cleaves off carbon dioxide and ammonia, leaving catechol.
anthranilate-degradation: anthranilate-dioxygenase catechol-degradation

hpaH	anthranilate 3-monooxygenase (hydroxylase), FADH2-dependent	EC:1.14.14.8

# In MetaCyc pathway anthranilate degradation IV (metacyc:PWY-6504),
# anthranilate hydroxylase/monooxygenase (hpaH) yields
# 3-hydroxyanthranilate.
#
# Additional pathways are not included: the fate of
# 2-amino-5-oxocyclohex-1-enecarboxyl-CoA is not known
# (metacyc:PWY-6077), and anthraniloyl-CoA reductase
# (the only anaerobic route known, metacyc:2AMINOBENZDEG-PWY) has not
# been linked to sequence.
anthranilate-degradation: hpaH 3-hydroxyanthranilate-degradation

kynA	tryptophan 2,3-dioxygenase	EC:1.13.11.11	EC:1.13.11.52
# MONOMER-19595 is involved in echinomycin biosynthesis and its molecular function
# is controversial, see the MetaCyc entry.
kynB	kynurenine formamidase	EC:3.5.1.9	ignore:metacyc::MONOMER-19595

# 3-hydroxykynurenine hydrolase and kynurenine hydrolase
# are described by the same EC number.
kyn	kynureninase	EC:3.7.1.3

# In pathway I, dioxygenase kynA opens the non-aromatic ring, to N-formyl-L-kynureine,
# a hydrolase yields L-kynurenine (and formate),
# and a hydrolase yields anthranilate and L-alanine.
all: tryptophan-transport kynA kynB kyn anthranilate-degradation

# MONOMER-7584 and -7541 carry out the same reaction but were not linked to the EC number
tnaA	tryptophanase	EC:4.1.99.1	curated:metacyc::MONOMER-7584	curated:metacyc::MONOMER-7541

# In pathway II, the tryptophan is hydrolyzed to indole and pyruvate,
# and the indole may be secreted (as in E. coli).
all: tryptophan-transport tnaA

sibC	L-kynurenine 3-monooxygenase	EC:1.14.13.9

# In pathway IX, dioxygenase kynA forms N-formyl-L-kynurenine and a hydrolase
# forms L-kynurenine, as in pathway I; then, oxygenase sibC forms
# 3-hydroxy-L-kynurenine, which is hydrolyzed to L-alanine and
# 3-hydroxyanthranilate.
all: tryptophan-transport kynA kynB sibC kyn 3-hydroxyanthranilate-degradation

Links

Downloads

Related tools

About GapMind

Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.

A candidate for a step is "high confidence" if either:

where "other" refers to the best ublast hit to a sequence that is not annotated as performing this step (and is not "ignored").

Otherwise, a candidate is "medium confidence" if either:

Other blast hits with at least 50% coverage are "low confidence."

Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:

GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).

For more information, see the paper from 2019 on GapMind for amino acid biosynthesis, the paper from 2022 on GapMind for carbon sources, or view the source code.

If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know

by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory