As text, or see rules and steps
# Phenylalanine biosynthesis in GapMind is based on MetaCyc pathways # L-phenylalanine biosynthesis I (metacyc:PHESYN), # II (metacyc:PWY-3462), and III (metacyc:PWY-7432). # Pathways I and III proceed via 3-phenyl-2-oxopropanoate, but with different # amino acids providing the amino group for the conversion to phenylalanine. # In pathway II, L-arogenate is the intermediate (the aminotransferase reaction occurs before the dehydratase reaction). # These enzymes are usually annotated as aromatic-amino-acid transaminase, but only # the activity on phenylpyruvate (to form phenylalanine) is relevant here. # Phenylalanine is also considered a substrate for aspartate aminotransferases, # uniprot:P39643 is annotated with this EC number but is involved in bacilysin biosynthesis and # probably wouldn't form phenylalanine. # uniprot:H7CE71 is misannotated as this in BRENDA. # metacyc:PHETRANTHAUERA-MONOMER has this activity but is annotated with a different EC number. # uniprot:A0A060PQX5 has this activity but is given a different EC number in BRENDA. # metacyc:MONOMER-15918 and uniprot:Q93QX0 probably do not have this activity. # Many branched-chain or tryptophan transaminases are also active on phenylalanine so any similarity to them is ignored. # Ignore the very short protein uniprot:P12343 PPYAT phenylpyruvate aminotransferase EC:2.6.1.57 EC:2.6.1.1 EC:2.6.1.5 ignore:SwissProt::P39643 ignore:BRENDA::H7CE71 curated:metacyc::PHETRANTHAUERA-MONOMER curated:BRENDA::A0A060PQX5 ignore_other:EC 2.6.1.27 ignore_other:2.6.1.42 ignore:SwissProt::P12343 ignore:metacyc::MONOMER-15918 ignore:SwissProt::Q93QX0 # Chorismate mutase is usually fused to prephenate dehydratase, which # makes it difficult to find this activity when it is fused to something else, so # we included the entire pfam in the step's definition. # The only other function for this family that we are aware of is # 4-amino-4-deoxychorismate mutase, which is involved in the biosynthesis of some antibiotics. # (Proteins that are similar to 4-amino-4-deoxychorismate mutases will be marked as medium-confidence # in the results.) # Several fusion proteins are missing this EC number in their annotations # (uniprot:P43902, uniprot:Q74NC4, uniprot:A8AAX2) cmutase chorismate mutase EC:5.4.99.5 hmm:PF01817 curated:BRENDA::P43902 curated:BRENDA::Q74NC4 curated:BRENDA::A8AAX2 # prephenate dehydratase and arogenate dehydratase can be difficult to distinguish. # BT3936 (uniprot:Q8A0T5_BACTN) is diverged but has auxotrophic phenotypes (as do homologs Echvi_0123, CA265_RS11630). # HVO_0449 (uniprot:D4GRZ0) is a phenylalanine auxotroph and is probably prephenate dehydratase (PMC4300041). # uniprot:A0A160HWY7 from Brevundimonas is a somewhat diverged prephenate dehydratase; it has auxotrophic phenotypes. # The putative prephenate dehydratase from Carboxydothermus pertinax (A0A1L8CUL6) is lacking # the regulatory (ACT) domain but has the active site residues and is often encoded near aromatic amino # acid synthesis genes (it might be an arogenate dehydratase instead). preph-dehydratase prephenate dehydratase EC:4.2.1.51 uniprot:Q8A0T5_BACTN ignore_other:EC 4.2.1.91 uniprot:D4GRZ0 uniprot:A0A160HWY7 predicted:A0A1L8CUL6 # This enzyme forms arogenate, also known as pretyrosine. # Similarity to uniprot:O50434 is ignored because it is not acutally characterized. ptransferase prephenate aminotransferase EC:2.6.1.79 EC:2.6.1.78 ignore_other:EC 2.6.1.5 ignore_other:EC 2.6.1.27 ignore_other:EC 2.6.1.57 ignore_other:EC 2.6.1.1 ignore:SwissProt::O50434 aro-dehydratase arogenate dehydratase EC:4.2.1.91 ignore_other:EC 4.2.1.51 all: cmutase ptransferase aro-dehydratase all: cmutase preph-dehydratase PPYAT
Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.
A candidate for a step is "high confidence" if either:
Otherwise, a candidate is "medium confidence" if either:
Other blast hits with at least 50% coverage are "low confidence."
Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:
GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
For more information, see:
If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know
by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory