GapMind for Amino acid biosynthesis

 

Definition of L-valine biosynthesis

As text, or see rules and steps

# Valine biosynthesis in GapMind is based on MetaCyc pathway L-valine biosynthesis
# (metacyc:VALSYN-PWY).

# ilvIH (or ilvGM) is a two-subunit enzyme that forms acetolactate or acetohydroxybutanoate.
# (S)-2-aceto-2-hydroxybutanoate synthesis is part of isoleucine biosynthesis.
# Acetolactate synthesis is part of valine or leucine biosynthesis.
# CH_124129 is probably correct but has limited data and vaguer annotations.
# IlvX from Mycobacterium tuberculosis (uniprot:O53554) is ignored because, although
# annotated with this activity by both SwissProt and BRENDA, we could not find
# experimental evidence, and it is reported to
# lack activity (PMID:20884690).
ilvI	acetolactate/acetohydroxybutanoate synthase catalytic subunit	hmm:TIGR00118	term:acetohydroxy-acid synthase%large	term:acetohydroxy acid synthase%large	term:acetohydroxybutanoate synthase, catalytic subunit	term:acetohydroxybutanoate synthase, catalytic subunit	term:acetohydroxyacid synthase subunit B	ignore_other:EC 2.2.1.6	ignore:CharProtDB::CH_124219	ignore:SwissProt::O53554

# The isolated catalytic subunit can have some activity on its own, so it's not clear if the regulatory
# subunit (ilvH) is always required, but ilvH does always seem to be present.
# uniprot:P0ADG1 is annotated with this EC number but not explicitly as the small regulatory subunit,
# so it was added manually.
# uniprot:Q93YZ7 is annotated as this but without the EC number, so is added manually.
# Most regulatory subunits have an N-terminal ACT domain and a C-terminal ACT-like domain,
# but E. coli IlvM, which is required for the activity of E. coli acetohydroxyacid synthase isoenzyme II,
# has the N-terminal ACT domain only.
# We identified several other short (one-domain) regulatory subunits.
# In Rhodanobacter and related genera, the putative regulatory subunit has
# just one ACT domain (i.e., LRK54_RS10305, which is nearly identical to A0A154R0Y7).
# Based on sequence analysis, short ilvH probably maintains the ability to bind
# valine and to bind the catalytic subunit, but not the ability to bind ATP or other regulatory subunits.
# Mutant fitness data confirms that LRK54_RS10305 is involved in amino acid biosynthesis.
# In Xanthomonas campestris, the one ACT-domain protein Xcc-8004.1058.1 (uniprot:A0A0H2X4P1), which
# is conserved next to ilvI, has a similar fitness pattern as ilvI (Alice Castaing, unpublished data).
# Furthermore, its AlphaFold structure is
# very similar to that of E. coli IlvM (TM-score 0.89, RMSD 1.49 A, foldseek).
# So Xcc-8004.1058.1 is another short regulatory subunit.
# Similarly, in Brevundimonas sp. GW460-12-10-14-LB2, the putative ilvH has the ACT domain only
# (Brev2_1981 = A0A161J739).
# Many Thermoproteota seem to have a diverged short regulatory subunit, such as the ACT domain protein
# KCR_RS03285 (uniprot:A0A7J3AYJ4), which is conserved next to ilvI. Foldseek shows that this protein
# is similar to the ACT domain of (p)ppGpp synthase but also to IlvH of Staphylococcus aureus,
# so we predict that it is the regulatory subunit.
ilvH	acetolactate/acetohydroxybutanoate synthase regulatory subunit 	hmm:TIGR00119	term:acetohydroxy-acid synthase%small	term:acetohydroxybutanoate synthase, regulatory subunit	term:small subunit of acetolactate synthase	curated:BRENDA::P0ADG1	ignore_other:EC 2.2.1.6	uniprot:A0A154R0Y7	curated:SwissProt::Q93YZ7	uniprot:A0A0H2X4P1	predicted:A0A7J3AYJ4	predicted:A0A161J739

# The three EC numbers correspond to different preferences for NAD(P)H as the cofactor;
# the transformations to the carbon skeleton are the same.
# CH_123630 is added because it is annotated as this but with no EC number.
ilvC	2-hydroxy-3-ketol-acid reductoisomerase	EC:1.1.1.86	EC:1.1.1.382	EC:1.1.1.383	curated:CharProtDB::CH_123630

# IlvD is involved in the biosynthesis of isoleucine, with
# (R)-2,3-dihydroxy-3-methylpentanoate as the substrate, and in the
# biosynthesis of valine and leucine, with 
# (R)-2,3-dihydroxy-3-methylbutanoate as the substrate.
# The ignored enzyme is involved in salinosporamide A biosynthesis but does a very similar reaction
# and is >50% identical to N515DRAFT_0569, which is confirmed by fitness data to be biosynthetic
ilvD	dihydroxy-acid dehydratase	EC:4.2.1.9	ignore:metacyc::MONOMER-15882

# Q8NS92 is ignored because it is primarily a transcriptional regulator.
# Similarity to aromatic amino acid transaminases or tyrosine transaminases is ignored as they
# are often non-specific.
ilvE	valine transaminase	EC:2.6.1.42	EC:2.6.1.66	ignore:SwissProt::Q8NS92	ignore_other:2.6.1.57	ignore_other:2.6.1.5

3-methyl-2-oxobutanoate: ilvH ilvI ilvC ilvD

all: 3-methyl-2-oxobutanoate ilvE

Links

Downloads

Related tools

About GapMind

Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.

A candidate for a step is "high confidence" if either:

where "other" refers to the best ublast hit to a sequence that is not annotated as performing this step (and is not "ignored").

Otherwise, a candidate is "medium confidence" if either:

Other blast hits with at least 50% coverage are "low confidence."

Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:

GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).

For more information, see:

If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know

by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory