GapMind for catabolism of small carbon sources

 

Alignments for a candidate for aapM in Maridesulfovibrio zosterae DSM 11974

Align AapM, component of General L-amino acid porter; transports basic and acidic amino acids preferentially, but also transports aliphatic amino acids (catalyzes both uptake and efflux) (characterized)
to candidate WP_027721172.1 H589_RS0105870 amino acid ABC transporter permease

Query= TCDB::Q52814
         (384 letters)



>NCBI__GCF_000425265.1:WP_027721172.1
          Length = 477

 Score =  263 bits (671), Expect = 1e-74
 Identities = 148/346 (42%), Positives = 213/346 (61%), Gaps = 10/346 (2%)

Query: 42  ILTILALALIAWAVPHLVNWLF-IQAVWSGPDRTFCATTLQGGIQPDGWSGACWAFISAK 100
           I+ ++A  L+   VP ++ +LF +  ++ G   T  +  L   I   G++G  +  ++  
Sbjct: 136 IMALVAALLLFGPVPMIIVFLFAVLPMFFGRLSTNGSVPL---ITLQGFAGVAFRLLAGI 192

Query: 101 YDQFIFGRYP--LGERWRPAIVGILF-ILLLVPMLIPSAPRKGLNAILLFAVLPVIAFWL 157
               I G     LG       +G++  +L+ V + + +        ++LF   P++AF L
Sbjct: 193 VIGCIAGSISKELGAAETATYIGLVSGVLIWVLLQVRNLGAGAWQWVMLFTCFPLLAFIL 252

Query: 158 LHGG-FGLEVVETPLWGGLMVTLVLSFVGIAVSLPVGILLALGRRSRMPVIRMLCVTFIE 216
           L G  FGL  VET  WGGL +TLV++  G+  +LP+GILLALGRRS +PVIR +C+ FIE
Sbjct: 253 LSGNAFGLAHVETHYWGGLFLTLVVAVTGMGTALPIGILLALGRRSTLPVIRTVCICFIE 312

Query: 217 VIRGVPLITVLFMASVMLPLFLPTGWNVDKLLRALIGVSIFTSAYMAEVIRGGLQAIPKG 276
             RG+PL++VLFM SVMLPL LP   N DKLLRALIG++ F +AYMAEV+RGGLQAIP  
Sbjct: 313 FARGIPLVSVLFMVSVMLPLMLPHDVNFDKLLRALIGIAFFYAAYMAEVVRGGLQAIPSQ 372

Query: 277 QFEGADSLGLGYWQKTRLIIMPQAIKLVIPSIVNTFIGTFKDTSLVTIIGMFDLLGIVKL 336
           Q+E A +LG  YW+    II+PQ ++L+IP + N F+   KDT+LV +IG+FDLLGI K 
Sbjct: 373 QYEAAQALGWTYWRMMFKIILPQTLRLIIPGLANNFLSLLKDTTLVAVIGLFDLLGISKA 432

Query: 337 NFSDANWASAVTPITGLIFAGFIFWLFCFGMSRYSGFMERHLDTGH 382
             +DA+W          +F+  +FW+ CF MSRY  ++ER   T +
Sbjct: 433 AMADADWLGFTK--ESYLFSAMVFWILCFCMSRYFLYLERKYHTSY 476



 Score = 89.0 bits (219), Expect = 3e-22
 Identities = 52/192 (27%), Positives = 94/192 (48%), Gaps = 22/192 (11%)

Query: 18  PPPPGERGAVAWIRRNLLATPKDVILTILALALIAWAVPHLVNWLFIQAVWSGPDRTFC- 76
           P P    G + W+R+NL ++P + ILT+ ++ L+   +  L+ W  + A W G  R  C 
Sbjct: 12  PSPIQSTGPIGWMRKNLFSSPANSILTLFSVYLLYILITPLIQWAVVDATWLGNSRACCD 71

Query: 77  -ATTLQGGIQPDGWSGACWAFISAKYDQFIFGRYPLGERWRPAIVGILFILLLVPMLIPS 135
            A  L       G +GACW F+  + + F++G YP  E+WR   + ++ +   +P+L+P 
Sbjct: 72  EAAAL-------GKNGACWTFVKVRLNMFLYGFYPKAEQWRINFIMLILLFHALPILLPD 124

Query: 136 APRKGLNAILLFAVLPVIAFWLLHGGFGLEVV----ETPLWGGLMVT-------LVLSFV 184
              +     + F ++ ++A  LL G   + +V      P++ G + T        +  F 
Sbjct: 125 IINR--KKRIGFTIMALVAALLLFGPVPMIIVFLFAVLPMFFGRLSTNGSVPLITLQGFA 182

Query: 185 GIAVSLPVGILL 196
           G+A  L  GI++
Sbjct: 183 GVAFRLLAGIVI 194


Lambda     K      H
   0.330    0.145    0.469 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 1
Number of Hits to DB: 679
Number of extensions: 43
Number of successful extensions: 5
Number of sequences better than 1.0e-02: 1
Number of HSP's gapped: 2
Number of HSP's successfully gapped: 2
Length of query: 384
Length of database: 477
Length adjustment: 32
Effective length of query: 352
Effective length of database: 445
Effective search space:   156640
Effective search space used:   156640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 51 (24.3 bits)

This GapMind analysis is from Apr 09 2024. The underlying query database was built on Sep 17 2021.

Links

Downloads

Related tools

About GapMind

Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.

A candidate for a step is "high confidence" if either:

where "other" refers to the best ublast hit to a sequence that is not annotated as performing this step (and is not "ignored").

Otherwise, a candidate is "medium confidence" if either:

Other blast hits with at least 50% coverage are "low confidence."

Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:

GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).

For more information, see:

If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know

by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory