GapMind for Amino acid biosynthesis

 

Alignments for a candidate for metH in Shewanella amazonensis SB2B

Align methionine synthase; EC 2.1.1.13 (characterized)
to candidate 6936570 Sama_0758 B12-dependent methionine synthase (RefSeq)

Query= CharProtDB::CH_090726
         (1227 letters)



>FitnessBrowser__SB2B:6936570
          Length = 1246

 Score = 1643 bits (4255), Expect = 0.0
 Identities = 820/1232 (66%), Positives = 973/1232 (78%), Gaps = 8/1232 (0%)

Query: 3    SKVEQLRAQLNERILVLDGGMGTMIQSYRLNEADFRGERFADWPCDLKGNNDLLVLSKPE 62
            S+ ++L   L+ RIL+LDG MGTMIQ ++L E  +RG RFADW CD+KGNNDLLVL++PE
Sbjct: 16   SRQQRLNEDLSTRILILDGAMGTMIQGHKLEEEHYRGSRFADWHCDVKGNNDLLVLTQPE 75

Query: 63   VIAAIHNAYFEAGADIIETNTFNSTTIAMADYQMESLSAEINFAAAKLARACADEWTART 122
            +I  IH  Y  AGADIIETNTFN+TT+AMADY M+SLSAEIN   A++AR  ADE  A+T
Sbjct: 76   IIKGIHREYLLAGADIIETNTFNATTVAMADYDMQSLSAEINLVGARIAREVADEVEAQT 135

Query: 123  PEKPRYVAGVLGPTNRTASISPDVNDPAFRNITFDGLVAAYRESTKALVEGGADLILIET 182
               PRYVAGVLGPTNRT SISPDVNDP +RNI FD LV AYREST AL+EGGAD+I++ET
Sbjct: 136  GI-PRYVAGVLGPTNRTCSISPDVNDPGYRNIHFDDLVTAYRESTAALIEGGADIIMVET 194

Query: 183  VFDTLNAKAAVFAVKTEFEALGVELPIMISGTITDASGRTLSGQTTEAFYNSLRHAEALT 242
            +FDTLNAKAA+FA+++ F+ +G+ LP+MISGTITDASGRTL+GQTTEAFYNSLRH + ++
Sbjct: 195  IFDTLNAKAALFAIESIFDEVGLRLPVMISGTITDASGRTLTGQTTEAFYNSLRHIKPIS 254

Query: 243  FGLNCALGPDELRQYVQELSRIAECYVTAHPNAGLPNAFGEYDLDADTMAKQIREWAQAG 302
             GLNCALGP ELR YV+ELSRI+ECYV+AHPNAGLPN FG YD     MA  I +WA  G
Sbjct: 255  MGLNCALGPKELRPYVEELSRISECYVSAHPNAGLPNEFGGYDETPKEMADIIVQWAIEG 314

Query: 303  FLNIVGGCCGTTPQHIAAMSRAVEGLAPRKLPEIPVACRLSGLEPLNIGEDSLFVNVGER 362
             LNIVGGCCGTTP HI  +  AVE  APRKLPE+PVACRL+GLEPL I  DSLFVNVGER
Sbjct: 315  MLNIVGGCCGTTPDHIRVIREAVEKHAPRKLPELPVACRLAGLEPLTISADSLFVNVGER 374

Query: 363  TNVTGSAKFKRLIKEEKYSEALDVARQQVENGAQIIDINMDEGMLDAEAAMVRFLNLIAG 422
            TNVTGSAKF +LIKE +Y  ALDVAR QVENGAQIIDINMDEGMLD E  M  FLNL+A 
Sbjct: 375  TNVTGSAKFLKLIKEGQYETALDVARDQVENGAQIIDINMDEGMLDGEEVMTTFLNLVAS 434

Query: 423  EPDIARVPIMIDSSKWDVIEKGLKCIQGKGIVNSISMKEGVDAFIHHAKLLRRYGAAVVV 482
            EP+I++VPIMIDSSKW+VIE GLKC+QGK IVNSIS+KEG   FI  A L++RYGAA ++
Sbjct: 435  EPEISKVPIMIDSSKWEVIEAGLKCVQGKCIVNSISLKEGEAKFIEQATLVKRYGAAAII 494

Query: 483  MAFDEQGQADTRARKIEICRRAYKILTEEVGFPPEDIIFDPNIFAVATGIEEHNNYAQDF 542
            MAFDE GQADTRARKIEIC RAY+IL ++VGFPPEDIIFDPNIFAVATGIEEH+NYA DF
Sbjct: 495  MAFDETGQADTRARKIEICTRAYRILVDKVGFPPEDIIFDPNIFAVATGIEEHDNYAVDF 554

Query: 543  IGACEDIKRELPHALISGGVSNVSFSFRGNDPVREAIHAVFLYYAIRNGMDMGIVNAGQL 602
            I A  DIK  LPHA+ISGGVSNVSFSFRGN+PVREAIHAVFLY+AI+ GMDMGIVNAGQL
Sbjct: 555  IEAVRDIKATLPHAMISGGVSNVSFSFRGNNPVREAIHAVFLYHAIKEGMDMGIVNAGQL 614

Query: 603  AIYDDLPAELRDAVEDVILN-----RRDDGTERLLELAEKYRGSKTDDTANAQQAEWRSW 657
            AIYDD+PAEL++ VE V+LN          TE+LLE+AEKYRG         +  +WRS 
Sbjct: 615  AIYDDIPAELKERVEAVVLNLPCPVEDSTNTEQLLEIAEKYRGGGGSGAGKKEDLQWRSL 674

Query: 658  EVNKRLEYSLVKGITEFIEQDTEEARQQATRPIEVIEGPLMDGMNVVGDLFGEGKMFLPQ 717
             VNKRLE++LVKGITEFI+ DTEEARQQATRP++VIEGPLMDGMNVVGDLFGEGKMFLPQ
Sbjct: 675  PVNKRLEHALVKGITEFIDADTEEARQQATRPLDVIEGPLMDGMNVVGDLFGEGKMFLPQ 734

Query: 718  VVKSARVMKQAVAYLEPFIEASKEQGKTNGKMVIATVKGDVHDIGKNIVGVVLQCNNYEI 777
            VVKSARVMK+AVAYL PFIEA K  G++NGK+++ TVKGDVHDIGKNIVGVVL CN YE+
Sbjct: 735  VVKSARVMKKAVAYLNPFIEAEKVAGQSNGKVLMVTVKGDVHDIGKNIVGVVLACNGYEV 794

Query: 778  VDLGVMVPAEKILRTAKEVNADLIGLSGLITPSLDEMVNVAKEMERQGFTIPLLIGGATT 837
            +DLGVMVP EKI+  AK+   D+IG+SGLITPSLDEMV+  K  ER+G T+P +IGGAT 
Sbjct: 795  IDLGVMVPVEKIVEVAKKEQVDIIGMSGLITPSLDEMVHNVKTFEREGLTLPAIIGGATC 854

Query: 838  SKAHTAVKIEQNYSGPTVYVQNASRTVGVVAALLSDTQRDDFVARTRKEYETVRIQHGRK 897
            SK HTAVKI  +Y    +Y+ +ASR V +V+ L+++  R   +  T  EY+ +R +   +
Sbjct: 855  SKIHTAVKIAPHYPHGAIYIPDASRAVPMVSKLINEETRAATIKATYDEYDVMREKRLSQ 914

Query: 898  KPRTPPVTLEAARDNDFAFDWQAYTPPVAHRLGVQEVE-ASIETLRNYIDWTPFFMTWSL 956
              R   +++EAAR+N    DW  Y P V ++LG+Q  E   ++ L + IDWTPFF  W L
Sbjct: 915  AKRKEIISIEAARENRCQLDWANYQPKVPNKLGIQVFEDYPLDDLVDRIDWTPFFRAWEL 974

Query: 957  AGKYPRILEDEVVGVEAQRLFKDANDMLDKLSAEKTLNPRGVVGLFPANRVG-DDIEIYR 1015
             G +PRILEDEVVG EA++LF DA  ML  +  EK L  +GV+GLFPAN V  DDIE+Y 
Sbjct: 975  HGHFPRILEDEVVGEEARKLFADAKAMLQTIIDEKWLTAKGVIGLFPANTVNHDDIELYT 1034

Query: 1016 DETRTHVINVSHHLRQQTEKTGFANYCLADFVAPKLSGKADYIGAFAVTGGLEEDALADA 1075
            DE+R+ V+  +HHLR Q E+ G  N+CL+DFVAPK SG  DY G FAV  G   D     
Sbjct: 1035 DESRSQVLMTTHHLRMQIERVGNDNFCLSDFVAPKDSGVVDYTGGFAVCAGHGIDEHLAR 1094

Query: 1076 FEAQHDDYNKIMVKALADRLAEAFAEYLHERVRKVYWGYAPNENLSNEELIRENYQGIRP 1135
            FEA HDDYN IM+K LADRLAEAFAE +HERVRK +WGYA +ENL NE LIRE Y+GIRP
Sbjct: 1095 FEANHDDYNAIMLKVLADRLAEAFAERMHERVRKEFWGYASDENLDNEALIREKYRGIRP 1154

Query: 1136 APGYPACPEHTEKATIWELLEVEKHTGMKLTESFAMWPGASVSGWYFSHPDSKYYAVAQI 1195
            APGYPACP+HTEK  +W+LL+  +   + +TESFAM+P A+VSGWYF+HP+++Y+ V  I
Sbjct: 1155 APGYPACPDHTEKGLLWDLLKPNECIDLNITESFAMYPTAAVSGWYFAHPEARYFGVTNI 1214

Query: 1196 QRDQVEDYARRKGMSVTEVERWLAPNLGYDAD 1227
             RDQVEDYARRKGM+V E E+WLAP L YD +
Sbjct: 1215 GRDQVEDYARRKGMTVAETEKWLAPILDYDPE 1246
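
The summary line and the alignment coordinates above can be turned into identity and coverage fractions. The following is a minimal sketch, not GapMind's own code. It assumes that coverage means the fraction of a sequence's length spanned by the alignment; the helper name parse_fractions is hypothetical.

```python
import re

# Summary line copied from the BLAST report above.
summary = "Identities = 820/1232 (66%), Positives = 973/1232 (78%), Gaps = 8/1232 (0%)"

def parse_fractions(line):
    """Return {label: (numerator, denominator)} for each 'Label = a/b' field."""
    return {
        label: (int(num), int(den))
        for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    }

fractions = parse_fractions(summary)
identity = fractions["Identities"][0] / fractions["Identities"][1]

# Aligned ranges read from the first and last Query/Sbjct lines above,
# with the sequence lengths from the "letters" and "Length" lines.
query_start, query_end, query_len = 3, 1227, 1227
sbjct_start, sbjct_end, sbjct_len = 16, 1246, 1246

# Assumption: coverage = aligned span / full sequence length.
query_coverage = (query_end - query_start + 1) / query_len
sbjct_coverage = (sbjct_end - sbjct_start + 1) / sbjct_len

print(f"identity        = {identity:.2%}")
print(f"query coverage  = {query_coverage:.2%}")
print(f"target coverage = {sbjct_coverage:.2%}")
```

The identity fraction is computed over the alignment length (1232 columns, including gaps), not over either sequence length. The percentages printed in the report appear to be truncated rather than rounded, so the values computed here can differ from them by up to one point.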


Lambda     K      H
   0.318    0.134    0.391 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 1
Number of Hits to DB: 3781
Number of extensions: 145
Number of successful extensions: 5
Number of sequences better than 1.0e-02: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 1227
Length of database: 1246
Length adjustment: 48
Effective length of query: 1179
Effective length of database: 1198
Effective search space:  1412442
Effective search space used:  1412442
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)
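
The statistics block above is internally consistent, and a few lines of arithmetic show how the reported numbers relate. This is a sketch using the standard Karlin-Altschul relations, with every input copied from the report. The variable names are illustrative, and the parameters are rounded as printed, so small differences from the reported bit score are expected.

```python
import math

# Gapped Karlin-Altschul parameters and raw score from the report above.
lam, K = 0.267, 0.0410
raw_score = 4255

# Sequence lengths and the length adjustment from the report.
query_len, db_len, length_adjustment = 1227, 1246, 48
eff_query_len = query_len - length_adjustment
eff_db_len = db_len - length_adjustment

# Effective search space is the product of the two effective lengths.
search_space = eff_query_len * eff_db_len

# Bit score: S' = (lambda * S - ln K) / ln 2
bit_score = (lam * raw_score - math.log(K)) / math.log(2)

# E = search_space * 2**(-S'). Work in log10 because 2**(-S') is far
# below the smallest positive double and would underflow to 0.0.
log10_evalue = math.log10(search_space) - bit_score * math.log10(2)

print(f"effective lengths = {eff_query_len}, {eff_db_len}")
print(f"search space      = {search_space}")
print(f"bit score         = {bit_score:.1f}")
print(f"log10(E-value)    = {log10_evalue:.1f}")
```

The computed search space matches the reported value of 1412442, and the bit score lands within one bit of the reported 1643. The expected number of chance hits is so small that the report prints it as 0.0.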

Align candidate 6936570 Sama_0758 (B12-dependent methionine synthase (RefSeq))
to HMM TIGR02082 (metH: methionine synthase (EC 2.1.1.13))

# hmmsearch :: search profile(s) against a sequence database
# HMMER 3.3.1 (Jul 2020); http://hmmer.org/
# Copyright (C) 2020 Howard Hughes Medical Institute.
# Freely distributed under the BSD open source license.
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
# query HMM file:                  ../tmp/path.aa/TIGR02082.hmm
# target sequence database:        /tmp/gapView.30457.genome.faa
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Query:       TIGR02082  [M=1182]
Accession:   TIGR02082
Description: metH: methionine synthase
Scores for complete sequences (score includes all domains):
   --- full sequence ---   --- best 1 domain ---    -#dom-
    E-value  score  bias    E-value  score  bias    exp  N  Sequence                         Description
    ------- ------ -----    ------- ------ -----   ---- --  --------                         -----------
          0 1780.0   0.0          0 1779.8   0.0    1.0  1  lcl|FitnessBrowser__SB2B:6936570  Sama_0758 B12-dependent methioni


Domain annotation for each sequence (and alignments):
>> lcl|FitnessBrowser__SB2B:6936570  Sama_0758 B12-dependent methionine synthase (RefSeq)
   #    score  bias  c-Evalue  i-Evalue hmmfrom  hmm to    alifrom  ali to    envfrom  env to     acc
 ---   ------ ----- --------- --------- ------- -------    ------- -------    ------- -------    ----
   1 ! 1779.8   0.0         0         0       2    1182 .]      26    1211 ..      25    1211 .. 0.98

  Alignments for each domain:
  == domain 1  score: 1779.8 bits;  conditional E-value: 0
                         TIGR02082    2 nkrilvlDGamGtqlqsanLteadFrge.eadlarelkGnndlLnltkPeviaaihrayfeaGaDivetntFnst 75  
                                        + ril+lDGamGt++q ++L+e+++rg+ +ad+++++kGnndlL+lt+Pe+i+ ihr+y+ aGaDi+etntFn+t
  lcl|FitnessBrowser__SB2B:6936570   26 STRILILDGAMGTMIQGHKLEEEHYRGSrFADWHCDVKGNNDLLVLTQPEIIKGIHREYLLAGADIIETNTFNAT 100 
                                        78************************************************************************* PP

                         TIGR02082   76 eialadYdledkayelnkkaaklarevadeftltpekkRfvaGslGPtnklatlspdverpefrnvtydelvdaY 150 
                                        ++a+adYd++++++e+n  +a++arevade++ ++  +R+vaG+lGPtn++ ++spdv++p++rn+++d+lv aY
  lcl|FitnessBrowser__SB2B:6936570  101 TVAMADYDMQSLSAEINLVGARIAREVADEVEAQTGIPRYVAGVLGPTNRTCSISPDVNDPGYRNIHFDDLVTAY 175 
                                        *************************************************************************** PP

                         TIGR02082  151 keqvkglldGGvDllLietvfDtlnakaalfaveevfeekgrelPilisgvivdksGrtLsGqtleaflasleha 225 
                                        +e++ +l++GG+D++++et+fDtlnakaalfa+e++f+e g +lP++isg+i+d+sGrtL+Gqt+eaf++sl+h 
  lcl|FitnessBrowser__SB2B:6936570  176 RESTAALIEGGADIIMVETIFDTLNAKAALFAIESIFDEVGLRLPVMISGTITDASGRTLTGQTTEAFYNSLRHI 250 
                                        *************************************************************************** PP

                         TIGR02082  226 eililGLnCalGadelrefvkelsetaealvsviPnaGLPnalgeYdltpeelakalkefaeegllnivGGCCGt 300 
                                        + +++GLnCalG++elr++v+els+++e++vs++PnaGLPn++g Yd+tp+e+a ++ ++a eg+lnivGGCCGt
  lcl|FitnessBrowser__SB2B:6936570  251 KPISMGLNCALGPKELRPYVEELSRISECYVSAHPNAGLPNEFGGYDETPKEMADIIVQWAIEGMLNIVGGCCGT 325 
                                        *************************************************************************** PP

                         TIGR02082  301 tPehiraiaeavkdikprkrqeleeksvlsglealkiaqessfvniGeRtnvaGskkfrklikaedyeealkiak 375 
                                        tP+hir i eav++ +prk +el+ +++l+gle+l+i+ +s fvn+GeRtnv+Gs+kf klik+++ye al++a+
  lcl|FitnessBrowser__SB2B:6936570  326 TPDHIRVIREAVEKHAPRKLPELPVACRLAGLEPLTISADSLFVNVGERTNVTGSAKFLKLIKEGQYETALDVAR 400 
                                        *************************************************************************** PP

                         TIGR02082  376 qqveeGaqilDinvDevllDgeadmkkllsllasepdiakvPlmlDssefevleaGLkviqGkaivnsislkdGe 450 
                                        +qve+Gaqi+Din+De++lDge+ m+++l+l+asep+i+kvP+m+Dss++ev+eaGLk++qGk+ivnsislk+Ge
  lcl|FitnessBrowser__SB2B:6936570  401 DQVENGAQIIDINMDEGMLDGEEVMTTFLNLVASEPEISKVPIMIDSSKWEVIEAGLKCVQGKCIVNSISLKEGE 475 
                                        *************************************************************************** PP

                         TIGR02082  451 erFlekaklikeyGaavvvmafDeeGqartadkkieiakRayklltekvgfppediifDpniltiatGieehdry 525 
                                        ++F+e+a l+k+yGaa+++mafDe Gqa+t+++kiei++Ray++l++kvgfppediifDpni+++atGieehd+y
  lcl|FitnessBrowser__SB2B:6936570  476 AKFIEQATLVKRYGAAAIIMAFDETGQADTRARKIEICTRAYRILVDKVGFPPEDIIFDPNIFAVATGIEEHDNY 550 
                                        *************************************************************************** PP

                         TIGR02082  526 aidfieaireikeelPdakisgGvsnvsFslrgndavRealhsvFLyeaikaGlDmgivnagklavyddidkelr 600 
                                        a+dfiea+r+ik +lP+a isgGvsnvsFs+rgn++vRea+h+vFLy+aik G+Dmgivnag+la+yddi++el+
  lcl|FitnessBrowser__SB2B:6936570  551 AVDFIEAVRDIKATLPHAMISGGVSNVSFSFRGNNPVREAIHAVFLYHAIKEGMDMGIVNAGQLAIYDDIPAELK 625 
                                        *************************************************************************** PP

                         TIGR02082  601 evvedlildrr.....reatekLlelaelykgtkeksskeaqeaewrnlpveeRLeralvkGeregieedleear 670 
                                        e ve ++l+       + +te+Lle+ae+y+g   + + +++  +wr+lpv++RLe+alvkG++e+i++d+eear
  lcl|FitnessBrowser__SB2B:6936570  626 ERVEAVVLNLPcpvedSTNTEQLLEIAEKYRGGGGSGAGKKEDLQWRSLPVNKRLEHALVKGITEFIDADTEEAR 700 
                                        *********7666665999***************9999999999******************************* PP

                         TIGR02082  671 kklkapleiiegpLldGmkvvGdLFGsGkmfLPqvvksarvmkkavayLePylekekeedkskGkivlatvkGDv 745 
                                        +++++pl++iegpL+dGm+vvGdLFG+GkmfLPqvvksarvmkkavayL+P++e+ek + +s+Gk+++ tvkGDv
  lcl|FitnessBrowser__SB2B:6936570  701 QQATRPLDVIEGPLMDGMNVVGDLFGEGKMFLPQVVKSARVMKKAVAYLNPFIEAEKVAGQSNGKVLMVTVKGDV 775 
                                        *************************************************************************** PP

                         TIGR02082  746 hDiGknivdvvLscngyevvdlGvkvPvekileaakkkkaDviglsGLivksldemvevaeemerrgvkiPlllG 820 
                                        hDiGkniv+vvL+cngyev+dlGv+vPveki+e+akk++ D+ig+sGLi++sldemv++++ +er+g+++P ++G
  lcl|FitnessBrowser__SB2B:6936570  776 HDIGKNIVGVVLACNGYEVIDLGVMVPVEKIVEVAKKEQVDIIGMSGLITPSLDEMVHNVKTFEREGLTLPAIIG 850 
                                        *************************************************************************** PP

                         TIGR02082  821 GaalskahvavkiaekYkgevvyvkdaseavkvvdkllsekkkaeelekikeeyeeirekfgekkeklialseka 895 
                                        Ga+ sk h+avkia++Y    +y  das+av +v+kl++e+++a+ ++++ +ey+ +rek  ++ ++++ +s++a
  lcl|FitnessBrowser__SB2B:6936570  851 GATCSKIHTAVKIAPHYPHGAIYIPDASRAVPMVSKLINEETRAATIKATYDEYDVMREKRLSQAKRKEIISIEA 925 
                                        *************************************************************************** PP

                         TIGR02082  896 arkevfaldrsedlevpapkflGtkvleas.ieellkyiDwkalFvqWelrgkypkilkdeleglearklfkdak 969 
                                        ar+++ +ld+  ++++ +p++lG++v+e++ +++l++ iDw+++F +Wel+g++p+il+de++g+earklf+dak
  lcl|FitnessBrowser__SB2B:6936570  926 ARENRCQLDWA-NYQPKVPNKLGIQVFEDYpLDDLVDRIDWTPFFRAWELHGHFPRILEDEVVGEEARKLFADAK 999 
                                        ***********.9************************************************************** PP

                         TIGR02082  970 elldklsaekllrargvvGlfPaqsv.gddieiytdetvsqetkpiatvrekleqlrqqsdrylclaDfiaskes 1043
                                        ++l++++ ek l+a+gv+GlfPa++v +ddie+ytde++sq      t+++ + q+++  + + cl+Df+a+k+s
  lcl|FitnessBrowser__SB2B:6936570 1000 AMLQTIIDEKWLTAKGVIGLFPANTVnHDDIELYTDESRSQVL---MTTHHLRMQIERVGNDNFCLSDFVAPKDS 1071
                                        ************************762699********95544...44444455666666669************ PP

                         TIGR02082 1044 GikDylgallvtaglgaeelakkleakeddydsilvkaladrlaealaellhervRkelwgyaeeenldkedllk 1118
                                        G +Dy g ++v ag g++e   ++ea++ddy++i++k ladrlaea+ae +hervRke+wgya++enld+e l++
  lcl|FitnessBrowser__SB2B:6936570 1072 GVVDYTGGFAVCAGHGIDEHLARFEANHDDYNAIMLKVLADRLAEAFAERMHERVRKEFWGYASDENLDNEALIR 1146
                                        *************************************************************************** PP

                         TIGR02082 1119 erYrGirpafGYpacPdhtekatlleLleaer.iGlklteslalaPeasvsglyfahpeakYfav 1182
                                        e+YrGirpa+GYpacPdhtek  l++Ll++++ i l++tes+a++P+a+vsg+yfahpea+Yf v
  lcl|FitnessBrowser__SB2B:6936570 1147 EKYRGIRPAPGYPACPDHTEKGLLWDLLKPNEcIDLNITESFAMYPTAAVSGWYFAHPEARYFGV 1211
                                        ******************************999******************************86 PP



Internal pipeline statistics summary:
-------------------------------------
Query model(s):                            1  (1182 nodes)
Target sequences:                          1  (1246 residues searched)
Passed MSV filter:                         1  (1); expected 0.0 (0.02)
Passed bias filter:                        1  (1); expected 0.0 (0.02)
Passed Vit filter:                         1  (1); expected 0.0 (0.001)
Passed Fwd filter:                         1  (1); expected 0.0 (1e-05)
Initial search space (Z):                  1  [actual number of targets]
Domain search space  (domZ):               1  [number of targets reported over threshold]
# CPU time: 0.08u 0.03s 00:00:00.11 Elapsed: 00:00:00.10
# Mc/sec: 13.68
//
[ok]
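
The domain table in the hmmsearch output above reports where the hit starts and ends on both the model and the candidate, which is enough to estimate how much of each is covered. The sketch below parses that single row by whitespace position. The field positions are an assumption read off the header row shown above, and parse_domain_row is a hypothetical helper. For real pipelines, the tabular output written by hmmsearch's --domtblout option is a more robust thing to parse.

```python
# Domain row copied from the hmmsearch table above.
domain_row = (
    "   1 ! 1779.8   0.0         0         0       2    1182 .]"
    "      26    1211 ..      25    1211 .. 0.98"
)

MODEL_LENGTH = 1182     # from "Query: TIGR02082 [M=1182]"
SEQUENCE_LENGTH = 1246  # from "Target sequences: 1 (1246 residues searched)"

def parse_domain_row(row):
    """Split one domain row on whitespace and pick fields by position.

    Assumed layout: # ! score bias c-Evalue i-Evalue hmmfrom hmmto ..
    alifrom alito .. envfrom envto .. acc
    """
    fields = row.split()
    return {
        "score": float(fields[2]),
        "hmm_from": int(fields[6]),
        "hmm_to": int(fields[7]),
        "ali_from": int(fields[9]),
        "ali_to": int(fields[10]),
        "acc": float(fields[15]),
    }

domain = parse_domain_row(domain_row)

# Fraction of the model's match states spanned by the alignment.
model_coverage = (domain["hmm_to"] - domain["hmm_from"] + 1) / MODEL_LENGTH
# Fraction of the candidate protein spanned by the alignment.
sequence_coverage = (domain["ali_to"] - domain["ali_from"] + 1) / SEQUENCE_LENGTH

print(f"model coverage    = {model_coverage:.2%}")
print(f"sequence coverage = {sequence_coverage:.2%}")
```

Here the alignment spans model positions 2 to 1182 of 1182, so nearly the whole model is covered, while candidate residues 26 to 1211 of 1246 are aligned.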

This GapMind analysis is from Apr 09 2024. The underlying query database was built on Apr 09 2024.

About GapMind

Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.

A candidate for a step is "high confidence" if either:

where "other" refers to the best ublast hit to a sequence that is not annotated as performing this step (and is not "ignored").

Otherwise, a candidate is "medium confidence" if either:

Other blast hits with at least 50% coverage are "low confidence."

Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:

GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).

For more information, see:

If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know.

by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory