GapMind for Amino acid biosynthesis

 

Alignments for a candidate for metH in Thiomicrospira pelophila DSM 1534

Align methionine synthase; EC 2.1.1.13 (characterized)
to candidate WP_029933551.1 N746_RS0101265 methionine synthase

Query= CharProtDB::CH_090726
         (1227 letters)



>NCBI__GCF_000711195.1:WP_029933551.1
          Length = 1239

 Score = 1575 bits (4077), Expect = 0.0
 Identities = 781/1234 (63%), Positives = 970/1234 (78%), Gaps = 15/1234 (1%)

Query: 4    KVEQLRAQLNERILVLDGGMGTMIQSYRLNEADFRGERFADWPCDLKGNNDLLVLSKPEV 63
            ++ QL++ L ER++VLDG MGTMIQ+ +L+E DFRG RFAD+  D+KGNND+LVL++P++
Sbjct: 7    RIAQLKSLLTERVVVLDGAMGTMIQNLQLSEDDFRGARFADYHMDIKGNNDILVLTQPQL 66

Query: 64   IAAIHNAYFEAGADIIETNTFNSTTIAMADYQMESLSAEINFAAAKLARACADEWTARTP 123
            I  IH ++   G D++ETN+FN+TTIA ADY MES   +IN  AA++AR  A E      
Sbjct: 67   IRDIHLSFLRQGVDLLETNSFNATTIAQADYDMESQVRDINLHAAQVARE-ACEIAQNED 125

Query: 124  EKPRYVAGVLGPTNRTASISPDVNDPAFRNITFDGLVAAYRESTKALVEGGADLILIETV 183
             KPR+VAGVLGPTNRTASISPDVNDP FRN  FD LV AY ++T+AL++GGAD+ILIET+
Sbjct: 126  GKPRFVAGVLGPTNRTASISPDVNDPGFRNTHFDELVEAYVQATQALLDGGADVILIETI 185

Query: 184  FDTLNAKAAVFAVKTEFEALGVELPIMISGTITDASGRTLSGQTTEAFYNSLRHAEALTF 243
            FDTLNAKAA+FAVK   + LG E+PIMISGTITDASGRTLSGQTTEAFYN++RHA+ L+ 
Sbjct: 186  FDTLNAKAAIFAVKQVEDTLGEEVPIMISGTITDASGRTLSGQTTEAFYNAIRHAKPLSI 245

Query: 244  GLNCALGPDELRQYVQELSRIAECYVTAHPNAGLPNAFGEYDLDADTMAKQIREWAQAGF 303
            GLNCALGP+ELR YV+ELSRI E +V+ HPNAGLPN FGEYD   + MA +++ WA+ G+
Sbjct: 246  GLNCALGPEELRPYVEELSRICETHVSVHPNAGLPNEFGEYDETPEQMAAEVKTWAERGW 305

Query: 304  LNIVGGCCGTTPQHIAAMSRAVEGLAPRKLPEIPVACRLSGLEPLNIGEDSLFVNVGERT 363
            LNI+GGCCGTTP H+AAM++A    APR +P   + CRLSGLEP+NIGE SLFVNVGER 
Sbjct: 306  LNIIGGCCGTTPDHVAAMAKAALAFAPRNIPTPKIECRLSGLEPMNIGETSLFVNVGERN 365

Query: 364  NVTGSAKFKRLIKEEKYSEALDVARQQVENGAQIIDINMDEGMLDAEAAMVRFLNLIAGE 423
            NVTGSA FKRLI E+ Y +A+++A +QV +GAQIID+NMDEGML+A+A M RFLN++AGE
Sbjct: 366  NVTGSALFKRLIIEDNYEQAVEIAVKQVNDGAQIIDVNMDEGMLNAKACMTRFLNMMAGE 425

Query: 424  PDIARVPIMIDSSKWDVIEKGLKCIQGKGIVNSISMKEGVDAFIHHAKLLRRYGAAVVVM 483
            PD ARVPIMIDSSKW+ IE GLKCIQGKGIVNSIS+KEG + F+  AK+++RYGAA +VM
Sbjct: 426  PDAARVPIMIDSSKWEAIEAGLKCIQGKGIVNSISLKEGEENFLAQAKMIQRYGAATIVM 485

Query: 484  AFDEQGQADTRARKIEICRRAYKILTEEVGFPPEDIIFDPNIFAVATGIEEHNNYAQDFI 543
            AFDE GQADT  RK EIC+R+Y +L    GFPPEDIIFDPNIFA+ATGIEEHNNYA DFI
Sbjct: 486  AFDEDGQADTFERKKEICKRSYDLLVAN-GFPPEDIIFDPNIFAIATGIEEHNNYAMDFI 544

Query: 544  GACEDIKRELPHALISGGVSNVSFSFRGNDPVREAIHAVFLYYAIRNGMDMGIVNAGQLA 603
             A + IK+ LPHALISGGVSNVSFSFRGN+PVREAIH+VFLYYAI+ GMDMGIVNA Q+A
Sbjct: 545  NAVDWIKKNLPHALISGGVSNVSFSFRGNNPVREAIHSVFLYYAIQKGMDMGIVNAAQMA 604

Query: 604  IYDDLPAELRDAVEDVILNRRDDGTERLLELAEKYRGSKTDDTANAQQAE--WRSWEVNK 661
            +YDDL  EL+ AVEDV+LN+     +RLLE+AEK+RG   D +  +++++  WR   V K
Sbjct: 605  VYDDLDPELKQAVEDVVLNKDSGAADRLLEVAEKFRG---DGSVQSKESDIRWREASVEK 661

Query: 662  RLEYSLVKGITEFIEQDTEEARQQATRPIEVIEGPLMDGMNVVGDLFGEGKMFLPQVVKS 721
            RLE+SLVKGIT++IE DTEEAR     P++VIEGPLMDGMNVVGDLFG GKMFLPQVVKS
Sbjct: 662  RLEHSLVKGITDYIEDDTEEARTTLGSPLQVIEGPLMDGMNVVGDLFGAGKMFLPQVVKS 721

Query: 722  ARVMKQAVAYLEPFIEASKEQGKTNGKMVIATVKGDVHDIGKNIVGVVLQCNNYEIVDLG 781
            ARVMK+AVAYL+P++EA K  G+  GK+V+ATVKGDVHDIGKNIVGVVLQCNN++++DLG
Sbjct: 722  ARVMKRAVAYLQPYLEAEKASGQAKGKIVMATVKGDVHDIGKNIVGVVLQCNNFDVIDLG 781

Query: 782  VMVPAEKILRTAKEVNADLIGLSGLITPSLDEMVNVAKEMERQGFTIPLLIGGATTSKAH 841
            VMVP EKIL TA + NA++IGLSGLITPSL+EMVNVAKEM+R+G  +PLLIGGATTSKAH
Sbjct: 782  VMVPTEKILDTAIKENANVIGLSGLITPSLEEMVNVAKEMQRRGMNLPLLIGGATTSKAH 841

Query: 842  TAVKIEQNYSGPTVYVQNASRTVGVVAALLSDTQRDDFVARTRKEYETVRIQHGRKKPRT 901
            TAVKIE  Y  P VYV++ASR VGV  +L+S+  + D+ A+ R EYE +R +   +  + 
Sbjct: 842  TAVKIEPQYDHPVVYVKDASRAVGVAQSLISNDLKVDYAAKIRAEYEQLREERKARSKQV 901

Query: 902  PPVTLEAARDNDFAFDWQAYTPPVAHRLGVQEV-EASIETLRNYIDWTPFFMTWSLAGKY 960
                L  AR+N    DW  Y PP    LG +   +  +  L + +DW+PFF +W L G Y
Sbjct: 902  KRTPLNKARENPIPVDWTNYVPPKPSFLGTKVFDDIPLTELLDRLDWSPFFQSWDLHGLY 961

Query: 961  PRILEDEVVGVEAQRLFKDANDMLDKLSAEKTLNPRGVVGLFPANRVGDDIEIYRDETRT 1020
            PRIL+D++VG EA+++F DA  ML  +  EK L  R V+G +PAN V DDIE+Y+D++R 
Sbjct: 962  PRILDDKLVGEEARKVFADAKTMLQTIIKEKWLTARAVIGFYPANSVVDDIELYKDDSRE 1021

Query: 1021 HVINVSHHLRQQTE-KTGFANYCLADFVAPKLSGK------ADYIGAFAVTGGLEEDALA 1073
             V+   H+LRQQ + K G  N CL+D++APK + +       DYIGAFAVT G+  D   
Sbjct: 1022 EVLTRLHNLRQQADKKAGQYNQCLSDYIAPKANKQQPGLVLEDYIGAFAVTAGIGIDEHI 1081

Query: 1074 DAFEAQHDDYNKIMVKALADRLAEAFAEYLHERVRKVYWGYAPNENLSNEELIRENYQGI 1133
              FEA HDDY  IM+KALADRLAEA AEY+HE+VRK +WGYA +E L+NEE+I+E Y+GI
Sbjct: 1082 ARFEAAHDDYQAIMLKALADRLAEALAEYMHEKVRKEFWGYAADEQLTNEEMIKEKYRGI 1141

Query: 1134 RPAPGYPACPEHTEKATIWELLEVEKHTGMKLTESFAMWPGASVSGWYFSHPDSKYYAVA 1193
            RPAPGYPA PEHTEK T+W LL+ +   G++LT S+AM P A+VSGWYF+HP+SKY+ V 
Sbjct: 1142 RPAPGYPANPEHTEKGTLWSLLKPDSEIGLELTSSYAMTPTAAVSGWYFAHPESKYFGVG 1201

Query: 1194 QIQRDQVEDYARRKGMSVTEVERWLAPNLGYDAD 1227
             I RDQ EDYA+RK  ++ E E+WLAPNLGYD +
Sbjct: 1202 SIGRDQAEDYAKRKNWTIQEAEKWLAPNLGYDPE 1235


Lambda     K      H
   0.318    0.134    0.391 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 1
Number of Hits to DB: 3748
Number of extensions: 151
Number of successful extensions: 7
Number of sequences better than 1.0e-02: 1
Number of HSP's gapped: 1
Number of HSP's successfully gapped: 1
Length of query: 1227
Length of database: 1239
Length adjustment: 47
Effective length of query: 1180
Effective length of database: 1192
Effective search space:  1406560
Effective search space used:  1406560
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)

Align candidate WP_029933551.1 N746_RS0101265 (methionine synthase)
to HMM TIGR02082 (metH: methionine synthase (EC 2.1.1.13))

# hmmsearch :: search profile(s) against a sequence database
# HMMER 3.3.1 (Jul 2020); http://hmmer.org/
# Copyright (C) 2020 Howard Hughes Medical Institute.
# Freely distributed under the BSD open source license.
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
# query HMM file:                  ../tmp/path.aa/TIGR02082.hmm
# target sequence database:        /tmp/gapView.12619.genome.faa
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Query:       TIGR02082  [M=1182]
Accession:   TIGR02082
Description: metH: methionine synthase
Scores for complete sequences (score includes all domains):
   --- full sequence ---   --- best 1 domain ---    -#dom-
    E-value  score  bias    E-value  score  bias    exp  N  Sequence                                 Description
    ------- ------ -----    ------- ------ -----   ---- --  --------                                 -----------
          0 1747.4   0.7          0 1747.2   0.7    1.0  1  lcl|NCBI__GCF_000711195.1:WP_029933551.1  N746_RS0101265 methionine syntha


Domain annotation for each sequence (and alignments):
>> lcl|NCBI__GCF_000711195.1:WP_029933551.1  N746_RS0101265 methionine synthase
   #    score  bias  c-Evalue  i-Evalue hmmfrom  hmm to    alifrom  ali to    envfrom  env to     acc
 ---   ------ ----- --------- --------- ------- -------    ------- -------    ------- -------    ----
   1 ! 1747.2   0.7         0         0       2    1182 .]      16    1200 ..      15    1200 .. 0.98

  Alignments for each domain:
  == domain 1  score: 1747.2 bits;  conditional E-value: 0
                                 TIGR02082    2 nkrilvlDGamGtqlqsanLteadFrge.eadlarelkGnndlLnltkPeviaaihrayfeaGaDiv 67  
                                                 +r++vlDGamGt++q+ +L e+dFrg+ +ad++++ kGnnd+L+lt+P++i++ih +++  G D++
  lcl|NCBI__GCF_000711195.1:WP_029933551.1   16 TERVVVLDGAMGTMIQNLQLSEDDFRGArFADYHMDIKGNNDILVLTQPQLIRDIHLSFLRQGVDLL 82  
                                                68***************************************************************** PP

                                 TIGR02082   68 etntFnsteialadYdledkayelnkkaaklarevadeftltpekkRfvaGslGPtnklatlspdve 134 
                                                etn Fn+t+ia+adYd+e++++++n +aa++are+++  + +  k+RfvaG+lGPtn++a++spdv+
  lcl|NCBI__GCF_000711195.1:WP_029933551.1   83 ETNSFNATTIAQADYDMESQVRDINLHAAQVAREACEIAQNEDGKPRFVAGVLGPTNRTASISPDVN 149 
                                                ****************************************9************************** PP

                                 TIGR02082  135 rpefrnvtydelvdaYkeqvkglldGGvDllLietvfDtlnakaalfaveevfeekgrelPilisgv 201 
                                                +p+frn+++delv+aY++++++lldGG+D++Liet+fDtlnakaa+fav++v +  g+e+Pi+isg+
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  150 DPGFRNTHFDELVEAYVQATQALLDGGADVILIETIFDTLNAKAAIFAVKQVEDTLGEEVPIMISGT 216 
                                                ******************************************************************* PP

                                 TIGR02082  202 ivdksGrtLsGqtleaflaslehaeililGLnCalGadelrefvkelsetaealvsviPnaGLPnal 268 
                                                i+d+sGrtLsGqt+eaf+++++ha+ l++GLnCalG++elr++v+els++ e+ vsv+PnaGLPn++
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  217 ITDASGRTLSGQTTEAFYNAIRHAKPLSIGLNCALGPEELRPYVEELSRICETHVSVHPNAGLPNEF 283 
                                                ******************************************************************* PP

                                 TIGR02082  269 geYdltpeelakalkefaeegllnivGGCCGttPehiraiaeavkdikprkrqeleeksvlsgleal 335 
                                                geYd+tpe++a+++k +ae g lni+GGCCGttP+h++a+a+a+   +pr+ ++ + +++lsgle++
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  284 GEYDETPEQMAAEVKTWAERGWLNIIGGCCGTTPDHVAAMAKAALAFAPRNIPTPKIECRLSGLEPM 350 
                                                ******************************************************************* PP

                                 TIGR02082  336 kiaqessfvniGeRtnvaGskkfrklikaedyeealkiakqqveeGaqilDinvDevllDgeadmkk 402 
                                                +i++ s fvn+GeR nv+Gs+ f++li++++ye+a++ia +qv++Gaqi+D+n+De++l+++a m++
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  351 NIGETSLFVNVGERNNVTGSALFKRLIIEDNYEQAVEIAVKQVNDGAQIIDVNMDEGMLNAKACMTR 417 
                                                ******************************************************************* PP

                                 TIGR02082  403 llsllasepdiakvPlmlDssefevleaGLkviqGkaivnsislkdGeerFlekaklikeyGaavvv 469 
                                                +l+++a+epd a+vP+m+Dss++e +eaGLk+iqGk+ivnsislk+Gee+Fl +ak+i++yGaa +v
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  418 FLNMMAGEPDAARVPIMIDSSKWEAIEAGLKCIQGKGIVNSISLKEGEENFLAQAKMIQRYGAATIV 484 
                                                ******************************************************************* PP

                                 TIGR02082  470 mafDeeGqartadkkieiakRayklltekvgfppediifDpniltiatGieehdryaidfieairei 536 
                                                mafDe+Gqa+t ++k ei+kR y+ll++  gfppediifDpni++iatGieeh++ya+dfi+a+ +i
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  485 MAFDEDGQADTFERKKEICKRSYDLLVA-NGFPPEDIIFDPNIFAIATGIEEHNNYAMDFINAVDWI 550 
                                                ***************************9.8************************************* PP

                                 TIGR02082  537 keelPdakisgGvsnvsFslrgndavRealhsvFLyeaikaGlDmgivnagklavyddidkelrevv 603 
                                                k++lP+a isgGvsnvsFs+rgn++vRea+hsvFLy+ai+ G+Dmgivna+++avydd+d+el+++v
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  551 KKNLPHALISGGVSNVSFSFRGNNPVREAIHSVFLYYAIQKGMDMGIVNAAQMAVYDDLDPELKQAV 617 
                                                ******************************************************************* PP

                                 TIGR02082  604 edlildrrreatekLlelaelykgtkeksskeaqeaewrnlpveeRLeralvkGeregieedleear 670 
                                                ed++l++++ a ++Lle+ae+++g  + +sk +    wr+ +ve+RLe++lvkG++++ie+d+eear
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  618 EDVVLNKDSGAADRLLEVAEKFRGDGSVQSK-ESDIRWREASVEKRLEHSLVKGITDYIEDDTEEAR 683 
                                                **************************99777.66789****************************** PP

                                 TIGR02082  671 kklkapleiiegpLldGmkvvGdLFGsGkmfLPqvvksarvmkkavayLePylekekeedkskGkiv 737 
                                                 +l +pl++iegpL+dGm+vvGdLFG+GkmfLPqvvksarvmk+avayL+Pyle+ek++ ++kGkiv
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  684 TTLGSPLQVIEGPLMDGMNVVGDLFGAGKMFLPQVVKSARVMKRAVAYLQPYLEAEKASGQAKGKIV 750 
                                                ******************************************************************* PP

                                 TIGR02082  738 latvkGDvhDiGknivdvvLscngyevvdlGvkvPvekileaakkkkaDviglsGLivksldemvev 804 
                                                +atvkGDvhDiGkniv+vvL+cn+++v+dlGv+vP+ekil++a k++a+viglsGLi++sl+emv+v
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  751 MATVKGDVHDIGKNIVGVVLQCNNFDVIDLGVMVPTEKILDTAIKENANVIGLSGLITPSLEEMVNV 817 
                                                ******************************************************************* PP

                                 TIGR02082  805 aeemerrgvkiPlllGGaalskahvavkiaekYkgevvyvkdaseavkvvdkllsekkkaeelekik 871 
                                                a+em+rrg+++Pll+GGa++skah+avki+++Y+ +vvyvkdas+av v+++l+s++ k ++++ki+
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  818 AKEMQRRGMNLPLLIGGATTSKAHTAVKIEPQYDHPVVYVKDASRAVGVAQSLISNDLKVDYAAKIR 884 
                                                ******************************************************************* PP

                                 TIGR02082  872 eeyeeirekfgekkeklialsekaarkevfaldrsedlevpapkflGtkvleas.ieellkyiDwka 937 
                                                +eye++re+ + + ++++  ++++ar++ + +d++ ++ +p+p flGtkv++++ + ell+ +Dw +
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  885 AEYEQLREERKARSKQVKRTPLNKARENPIPVDWT-NYVPPKPSFLGTKVFDDIpLTELLDRLDWSP 950 
                                                ***********************************.9****************************** PP

                                 TIGR02082  938 lFvqWelrgkypkilkdeleglearklfkdakelldklsaekllrargvvGlfPaqsvgddieiytd 1004
                                                +F +W+l+g yp+il+d+l+g+eark+f+dak +l+++++ek l+ar+v+G++Pa+sv ddie+y+d
  lcl|NCBI__GCF_000711195.1:WP_029933551.1  951 FFQSWDLHGLYPRILDDKLVGEEARKVFADAKTMLQTIIKEKWLTARAVIGFYPANSVVDDIELYKD 1017
                                                ******************************************************************9 PP

                                 TIGR02082 1005 etvsqetkpiatvrekleqlrqqsdr.ylclaDfiaskes....G..ikDylgallvtaglgaeela 1064
                                                +++    ++++ +++ ++q+ ++ ++ ++cl+D+ia+k+     G   +Dy+ga++vtag+g++e  
  lcl|NCBI__GCF_000711195.1:WP_029933551.1 1018 DSR---EEVLTRLHNLRQQADKKAGQyNQCLSDYIAPKANkqqpGlvLEDYIGAFAVTAGIGIDEHI 1081
                                                999...44555555556777777777799********955445652258****************** PP

                                 TIGR02082 1065 kkleakeddydsilvkaladrlaealaellhervRkelwgyaeeenldkedllkerYrGirpafGYp 1131
                                                 ++ea +ddy++i++kaladrlaealae++he+vRke+wgya++e+l +e+++ke+YrGirpa+GYp
  lcl|NCBI__GCF_000711195.1:WP_029933551.1 1082 ARFEAAHDDYQAIMLKALADRLAEALAEYMHEKVRKEFWGYAADEQLTNEEMIKEKYRGIRPAPGYP 1148
                                                ******************************************************************* PP

                                 TIGR02082 1132 acPdhtekatlleLleaer.iGlklteslalaPeasvsglyfahpeakYfav 1182
                                                a P+htek tl++Ll++++ iGl+lt s+a++P+a+vsg+yfahpe+kYf v
  lcl|NCBI__GCF_000711195.1:WP_029933551.1 1149 ANPEHTEKGTLWSLLKPDSeIGLELTSSYAMTPTAAVSGWYFAHPESKYFGV 1200
                                                ******************99******************************86 PP



Internal pipeline statistics summary:
-------------------------------------
Query model(s):                            1  (1182 nodes)
Target sequences:                          1  (1239 residues searched)
Passed MSV filter:                         1  (1); expected 0.0 (0.02)
Passed bias filter:                        1  (1); expected 0.0 (0.02)
Passed Vit filter:                         1  (1); expected 0.0 (0.001)
Passed Fwd filter:                         1  (1); expected 0.0 (1e-05)
Initial search space (Z):                  1  [actual number of targets]
Domain search space  (domZ):               1  [number of targets reported over threshold]
# CPU time: 0.11u 0.05s 00:00:00.16 Elapsed: 00:00:00.15
# Mc/sec: 9.31
//
[ok]

This GapMind analysis is from Apr 10 2024. The underlying query database was built on Apr 09 2024.

Links

Downloads

Related tools

About GapMind

Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.

A candidate for a step is "high confidence" if either:

where "other" refers to the best ublast hit to a sequence that is not annotated as performing this step (and is not "ignored").

Otherwise, a candidate is "medium confidence" if either:

Other blast hits with at least 50% coverage are "low confidence."

Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:

GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).

For more information, see:

If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know

by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory