Align 5-methyltetrahydropteroyltriglutamate-homocysteine S-methyltransferase (EC 2.1.1.14) (characterized)
to candidate AO353_16215 AO353_16215 5-methyltetrahydropteroyltriglutamate-- homocysteine methyltransferase
Query= BRENDA::P25665 (753 letters) >FitnessBrowser__pseudo3_N2E3:AO353_16215 Length = 770 Score = 936 bits (2420), Expect = 0.0 Identities = 473/764 (61%), Positives = 569/764 (74%), Gaps = 16/764 (2%) Query: 4 LNHTLGFPRVGLRRELKKAQESYWAGNSTREELLAVGRELRARHWDQQKQAGIDLLPVGD 63 L HTLGFPR+G RELKKA ESYW G+ + L +VGRELRARHW QK AGIDLLPVGD Sbjct: 3 LAHTLGFPRIGADRELKKALESYWKGDLDQAGLQSVGRELRARHWQVQKDAGIDLLPVGD 62 Query: 64 FAWYDHVLTTSLLLGNVPARHQN-KD--GSVDIDTLFRIGRGRAPT---GEPAA---AAE 114 FAWYD VLT SL G +P R + KD G +DTLF + RG + GE A A E Sbjct: 63 FAWYDQVLTHSLTFGVIPERFDSAKDAQGLPTLDTLFAMARGASAKCCDGESAKTHHAQE 122 Query: 115 MTKWFNTNYHYMVPEFVKGQQFKLTWTQLLDEVDEALALGHKVKPVLLGPVTWLWLGKVK 174 +TKWF+TNYHY+VPEF Q FKL+W QL DEVDEA ALGH+VKPV++GP+T+LWLGK K Sbjct: 123 LTKWFDTNYHYLVPEFSADQPFKLSWEQLFDEVDEAHALGHRVKPVIIGPLTYLWLGKTK 182 Query: 175 GEQFDRLSLLNDILPVYQQVLAELAKRGIEWVQIDEPALVLELPQAWLDAYKPAYDALQ- 233 G FD+L LL +LP+Y ++L L +G+EWVQIDEP L L+LPQAW +A++ AY LQ Sbjct: 183 GNDFDKLELLERLLPLYGEILNRLKAQGVEWVQIDEPILTLDLPQAWKNAFERAYHILQY 242 Query: 234 GQVKLLLTTYFEGVTPNLDTITALPVQGLHVDLVHGKDDVAELHKRLPSDWLLSAGLING 293 +K L+ TYF G+ NL +LPV GLH+D V + + ++ RLP+ +LS GL+NG Sbjct: 243 SPLKKLVATYFSGLEDNLGLAVSLPVDGLHIDAVRAPEQLGQVLDRLPTYKVLSVGLVNG 302 Query: 294 RNVWRADLTEKYAQIKDIVGKR--DLWVASSCSLLHSPIDLSVETRLDAEVKSWFAFALQ 351 RNVWR +L + AQ+K + +LWV+SSCSLLHSP+DL E LDAE+KSW AFA+Q Sbjct: 303 RNVWRCELEQALAQLKPAQERYGDNLWVSSSCSLLHSPVDLDREDTLDAELKSWLAFAVQ 362 Query: 352 KCHELALLRDALNSGDT----AALAEWSAPIQARRHSTRVHNPAVEKRLAAITAQDSQRA 407 KC E+A+LRDALN AALA+ A +R S R+H V+ R+ AI DSQR Sbjct: 363 KCAEIAVLRDALNDPQAPKVQAALAQNRAVQASRAQSPRIHKAEVQARIRAIRDTDSQRH 422 Query: 408 NVYEVRAEAQRARFKLPAWPTTTIGSFPQTTEIRTLRLDFKKGNLDANNYRTGIAEHIKQ 467 +V+ R E QRAR LPA+PTTTIGSFPQT IR R FK+G L AN+Y + I+ Sbjct: 423 SVFAKRIELQRARLNLPAFPTTTIGSFPQTASIRLARQSFKQGKLSANDYTDAMHSEIRH 482 Query: 468 AIVEQERLGLDVLVHGEAERNDMVEYFGEHLDGFVFTQNGWVQSYGSRCVKPPIVIGDIS 527 A+ QERLGLDVLVHGEAERNDMVEYF E LDG++FT+ GWVQSYGSRCVKP I+ GD+S Sbjct: 483 AVQIQERLGLDVLVHGEAERNDMVEYFAEQLDGYLFTRFGWVQSYGSRCVKPAIIYGDLS 542 Query: 528 RPAPITVEWAKYAQSLTDKPVKGMLTGPVTILCWSFPREDVSRETIAKQIALALRDEVAD 587 RP +TV+W YAQSLTDK +KGMLTGPVT+L WSFPREDVSR+ A+Q+ALALRDEV D Sbjct: 543 RPKAMTVDWITYAQSLTDKVMKGMLTGPVTMLMWSFPREDVSRKVQAQQLALALRDEVVD 602 Query: 588 LEAAGIGIIQIDEPALREGLPLRRSDWDAYLQWGVEAFRINAAVAKDDTQIHTHMCYCEF 647 LE AGI I+QIDE A REGLPLRR+ W YL W V+AFR++A+ +D+TQIHTHMCY EF Sbjct: 603 LEQAGIKIVQIDEAAFREGLPLRRAQWQEYLDWAVQAFRLSASGVRDETQIHTHMCYSEF 662 Query: 648 NDIMDSIAALDADVITIETSRSDMELLESFEEFDYPNEIGPGVYDIHSPNVPSVEWIEAL 707 ND+++SIAA+DADVITIETSRSDMELLE+F+ FDYPN+IGPGVYDIHSP VP + L Sbjct: 663 NDVIESIAAMDADVITIETSRSDMELLEAFKAFDYPNDIGPGVYDIHSPRVPDTTEMVKL 722 Query: 708 LKKAAKRIPAERLWVNPDCGLKTRGWPETRAALANMVQAAQNLR 751 + KA +RIPAERLWVNPDCGLKTR WPET AAL NMV AA+ LR Sbjct: 723 MSKAVQRIPAERLWVNPDCGLKTRAWPETEAALVNMVAAARQLR 766 Lambda K H 0.319 0.135 0.416 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 1 Number of Hits to DB: 1733 Number of extensions: 84 Number of successful extensions: 7 Number of sequences better than 1.0e-02: 1 Number of HSP's gapped: 1 Number of HSP's successfully gapped: 1 Length of query: 753 Length of database: 770 Length adjustment: 40 Effective length of query: 713 Effective length of database: 730 Effective search space: 520490 Effective search space used: 520490 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 55 (25.8 bits)
Align candidate AO353_16215 AO353_16215 (5-methyltetrahydropteroyltriglutamate-- homocysteine methyltransferase)
to HMM TIGR01371 (metE: 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase (EC 2.1.1.14))
# hmmsearch :: search profile(s) against a sequence database # HMMER 3.3.1 (Jul 2020); http://hmmer.org/ # Copyright (C) 2020 Howard Hughes Medical Institute. # Freely distributed under the BSD open source license. # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - # query HMM file: ../tmp/path.aa/TIGR01371.hmm # target sequence database: /tmp/gapView.24509.genome.faa # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: TIGR01371 [M=754] Accession: TIGR01371 Description: met_syn_B12ind: 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase Scores for complete sequences (score includes all domains): --- full sequence --- --- best 1 domain --- -#dom- E-value score bias E-value score bias exp N Sequence Description ------- ------ ----- ------- ------ ----- ---- -- -------- ----------- 0 1117.9 0.0 0 1117.7 0.0 1.0 1 lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 AO353_16215 5-methyltetrahydropt Domain annotation for each sequence (and alignments): >> lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 AO353_16215 5-methyltetrahydropteroyltriglutamate-- homocysteine methyl # score bias c-Evalue i-Evalue hmmfrom hmm to alifrom ali to envfrom env to acc --- ------ ----- --------- --------- ------- ------- ------- ------- ------- ------- ---- 1 ! 1117.7 0.0 0 0 1 754 [] 7 768 .. 7 768 .. 0.98 Alignments for each domain: == domain 1 score: 1117.7 bits; conditional E-value: 0 TIGR01371 1 lgfPrigekRelkkalekywkgkiskeellkvakdlrkkalkkqkeagvdvipvndfslYDhvLd 65 lgfPrig+ Relkkale+ywkg+++++ l++v ++lr+++++ qk+ag+d++pv+df++YD+vL+ lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 7 LGFPRIGADRELKKALESYWKGDLDQAGLQSVGRELRARHWQVQKDAGIDLLPVGDFAWYDQVLT 71 79*************************************************************** PP TIGR01371 66 tavllgaiperfkeladde..sdldtyFaiaRGtek........kdvaalemtkwfntnYhYlvP 120 ++ +g+iperf++ +d++ +ldt+Fa+aRG + k ++a+e+tkwf+tnYhYlvP lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 72 HSLTFGVIPERFDSAKDAQglPTLDTLFAMARGASAkccdgesaKTHHAQELTKWFDTNYHYLVP 136 ************99876652146**********9889*****99999****************** PP TIGR01371 121 elskeeefklsknklleeykeakelgvetkPvllGpitflkLakakeeeekellellekllpvYk 185 e+s+++ fkls+++l++e++ea++lg+ +kPv++Gp+t+l+L+k+k ++++++lelle+llp+Y lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 137 EFSADQPFKLSWEQLFDEVDEAHALGHRVKPVIIGPLTYLWLGKTK-GNDFDKLELLERLLPLYG 200 ********************************************99.589*************** PP TIGR01371 186 evlkklaeagvewvqidePvlvldlskeelaavkeayeeleeaskelklllqtYfdsveealekl 250 e+l++l+++gvewvqideP+l ldl++++++a+++ay+ l+ + lk l++tYf +e++l + lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 201 EILNRLKAQGVEWVQIDEPILTLDLPQAWKNAFERAYHILQY--SPLKKLVATYFSGLEDNLGLA 263 ****************************************96..6778899************** PP TIGR01371 251 vslpvealglDlveakeelelakakfeedkvLvaGvidGrniwkadlekslkllkkleakagdkl 315 vslpv++l++D v+a+e+l ++ ++++ kvL++G+++Grn+w+ +le++l++lk +++ gd+l lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 264 VSLPVDGLHIDAVRAPEQLGQVLDRLPTYKVLSVGLVNGRNVWRCELEQALAQLKPAQERYGDNL 328 ***************************************************************** PP TIGR01371 316 vvstscsllhvpvdleleekldkelkellafakekleelkvlkealeg.ea.avaealeaeaaai 378 +vs+scsllh+pvdl++e++ld+elk++lafa++k++e++vl++al++ +a +v++al++++a lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 329 WVSSSCSLLHSPVDLDREDTLDAELKSWLAFAVQKCAEIAVLRDALNDpQApKVQAALAQNRAVQ 393 ************************************************85415667778887788 PP TIGR01371 379 aarkkskrvadekvkerlealkekkarressfeeRaeaqekklnlPllPtttiGsfPqtkevRka 443 a+r++s+r+++++v++r++a+++++++r+s f++R e q+++lnlP++PtttiGsfPqt+++R a lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 394 ASRAQSPRIHKAEVQARIRAIRDTDSQRHSVFAKRIELQRARLNLPAFPTTTIGSFPQTASIRLA 458 8999************************************************************* PP TIGR01371 444 RakfrkgeiseeeYekfikeeikkviklqeelglDvLvhGefeRnDmveyFgeklaGfaftqngW 508 R++f++g++s+++Y++++++ei++++++qe+lglDvLvhGe+eRnDmveyF+e+l+G++ft++gW lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 459 RQSFKQGKLSANDYTDAMHSEIRHAVQIQERLGLDVLVHGEAERNDMVEYFAEQLDGYLFTRFGW 523 ***************************************************************** PP TIGR01371 509 vqsYGsRcvkPpiiygdvsrpkpmtvkeskyaqsltskpvkGmLtGPvtilnWsfvReDlprkei 573 vqsYGsRcvkP+iiygd+srpk+mtv++++yaqslt+k +kGmLtGPvt+l Wsf+ReD++rk + lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 524 VQSYGSRCVKPAIIYGDLSRPKAMTVDWITYAQSLTDKVMKGMLTGPVTMLMWSFPREDVSRKVQ 588 ***************************************************************** PP TIGR01371 574 aeqialalrdevkdLeeagikiiqiDepalReglPlrksdkeeYldwaveaFrlaasgvkdetqi 638 a+q+alalrdev+dLe+agiki+qiDe+a+ReglPlr+++++eYldwav+aFrl+asgv+detqi lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 589 AQQLALALRDEVVDLEQAGIKIVQIDEAAFREGLPLRRAQWQEYLDWAVQAFRLSASGVRDETQI 653 ***************************************************************** PP TIGR01371 639 hthmCYsefneiieaiaaldaDvisieasrsdmelldalkeikkyekeiGlGvyDihsprvPske 703 hthmCYsefn++ie+iaa+daDvi+ie+srsdmell+a+k+ ++y+++iG+GvyDihsprvP + lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 654 HTHMCYSEFNDVIESIAAMDADVITIETSRSDMELLEAFKA-FDYPNDIGPGVYDIHSPRVPDTT 717 *****************************************.77********************* PP TIGR01371 704 elaellekalkklpkerlWvnPDCGLktRkweevkaalknlveaakelRek 754 e+ +l++ka++++p+erlWvnPDCGLktR+w+e++aal n+v+aa++lR++ lcl|FitnessBrowser__pseudo3_N2E3:AO353_16215 718 EMVKLMSKAVQRIPAERLWVNPDCGLKTRAWPETEAALVNMVAAARQLRNQ 768 *************************************************86 PP Internal pipeline statistics summary: ------------------------------------- Query model(s): 1 (754 nodes) Target sequences: 1 (770 residues searched) Passed MSV filter: 1 (1); expected 0.0 (0.02) Passed bias filter: 1 (1); expected 0.0 (0.02) Passed Vit filter: 1 (1); expected 0.0 (0.001) Passed Fwd filter: 1 (1); expected 0.0 (1e-05) Initial search space (Z): 1 [actual number of targets] Domain search space (domZ): 1 [number of targets reported over threshold] # CPU time: 0.04u 0.02s 00:00:00.06 Elapsed: 00:00:00.05 # Mc/sec: 10.62 // [ok]
This GapMind analysis is from Apr 09 2024. The underlying query database was built on Apr 09 2024.
Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.
A candidate for a step is "high confidence" if either:
Otherwise, a candidate is "medium confidence" if either:
Other blast hits with at least 50% coverage are "low confidence."
Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:
GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
For more information, see:
If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know
by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory