Align L-2-aminoadipate reductase; Alpha-aminoadipate reductase; Alpha-AR; L-aminoadipate-semialdehyde dehydrogenase; EC 1.2.1.31; EC 1.2.1.95 (characterized)
to candidate AO356_28220 AO356_28220 non-ribosomal peptide synthetase
Query= SwissProt::P40976 (1419 letters) >FitnessBrowser__pseudo5_N2C3_1:AO356_28220 Length = 8596 Score = 256 bits (653), Expect = 9e-71 Identities = 215/740 (29%), Positives = 332/740 (44%), Gaps = 95/740 (12%) Query: 215 LKLIYNQLLFSESRVNIVADQLLKLVVSASKDVTGPIGALDLMTPTQMNVLPDPTV-DLD 273 L IYN+ F + +AD+ + ++ +D T PI L T + ++L D Sbjct: 401 LHWIYNESYFQADEIESMADRFMHVLEQGLRDDTLPIEGFVLPTAAEADMLQAWNASDAR 460 Query: 274 WSGYRGAIQDIFASNAAKFPDRECIVVTPSVTIDAPVTSYTYRQIDESSNILAHHLVKNG 333 + I +F + PD +V + TY +++ +N +AH L+ G Sbjct: 461 TYEHDHTIHGLFEAQVRARPD--------AVAVLHEDRCLTYGELNARANQVAHRLLILG 512 Query: 334 IERGDVVMVYAYRGVDLVVAVMGVLKAGATFSVIDPAYPPARQIIYLSVAKPRALVVLED 393 + D V + RG+D+++ ++G+LK+GA + +DPAYP R L + P AL+ Sbjct: 513 VHPDDRVAICVERGLDMIIGLLGILKSGAGYVPLDPAYPQERLAFMLDDSAPVALLTQST 572 Query: 394 AGVLSPTVVEYVEKSLELKTYVPALKLAKDGSLTGGSVSKGADDILQHVLHLKSEQTGVV 453 V P + VP L L + + + S+ D+ L S V Sbjct: 573 LQVQLPAL------------QVPVLLLDQAEAAGITAQSRYNPDVRT----LASHHLAYV 616 Query: 454 VGPDSTPTLSFTSGSEGIPKGVKGRHFSLAYYFDWMAQEFNLSESDRFTMLSGIAHDPIQ 513 + +TSGS G+PKGV H ++A F F D + + A D Sbjct: 617 I---------YTSGSTGLPKGVMVEHRNVARLFSATQPWFEFGPQDVWALFHSFAFDFSV 667 Query: 514 RDIFTPLFLGASLIVPTAEDIGTPGQLAQWANKYKVTVTHLTP-AMGQLLAAQADEPI-P 571 +I+ L G L+V +P + VTV + TP A QL+AAQ D + Sbjct: 668 WEIWGALTHGGRLLVVPQLVSRSPQDCYALLCEAGVTVLNQTPSAFRQLIAAQGDSDLCH 727 Query: 572 SLHHAFFVGDILTKRDCLRLQVLANNVN--VVNMYGTTETQRSVSYFVVPARSQDQTFLE 629 SL F G+ L A N +VNMYG TET V+Y + A T Sbjct: 728 SLRQVIFGGEALETSMLKPWYARATNAGTQLVNMYGITETTVHVTYRALCAADAQLT--- 784 Query: 630 SQKDVIPAGRGMKNVQLLVINRFDTNKICGIGEVGEIYLRAGGLAEGYLGNDELTSKKFL 689 V P G+ + +++L +++++ + +G GE+Y+ G+A GYL +L +FL Sbjct: 785 ---GVSPIGKRIPDLRLYLLDKY--GQPVPVGVEGELYVGGAGVARGYLNRPQLDETRFL 839 Query: 690 KSWFADPSKFVDRTPENAPWKPYWFGIRDRMYRSGDLGRYLPTGNVECSGRADDQIKIRG 749 P+ G RMYR+GDLGR+L G +E GR D+Q+KIRG Sbjct: 840 AD-------------------PFDGGAHARMYRTGDLGRWLKNGELEYLGRNDEQVKIRG 880 Query: 750 FRIELGEINTHLSRHPNVRENITLVRRDKDEEPTLVAYIVPQGLNKDDFDSATESEDIVV 809 FRIELGEI L+ VRE + + R D + LVAY+V D+ + Sbjct: 881 FRIELGEIEAKLAVCAQVREAVVIAREDNPGDKRLVAYVVA--------DAGRQLS---- 928 Query: 810 NGLKKYRKLIHDIREYLKTKLPSYAIPSVIVPLHKMPLNPNGKIDKPALPFPDTSQLAAA 869 + D+R+ L L Y +PS V L +PL NGK+D+ ALP PD SQ A Sbjct: 929 ---------VADLRDQLLGLLADYMVPSAFVLLDALPLTTNGKLDRKALPAPD-SQAYAR 978 Query: 870 SRSHSKHGVDETLTATERDIRDIWLRIIPHATDVNKKASFFDIGGHSILATRLIFELRKK 929 + G ET+ A +W ++ V + FF++GGHS+LA +LI E ++ Sbjct: 979 RSYEAPEGEVETVLAR------LWAELL-GVEQVGRHDRFFELGGHSLLAVKLI-ERMRQ 1030 Query: 930 FAVNVPLGLVFSEPTIEGLA 949 ++ +G++F +PT+ LA Sbjct: 1031 VKLHADVGVLFGQPTLASLA 1050 Score = 247 bits (630), Expect = 4e-68 Identities = 203/726 (27%), Positives = 330/726 (45%), Gaps = 126/726 (17%) Query: 281 IQDIFASNAAKFPDRECIVVTPSVTIDAPVTSYTYRQIDESSNILAHHLVKNGIERGDVV 340 + +F A + PD +V + TY +++E +N LAH+L K G+E V Sbjct: 1561 VHGLFEEQAQRTPD--------AVAVIRGEQRLTYHELNERANRLAHYLRKQGVEPDSRV 1612 Query: 341 MVYAYRGVDLVVAVMGVLKAGATFSVIDPAYPPARQIIYLSVAKPRALVVLEDAGVLSPT 400 + RG+D+VV ++ +LKAG + +DPAYP R L + P A VL+ T Sbjct: 1613 AICVERGIDMVVGLLAILKAGGGYVPLDPAYPLDRIAYMLDDSAP--------AAVLAQT 1664 Query: 401 VVEYVEKSLELKTYVPALKLAKDGSLTGGSVSKG--ADDILQH--VLHLKSEQTGVVVGP 456 L+L + S+ ++ G D+ +Q+ V L S V+ Sbjct: 1665 AT---------------LELLAEASMPVINLDSGDWQDESVQNPEVTELTSSHLAYVI-- 1707 Query: 457 DSTPTLSFTSGSEGIPKGVKGRHFSLAYYFDWMAQEFNLSESDRFTMLSGIAHDPIQRDI 516 +TSGS G+PKGV H + + W + F+ + + + D + Sbjct: 1708 -------YTSGSTGLPKGVMIEHRNTVNFLTWAHRSFDAQTLSKTLFSTSLNFDLAVYEC 1760 Query: 517 FTPLFLGASLIVPTAEDIGTPGQLAQWANKYKVTVTHLTPAMGQLLAAQAD--EPIPSLH 574 F PL G S+ V T L ++ +T+ + P+ + L E + +++ Sbjct: 1761 FAPLTSGGSIEVVT-------NVLELQQGEHDITLINTVPSALKALLESGGLGEGVDTVN 1813 Query: 575 HAFFVGDILTKRDCLRLQVLANNVNVVNMYGTTETQRSVSYFVVPARSQDQTFLESQKDV 634 A G+ L + L + N+YG +ET S+ + +++D Sbjct: 1814 VA---GEALKRSLVETLFEQTQVKRLCNLYGPSETTTYSSWVSM-----------AREDG 1859 Query: 635 IPA--GRGMKNVQLLVINRFDTNKICGIGEVGEIYLRAGGLAEGYLGNDELTSKKFLKSW 692 A G+ + N Q +++ + + +G GEIY+ G+A GYL D+LT+++FLK Sbjct: 1860 FAAHIGKPVANTQFYLLD--EHKQPVPLGVPGEIYIGGAGVARGYLNRDDLTAERFLKDP 1917 Query: 693 FADPSKFVDRTPENAPWKPYWFGIRDRMYRSGDLGRYLPTGNVECSGRADDQIKIRGFRI 752 F+ T NA RMY++GDLGRYLP GN+E GR DDQ+KIRGFRI Sbjct: 1918 FS--------TTPNA-----------RMYKTGDLGRYLPDGNIEYLGRNDDQVKIRGFRI 1958 Query: 753 ELGEINTHLSRHPNVRENITLVRRDKDEEPTLVAYIVPQGLNKDDFDSATESEDIVVNGL 812 ELGEI L++ PN++E + L R D + LVAY F + E + + L Sbjct: 1959 ELGEIEAKLAQAPNIKETVVLAREDVPGDKRLVAY----------FTQHSPDETVEIEAL 2008 Query: 813 KKYRKLIHDIREYLKTKLPSYAIPSVIVPLHKMPLNPNGKIDKPALPFPDTSQLAAASRS 872 R +L+ +LP+Y +P V L +PL PNGK+D+ ALP PD + Sbjct: 2009 ----------RTHLQAQLPAYMVPVAYVRLDALPLTPNGKLDRKALPAPDLDAVIT---- 2054 Query: 873 HSKHGVDETLTATERDIRDIWLRIIPHATDVNKKASFFDIGGHSILATRLIFELRKKFAV 932 G + TE + IW ++ V + FF++GGHS+LA LI +R+ + Sbjct: 2055 ---RGYEAPQGETETTLAQIWQDVL-KVERVGRHDHFFELGGHSLLAVSLIERMRQA-GL 2109 Query: 933 NVPLGLVFSEPTIEGLAKEIERMKSGEMISV----MDIGKEETREPEIEYGKDALDLVDL 988 + + ++F++PT+ LA + +G ++V + +G E + K + + +D Sbjct: 2110 SADVRILFNQPTLAALAAAV---GTGNEVTVPANLIPLGCEHITPAMLPLAKLSQEAIDR 2166 Query: 989 IPKEFP 994 I P Sbjct: 2167 IVSTVP 2172 Score = 243 bits (619), Expect = 8e-67 Identities = 194/647 (29%), Positives = 299/647 (46%), Gaps = 111/647 (17%) Query: 314 TYRQIDESSNILAHHLVKNGIERGDVVMVYAYRGVDLVVAVMGVLKAGATFSVIDPAYPP 373 TY +++E +N LAH+L K G+E V + RG+D+VV ++ +LKAG + +DPAYP Sbjct: 4806 TYHELNERANRLAHYLRKQGVEPDSRVAICVERGIDMVVGLLAILKAGGGYVPLDPAYPL 4865 Query: 374 ARQIIYLSVAKPRALVVLEDAGVLSPTVVEYVEKSLELKTYVPALKLAKDGSLTGGSVSK 433 R L + P A VL+ T L+L + S+ ++ Sbjct: 4866 DRIAYMLDDSAP--------AAVLAQTAT---------------LELLAEASMPVINLDS 4902 Query: 434 G--ADDILQH--VLHLKSEQTGVVVGPDSTPTLSFTSGSEGIPKGVKGRHFSLAYYFDWM 489 G D+ +Q+ V L S V+ +TSGS G+PKGV H + + W Sbjct: 4903 GDWQDESVQNPEVAELTSSHLAYVI---------YTSGSTGLPKGVMIEHRNTVNFLTWA 4953 Query: 490 AQEFNLSESDRFTMLSGIAHDPIQRDIFTPLFLGASLIVPTAEDIGTPGQLAQWANKYKV 549 + F+ + + + + D + F PL G S+ V T L ++ + Sbjct: 4954 HRSFDDATLSKTLFSTSLNFDLAVYECFAPLTSGGSIEVVT-------NVLELQQGEHDI 5006 Query: 550 TVTHLTPAMGQLLAAQAD--EPIPSLHHAFFVGDILTKRDCLRLQVLANNVNVVNMYGTT 607 T+ + P+ + L E + +++ A G+ L + L + N+YG + Sbjct: 5007 TLINTVPSALKALLESGGLGEGVDTVNVA---GEALKRSLVETLFEQTQVKRLCNLYGPS 5063 Query: 608 ETQRSVSYFVVPARSQDQTFLESQKDVIPA--GRGMKNVQLLVINRFDTNKICGIGEVGE 665 ET S+ + +++D A G+ + N Q +++ + + +G GE Sbjct: 5064 ETTTYSSWVSM-----------AREDGFAAHIGKPVANTQFYLLD--EHKQPVPLGVPGE 5110 Query: 666 IYLRAGGLAEGYLGNDELTSKKFLKSWFADPSKFVDRTPENAPWKPYWFGIRDRMYRSGD 725 IY+ G+A GYL D+LT+++FLK F RT NA RMY++GD Sbjct: 5111 IYIGGAGVARGYLNRDDLTAERFLKDPF--------RTAPNA-----------RMYKTGD 5151 Query: 726 LGRYLPTGNVECSGRADDQIKIRGFRIELGEINTHLSRHPNVRENITLVRRDKDEEPTLV 785 LGRYLP GN+E GR DDQ+KIRGFRIELGEI L++H + E + L R D + LV Sbjct: 5152 LGRYLPDGNIEYLGRNDDQVKIRGFRIELGEIEAKLAQHAALNETVVLAREDVPGDKRLV 5211 Query: 786 AYIVPQGLNKDDFDSATESEDIVVNGLKKYRKLIHDIREYLKTKLPSYAIPSVIVPLHKM 845 AY S ES D I +R YL+ LPSY +P V L + Sbjct: 5212 AYFTQH--------SPDESVD------------IEALRIYLQALLPSYMVPVAYVRLDAL 5251 Query: 846 PLNPNGKIDKPALPFPDTSQLAAASRSHSKHGVDETLTATERDIRDIWLRIIPHATDVNK 905 PL PNGK+D+ ALP PD L G + E + IW ++ V + Sbjct: 5252 PLTPNGKLDRKALPAPDLDALIT-------RGYEAPQGEVEISLAQIWQDVL-KVERVGR 5303 Query: 906 KASFFDIGGHSILATRLIFELRKKFAVNVPLGLVFSEPTIEGLAKEI 952 FF++GGHS+LA LI +R+ ++ + ++F +PT+ LA + Sbjct: 5304 HDHFFELGGHSLLAVTLIERMRQA-GLSADVRVLFGQPTLAALAAAV 5349 Score = 242 bits (617), Expect = 1e-66 Identities = 197/645 (30%), Positives = 300/645 (46%), Gaps = 107/645 (16%) Query: 314 TYRQIDESSNILAHHLVKNGIERGDVVMVYAYRGVDLVVAVMGVLKAGATFSVIDPAYPP 373 TY +++E +N LAH+L K G+E V + RG+D+VV ++ +LKAG + +DPAYP Sbjct: 2659 TYHELNERANRLAHYLRKQGVEPDSRVAICVERGIDMVVGLLAILKAGGGYVPLDPAYPL 2718 Query: 374 ARQIIYLSVAKPRALVVLEDAGVLSPTVVEYVEKSLELKTYVPALKLAKDGSLTGGSVSK 433 R I Y+ L+D+ +P VV +LEL + D L Sbjct: 2719 DR-IAYM----------LDDS---APAVVLAQTATLELLAAASMPVIDLDSGLW------ 2758 Query: 434 GADDILQH--VLHLKSEQTGVVVGPDSTPTLSFTSGSEGIPKGVKGRHFSLAYYFDWMAQ 491 D+ +Q+ V L S V+ +TSGS G+PKGV H + + W + Sbjct: 2759 -QDESVQNPEVAELTSSHLAYVI---------YTSGSTGLPKGVMIEHRNTVNFLTWAHR 2808 Query: 492 EFNLSESDRFTMLSGIAHDPIQRDIFTPLFLGASLIVPTAEDIGTPGQLAQWANKYKVTV 551 F+ + + + D + F PL G S+ V T L ++ +T+ Sbjct: 2809 SFDSQTLAKTLFSTSLNFDLAVYECFAPLTSGGSIEVVT-------NVLELQQGEHDITL 2861 Query: 552 THLTPAMGQLLAAQAD--EPIPSLHHAFFVGDILTKRDCLRLQVLANNVNVVNMYGTTET 609 + P+ + L E + +++ A G+ L + L + N+YG +ET Sbjct: 2862 INTVPSALKALLESGGLGEGVDTVNVA---GEALKRSLVETLFEQTQVKRLCNLYGPSET 2918 Query: 610 QRSVSYFVVPARSQDQTFLESQKDVIPA--GRGMKNVQLLVINRFDTNKICGIGEVGEIY 667 S+ + +++D A G+ + N Q +++ + + +G GEIY Sbjct: 2919 TTYSSWVSM-----------AREDGFAAHIGKPVANTQFYLLD--EHKQPVPLGVPGEIY 2965 Query: 668 LRAGGLAEGYLGNDELTSKKFLKSWFADPSKFVDRTPENAPWKPYWFGIRDRMYRSGDLG 727 + G+A GYL D+LT+++FLK DP V RMY++GDLG Sbjct: 2966 IGGAGVARGYLNRDDLTAERFLK----DPFSAVPNA---------------RMYKTGDLG 3006 Query: 728 RYLPTGNVECSGRADDQIKIRGFRIELGEINTHLSRHPNVRENITLVRRDKDEEPTLVAY 787 RYLP GN+E GR DDQ+KIRGFRIELGEI L++H + E + L R D + LVAY Sbjct: 3007 RYLPDGNIEYLGRNDDQVKIRGFRIELGEIEAKLAQHAALNETVVLAREDVPGDKRLVAY 3066 Query: 788 IVPQGLNKDDFDSATESEDIVVNGLKKYRKLIHDIREYLKTKLPSYAIPSVIVPLHKMPL 847 S ES D I +R YL+ LPSY +P V L +PL Sbjct: 3067 FTQH--------SPDESVD------------IEALRAYLQALLPSYMVPVAYVRLDALPL 3106 Query: 848 NPNGKIDKPALPFPDTSQLAAASRSHSKHGVDETLTATERDIRDIWLRIIPHATDVNKKA 907 PNGK+D+ ALP PD A SR G + TE + IW ++ V + Sbjct: 3107 TPNGKLDRKALPAPDLD--AVISR-----GYEAPQGETETTLAQIWQDLL-GLQQVGRHD 3158 Query: 908 SFFDIGGHSILATRLIFELRKKFAVNVPLGLVFSEPTIEGLAKEI 952 FF++GGHS+LA LI +R+ ++ + ++F +PT+ LA + Sbjct: 3159 HFFELGGHSLLAVTLIERMRQA-GLSADVRILFGQPTLAALAAAV 3202 Score = 242 bits (617), Expect = 1e-66 Identities = 199/661 (30%), Positives = 309/661 (46%), Gaps = 118/661 (17%) Query: 314 TYRQIDESSNILAHHLVKNGIERGDVVMVYAYRGVDLVVAVMGVLKAGATFSVIDPAYPP 373 TY +++ +N +AH L+ G+ D V + R +++VV ++GVLKAGA + +DPAYP Sbjct: 3732 TYCELNARANQVAHRLLALGVCPDDRVAICVERSLEMVVGLLGVLKAGAGYVPVDPAYP- 3790 Query: 374 ARQIIYLSVAKPRALVVLEDAGVLSPTVVEYVEKSLELKTYVPALKLAKDGSLTGGSVSK 433 A +I YL L+D+ ++ +V+ V + L VP + L G SVS Sbjct: 3791 AERIAYL----------LQDSAPVA-VLVQAVTQGLLAAGAVPVINLDNAG-WQDESVSN 3838 Query: 434 GADDILQHVLHLKSEQTGVVVGPDSTPTLSFTSGSEGIPKGVKGRHFSLAYYFDWMAQEF 493 A V L++ V+ +TSGS G+PKGV H +L+ W Q F Sbjct: 3839 PA------VPGLEARHLAYVI---------YTSGSTGLPKGVMVEHRNLSNLVGWHCQAF 3883 Query: 494 NLSESDRFTMLSGIAHDPIQRDIFTPLFLGASLIVPTA----EDIGTPGQLAQWANKYKV 549 N+ R + ++G D +I+ L GA+L++P A ED+G L W + Sbjct: 3884 NVKRGSRTSSVAGFGFDAAAWEIWPSLCAGATLLLPPAHAGSEDVGA---LLDWWQAQAL 3940 Query: 550 TVTHLTPAMGQLLAAQADEPIPSLHHAFF-------VGDILTKRDCLRLQVLANNVNVVN 602 V L P P +AF + +L D LR ++N Sbjct: 3941 DVCFL--------------PTPIAEYAFGRNLGHDQLRTLLIGGDRLRKLPADLPFELIN 3986 Query: 603 MYGTTETQRSVSYFVVPARSQDQTFLESQKDVIPAGRGMKNVQLLVINRFDTNKICGIGE 662 YG TET VV + +++ + V+ G+ + N Q+ +++ + +G Sbjct: 3987 NYGPTETT------VVATSGR----IDASQAVLHIGKPVANTQVYLLDAH--LQPVPVGV 4034 Query: 663 VGEIYLRAGGLAEGYLGNDELTSKKFLKSWFADPSKFVDRTPENAPWKPYWFGIRDRMYR 722 GE+Y+ G+A GYL +LT+++F+K DP V RMYR Sbjct: 4035 AGELYIGGAGVARGYLNRGQLTAERFVK----DPFSLVQDA---------------RMYR 4075 Query: 723 SGDLGRYLPTGNVECSGRADDQIKIRGFRIELGEINTHLSRHPNVRENITLVRRDKDEEP 782 +GDLGRYLP GN++ GR D Q+KIRG RIELGEI L VRE + + + ++ Sbjct: 4076 TGDLGRYLPDGNIDYLGRNDSQLKIRGLRIELGEIEARLGACAGVREAVVVAVGEAPDDQ 4135 Query: 783 TLVAYIVPQGLNKDDFDSATESEDIVVNGLKKYRKLIHDIREYLKTKLPSYAIPSVIVPL 842 LVAY D D A ++ +RE L+ LPS+ +P+ + L Sbjct: 4136 RLVAYYTAH----DTLDQALTAD---------------SLREQLQVHLPSHMVPAAYMCL 4176 Query: 843 HKMPLNPNGKIDKPALPFPDTSQLAAASRSHSKHGVDETLTATERDIRDIWLRIIPHATD 902 +PL PNGK+D ALP PD S ++S G E + IW ++ Sbjct: 4177 DALPLTPNGKLDHRALPVPD-------SEAYSGRGYAAPQGEVETALARIWSELL-KVEQ 4228 Query: 903 VNKKASFFDIGGHSILATRLIFELRKKFAVNVPLGLVFSEPTIEGLAKEIERMKSGEMIS 962 V + +FF++GGHS++A LI E ++ ++ + ++FS+PT+ LA + SG +S Sbjct: 4229 VGRYDNFFELGGHSLMAVSLI-ERMRQVGLSADVRVLFSQPTLAALAAAV---GSGSEVS 4284 Query: 963 V 963 V Sbjct: 4285 V 4285 Score = 237 bits (604), Expect = 4e-65 Identities = 191/644 (29%), Positives = 297/644 (46%), Gaps = 91/644 (14%) Query: 314 TYRQIDESSNILAHHLVKNGIERGDVVMVYAYRGVDLVVAVMGVLKAGATFSVIDPAYPP 373 TY +++ +N LAH L+ GI D V + RG+D++V ++G+LK+GA + +DPA P Sbjct: 8031 TYGELNAQANQLAHRLLSLGIRPDDRVAICVERGLDMLVGLLGILKSGAGYVPLDPASPA 8090 Query: 374 ARQIIYLSVAKPRALVVLEDAGVLSPTVVEYVEKSLELKTYVPALKLAKDGSLTGGSVSK 433 R L + P A+VV L + E + +EL + PAL+ + Sbjct: 8091 ERIAYMLEDSAPVAIVVHAATQAL---LAEESVRLIELDS--PALRSQSTAN-------- 8137 Query: 434 GADDILQHVLHLKSEQTGVVVGPDSTPTLSFTSGSEGIPKGVKGRHFSLAYYFDWMAQEF 493 V S Q V+ +TSGS G+PKGV H ++A F F Sbjct: 8138 ------PQVPGQTSSQLAYVI---------YTSGSTGLPKGVMVEHRNVARLFSATQPWF 8182 Query: 494 NLSESDRFTMLSGIAHDPIQRDIFTPLFLGASLIVPTAEDIGTPGQLAQWANKYKVTVTH 553 + D + + A D +I+ L G L+V +P + + VTV + Sbjct: 8183 EFGQQDVWALFHSFAFDFSVWEIWGALIHGGRLLVVPQLVSRSPQECYALLCQAGVTVLN 8242 Query: 554 LTP-AMGQLLAAQADEPIP-SLHHAFFVGDILTKRDCLR---LQVLANNVNVVNMYGTTE 608 TP A QL+ AQ + + SL F G+ L + L+ +V +VNMYG TE Sbjct: 8243 QTPSAFRQLIVAQGESDLRHSLRQVIFGGEAL-ETAMLKPWYARVANAGTQLVNMYGITE 8301 Query: 609 TQRSVSYFVVPARSQDQTFLESQKDVIPAGRGMKNVQLLVINRFDTNKICGIGEVGEIYL 668 T V+Y + A T V P G+ + ++QL V++ + +G VGE+Y+ Sbjct: 8302 TTVHVTYRPLEAADAQLT------GVSPIGKRIPDLQLYVLDA--RREPVPVGVVGEMYV 8353 Query: 669 RAGGLAEGYLGNDELTSKKFLKSWFADPSKFVDRTPENAPWKPYWFGIRDRMYRSGDLGR 728 G+A GYL ELT ++F+ F+ G R+YR+GDLGR Sbjct: 8354 GGAGVARGYLNRPELTQERFIADTFSG-------------------GEGARLYRTGDLGR 8394 Query: 729 YLPTGNVECSGRADDQIKIRGFRIELGEINTHLSRHPNVRENITLVRRDKDEEPTLVAYI 788 +L G++E GR DDQ+KIRGFRIELGEI L+ V + + + R D + LV Y+ Sbjct: 8395 WLADGSIEYLGRNDDQVKIRGFRIELGEIEATLAACEGVSDALVIAREDAPGDKRLVGYV 8454 Query: 789 VPQGLNKDDFDSATESEDIVVNGLKKYRKLIHDIREYLKTKLPSYAIPSVIVPLHKMPLN 848 + D A + L ++R L L Y +PS V L PL Sbjct: 8455 IAA-------DGA--------------QLLAAELRAQLLASLADYMVPSAFVVLEAFPLT 8493 Query: 849 PNGKIDKPALPFPDTSQLAAASRSHSKHGVDETLTATERDIRDIWLRIIPHATDVNKKAS 908 NGK+D+ ALP PD S +A + + + A +D+ D+ + + + Sbjct: 8494 TNGKLDRKALPAPDQSAVATREYEAPQGETETVIAAIWQDLLDL--------ERIGRHDN 8545 Query: 909 FFDIGGHSILATRLIFELRKKFAVNVPLGLVFSEPTIEGLAKEI 952 FF++GGHS+LA +L+ E + ++ + ++F +PT+ LA + Sbjct: 8546 FFELGGHSLLAVKLL-ERMRHVGLSADVRVLFGQPTLAALAAAV 8588 Score = 228 bits (580), Expect = 3e-62 Identities = 185/645 (28%), Positives = 295/645 (45%), Gaps = 104/645 (16%) Query: 314 TYRQIDESSNILAHHLVKNGIERGDVVMVYAYRGVDLVVAVMGVLKAGATFSVIDPAYPP 373 +Y Q++E +N LAHHL+ G++ D V + R ++L+ + + +LK A + +D Sbjct: 6956 SYAQLNEQANRLAHHLIGLGVQPDDCVAILLPRSIELLASQLAILKCAAAYVPLDRNASL 7015 Query: 374 ARQIIYLSVAKPRALVVLEDAGVLSPTVVEYVEKSLELKTYVPALKLAKDGSLTGGSVSK 433 RQ L + + L+ + TV E + ++L T L G + ++++ Sbjct: 7016 ERQGFMLDDCQAKCLLTFS-----TETVPEGASR-IDLDT------LDSQGPVHNPALAQ 7063 Query: 434 GADDILQHVLHLKSEQTGVVVGPDSTPTLSFTSGSEGIPKGVKGRHFSLAY------YFD 487 ++ I + +TSGS G PKGV H ++ Y D Sbjct: 7064 SSESIAY---------------------IMYTSGSTGQPKGVLVPHRAINRLVINNGYAD 7102 Query: 488 WMAQEFNLSESDRFTMLSGIAHDPIQRDIFTPLFLGASLIVPTAEDIGTPGQLAQWANKY 547 + AQ DR S A D D++ L G ++V E + P + AQ Sbjct: 7103 FNAQ-------DRIAFASNPAFDASTMDVWGALLNGGQVVVIDHETLLEPSRFAQVLQDS 7155 Query: 548 KVTVTHLTPAMGQLLAAQADEPIPSLHHAFFVGDILTKRDCLRLQVLANNVNVVNMYGTT 607 VTV +T A+ + + L G+ RL LA + +V+ YG T Sbjct: 7156 GVTVLFVTTAIFNQYVQLIPQALGGLRILLCGGERADVASFRRLLDLAPGLRLVHCYGPT 7215 Query: 608 ETQRSVSYFVVPARSQDQTFLESQKDVIPAGRGMKNVQLLVINRFDTNKICGIGEVGEIY 667 ET + V A + D + +P G + N Q+ V++ ++ +G VGE+Y Sbjct: 7216 ETTTYATTLEVKAVALD-------AECVPIGGPIGNTQVYVLDA--RQQLAPLGVVGEMY 7266 Query: 668 LRAGGLAEGYLGNDELTSKKFLKSWFADPSKFVDRTPENAPWKPYWFGIRDRMYRSGDLG 727 + G+A+GYL +LT++KF+ ADP P+ +YR+GDLG Sbjct: 7267 IGGQGVAKGYLNRPDLTAEKFI----ADP---FSHEPDAL------------LYRTGDLG 7307 Query: 728 RYLPTGNVECSGRADDQIKIRGFRIELGEINTHLSRHPNVRENITLVRRDKDEEPTLVAY 787 R+LP G++EC GR DDQ+KIRGFRIELGEI L VR+ + LVR D+ E LVAY Sbjct: 7308 RWLPEGSLECLGRNDDQVKIRGFRIELGEIEAKLVACDGVRDAVVLVRADETGEKRLVAY 7367 Query: 788 IVPQGLNKDDFDSATESEDIVVNGLKKYRKLIHDIREYLKTKLPSYAIPSVIVPLHKMPL 847 ++ Q + V GL RE L + L Y +P+ V L PL Sbjct: 7368 VIAQ-----------PQVTLSVAGL----------REQLSSTLSEYMVPAAFVMLPAFPL 7406 Query: 848 NPNGKIDKPALPFPDTSQLAAASRSHSKHGVDETLTATERDIRDIWLRIIPHATDVNKKA 907 NGK+D+ ALP PD A+ + + + +++ L +W ++ + + Sbjct: 7407 TLNGKVDRKALPAPDAEAYASQAYAAPQGEIEQVLAG-------MWAELL-KVERIGRHD 7458 Query: 908 SFFDIGGHSILATRLIFELRKKFAVNVPLGLVFSEPTIEGLAKEI 952 FF++GGHS+LA LI +R+ ++ + ++FS+PT+ LA I Sbjct: 7459 HFFELGGHSLLAVTLIERMRQA-GLSADVRVLFSQPTLAALAAAI 7502 Score = 226 bits (576), Expect = 8e-62 Identities = 191/654 (29%), Positives = 297/654 (45%), Gaps = 100/654 (15%) Query: 312 SYTYRQIDESSNILAHHLVKNGIERGDVVMVYAYRGVDLVVAVMGVLKAGATFSVIDPAY 371 S TY +++ +N LA HLV G+ GD V + R ++L+V+ + +LK A + +D + Sbjct: 5877 SLTYAELNHRANRLARHLVGLGVRPGDRVAIALERSLELLVSQLAILKCAAVYVPLDVSA 5936 Query: 372 PPARQIIYLSVAKPRALVVLEDAGVLSPTVVEYVEKSLELKTYVPALKLAKDGSLTGGSV 431 P RQ V A VVL ++P + V+ L T V Sbjct: 5937 PLERQ--QFMVQDSGAQVVLTSGTAVAPEASQRVD----LDTLV---------------F 5975 Query: 432 SKGADDILQHVLHLKSEQTGVVVGPDSTPTLSFTSGSEGIPKGVKGRHFSL-AYYFDWMA 490 ++ AD+ LHL Q+G +S + +TSGS G PKGV H ++ + Sbjct: 5976 NEAADN-----LHLT--QSG-----ESVAYIMYTSGSTGTPKGVLVPHRAINRLVINNGY 6023 Query: 491 QEFNLSESDRFTMLSGIAHDPIQRDIFTPLFLGASLIVPTAEDIGTPGQLAQWANKYKVT 550 EFN DR S A D D++ PL G ++V + + T + A + V+ Sbjct: 6024 AEFNAQ--DRVAFASNPAFDASTLDVWAPLLNGGCVVVVDQDVLLTQERFAALLQEQSVS 6081 Query: 551 VTHLTPAMGQLLAAQADEPIPSLHHAFFVGDILTKRDCLRLQVLANNVNVVNMYGTTETQ 610 V +T + AA L + GD+L R+ ++++N YG TE Sbjct: 6082 VLWMTAGLFHQYAAGLMSVFAQLRYLIVGGDVLDPAVIGRVLKEGAPLHLLNGYGPTEAT 6141 Query: 611 RSVSYFVVPARSQDQTFLESQKDVIPAGRGMKNVQLLVINRFDTNKICGIGEVGEIYLRA 670 + + + + IP GR + N ++ V++ + IG GE+Y+ Sbjct: 6142 TFTTTHEIKSVGEGG---------IPIGRPIGNTRVYVLDA--NQQPVPIGVAGELYIGG 6190 Query: 671 GGLAEGYLGNDELTSKKFLKSWF-ADPSKFVDRTPENAPWKPYWFGIRDRMYRSGDLGRY 729 G+A+GYL EL+++KF+ F ADP +YR+GDL R+ Sbjct: 6191 DGVAKGYLNRPELSAEKFVADPFNADPGAL--------------------LYRTGDLARW 6230 Query: 730 LPTGNVECSGRADDQIKIRGFRIELGEINTHLSRHPNVRENITLVRRDKDEEPTLVAYIV 789 G V+ GR DDQ+KIRGFRIELGEI L +H V++ + LVR D E LVAY Sbjct: 6231 RADGTVDYLGRNDDQVKIRGFRIELGEIEARLGQHDEVKDVVVLVREDVPGEKRLVAYFT 6290 Query: 790 PQGLNKDDFDSATESEDIVVNGLKKYRKLIHDIREYLKTKLPSYAIPSVIVPLHKMPLNP 849 P+ D D A I +R +L+ +LP Y IP + L +PL Sbjct: 6291 PR-----DLDVAPH---------------IETLRTHLQGQLPDYMIPVAYIRLDTLPLTA 6330 Query: 850 NGKIDKPALPFPDTSQLAAASRSHSKHGVDETLTATERDIRDIWLRIIPHATDVNKKASF 909 NGK+D+ ALP PD+ + + V++ L +W ++ V + +F Sbjct: 6331 NGKLDRRALPAPDSEAYVSREYEAPQGEVEQLLA-------QLWAELL-RVEQVGRHDNF 6382 Query: 910 FDIGGHSILATRLIFELRKKFAVNVPLGLVFSEPTIEGLAKEIERMKSGEMISV 963 F++GGHS+LA LI E ++ ++ + ++F +PT+ LA I SG +SV Sbjct: 6383 FELGGHSLLAVTLI-ERMRQVGLSADVRVLFGQPTLAALAAAI---GSGREVSV 6432 Lambda K H 0.318 0.136 0.395 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 1 Number of Hits to DB: 25,409 Number of extensions: 1183 Number of successful extensions: 60 Number of sequences better than 1.0e-02: 1 Number of HSP's gapped: 13 Number of HSP's successfully gapped: 8 Length of query: 1419 Length of database: 8596 Length adjustment: 63 Effective length of query: 1356 Effective length of database: 8533 Effective search space: 11570748 Effective search space used: 11570748 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 67 (30.4 bits)
This GapMind analysis is from Sep 17 2021. The underlying query database was built on Sep 17 2021.
Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.
A candidate for a step is "high confidence" if either:
Otherwise, a candidate is "medium confidence" if either:
Other blast hits with at least 50% coverage are "low confidence."
Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:
GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
For more information, see:
If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know
by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory