Align Putative phosphoglycerate dehydrogenase (EC:1.1.1.95) (characterized)
to candidate WP_066923369.1 ACG33_RS12360 FAD-linked oxidase
Query= reanno::HerbieS:HSERO_RS19500 (1333 letters) >NCBI__GCF_001579945.1:WP_066923369.1 Length = 1315 Score = 1904 bits (4933), Expect = 0.0 Identities = 933/1314 (71%), Positives = 1088/1314 (82%), Gaps = 14/1314 (1%) Query: 20 PPRLREIPYNYTSFSDREIVIRLLGEESWSLLDELRGKRQTGRSARMLYEVLGDIWVVRR 79 P RLREIPYNYTSFSDREIVIRLLGE +W LL+ELRG+R+TGRSARMLYE LGDIWVV+R Sbjct: 16 PARLREIPYNYTSFSDREIVIRLLGEPAWDLLNELRGERRTGRSARMLYEALGDIWVVQR 75 Query: 80 NPYLQDDLLDNPKRRAALIEALNHRLGEVDKRRLATDQAEAGDAEAQRRSASVEALLKAA 139 NPYLQDDL+D+ KRR L+EAL HRL EV++RR TD AE R+ V LL+AA Sbjct: 76 NPYLQDDLVDSAKRRRQLVEALQHRLNEVERRRDDTD------AERDRK---VGELLRAA 126 Query: 140 KKAIADFAEEFRQTYDLRKRATKVLGRFTAKDNIKFDGLSRVSHVTDATDWRVEYPFVVL 199 ++A+ FA+ Q LRKRA ++L R T DNI+ D +RVSHVTDATDWRVEYPF+VL Sbjct: 127 REAVESFADGLEQVASLRKRARRLLERHTRADNIRLDAFARVSHVTDATDWRVEYPFLVL 186 Query: 200 TPDTEDEMAGLVKACIELGLTIIPRGGGTGYTGGAIPLTPMSAVINTEKLEQLGAVEMEI 259 +PD EDE+ LV+ CIELGLTIIPRGGGTGYTGGA+PLTP +AVINTEKLE LG VE Sbjct: 187 SPDGEDEIPALVRGCIELGLTIIPRGGGTGYTGGAVPLTPRAAVINTEKLEALGKVETRP 246 Query: 260 LPGLDKPYATIYSGAGVVTKRVSDAAEKAGFVFAVDPTSAEASCIGGNVAMNAGGKKAVL 319 LPG +I +GAGVVT+RV++AA++AG VFAVDPTSA+ASCIGGNVAMNAGGKKAVL Sbjct: 247 LPGTTDDAPSILTGAGVVTRRVAEAADRAGLVFAVDPTSADASCIGGNVAMNAGGKKAVL 306 Query: 320 WGTALDNLASWRMVDPQGDWLEVTRLDHNLGKIHDVEVARFKLEWSHPGEKGQKTEVFKT 379 WGTA+DNL WRMVDP G+WLEVTRL+H+ GKIH+V+ A+F+L W V +T Sbjct: 307 WGTAVDNLLWWRMVDPDGNWLEVTRLEHDRGKIHEVDQAKFELVWKDGRAAPAHARVLRT 366 Query: 380 EILEISGKKFRKEGLGKDVTDKFLSGLPGVQKEGCDGLITSARWILHKMPKQTRTVCLEF 439 ++ I G++FRK GLGKDVTDKFL GLPGVQKEGCDG+ITSARW+LH+MP RTVCLEF Sbjct: 367 GVVAIEGRRFRKAGLGKDVTDKFLGGLPGVQKEGCDGVITSARWLLHRMPNHVRTVCLEF 426 Query: 440 FGQARDAIPSIVEIKDYLDAETKKGGAILAGLEHLDERYLRAVGYATKSKRGVLPKMVLI 499 FG AR AIPSIVEIK +LDA+ + GG LAGLEHLDERYLRAVGY+T+SKRG LPKMVL+ Sbjct: 427 FGHARHAIPSIVEIKSHLDAQARSGGVQLAGLEHLDERYLRAVGYSTRSKRGALPKMVLL 486 Query: 500 GDIVGDDENAVAAAASEVIRMANNRVGEGFVAVSPEARKKFWLDRSRTAAIAKHTNAFKI 559 GDIVGDDE+AVA A SEV+R+AN+R GEGFVAV+PE+R++FWLDRSRTAAIAKHTNAFKI Sbjct: 487 GDIVGDDEDAVARATSEVVRIANSRSGEGFVAVNPESRRQFWLDRSRTAAIAKHTNAFKI 546 Query: 560 NEDVVIPLNRMGEYTDGIERINIELSIKNKLQLLAELDSFFVKGNLPLGKSDDAEGDDIP 619 NEDVVIPL+RMG+YTD IERINIELS KNKL+LL EL +F L LGK +D + + Sbjct: 547 NEDVVIPLDRMGDYTDAIERINIELSFKNKLKLLDELLEYF-STELTLGKIEDPDLEHAS 605 Query: 620 AAEMLEDRVHQAESLLEQTHARWSYLLANLDKPLGEAKGELAALGLEKMLPVFEQRLVDQ 679 E L +R QA +LEQ ARW YL +LD PL +A LAALG E + FE+RL + Sbjct: 606 RDERLGERQRQARHVLEQARARWRYLYEHLDTPLAQALPALAALGFEPLRGDFERRLERE 665 Query: 680 PEAAVFHVVQDRTVRISWKQEVRAQLRQIFSGAAFKLILEECQAIHKRVLRGRVFVALHM 739 PEA +F +VQDR+VR SWK EVRA LR+ F G+ F ++E +AIHKRVLRGRV++ALHM Sbjct: 666 PEARIFDLVQDRSVRTSWKTEVRAHLRRSFVGSTFAPVIEAVEAIHKRVLRGRVWIALHM 725 Query: 740 HAGDGNVHTNIPVNSDHYEMLQDAHVAVARIMKLARSLNGVISGEHGIGITKLEFLTEDE 799 HAGDGNVHTNIPVNSD YEMLQ+A+ AV RIM++ARSL GVISGEHGIGITKLEFL+ E Sbjct: 726 HAGDGNVHTNIPVNSDDYEMLQEANRAVKRIMQIARSLGGVISGEHGIGITKLEFLSAAE 785 Query: 800 IGEFREYKKRVDPEGRFNKGKLLNLPGMEADLSNAYTPSFGLMGHESLIMQQSDIGAIAS 859 +FR YK R+DP+G FN+GKL+ PG DL NAYTPSF LMGHESLIMQQSDI +IA+ Sbjct: 786 TADFRRYKNRIDPDGHFNRGKLM--PG--GDLRNAYTPSFKLMGHESLIMQQSDIASIAA 841 Query: 860 SVKDCLRCGKCKPVCSTHVPRANLLYSPRNKILATSLLVEAFLYEEQTRRGVSIKHWEEF 919 +VKDCLRCGKCKPVC+THVPRANLLYSPRNKILATSLL+EAFLYEEQTRRGVSI+H++EF Sbjct: 842 AVKDCLRCGKCKPVCATHVPRANLLYSPRNKILATSLLIEAFLYEEQTRRGVSIRHFDEF 901 Query: 920 EDVADHCTVCHKCVTPCPVDIDFGDVSMNMRNLLRKMGKKSFNAGTNAAMFFLNATDPAT 979 DVA+HCT+CHKC PCPVDIDFGDVSM MR+LLR+MGKK FN GT A+MFFLNAT P T Sbjct: 902 ADVAEHCTLCHKCEKPCPVDIDFGDVSMAMRDLLRRMGKKRFNPGTAASMFFLNATHPET 961 Query: 980 INATRKVMTQWGFKAQRLGNDLMKKFAKKQTQKPPATVGKPPVKEQVIHFINKKMPGNLP 1039 I TRKVM +WGFKAQRL M FA QT +PPAT GKPPV EQV+HF+NKKMP LP Sbjct: 962 IKLTRKVMLEWGFKAQRLAGRAMSVFASPQTARPPATTGKPPVHEQVVHFVNKKMPAGLP 1021 Query: 1040 KKTARALLDIEDDKIVPIIRNPKTTTADTEAVFYFPGCGSERLFSQVGLATQAMLWNVGV 1099 K+TARALLDIED VPIIR+P T++++EAVFYFPGCGSERLFSQVGLATQAMLW +GV Sbjct: 1022 KRTARALLDIEDRNYVPIIRDPARTSSESEAVFYFPGCGSERLFSQVGLATQAMLWQIGV 1081 Query: 1100 QTVLPPGYLCCGYPQRGTGDFEKGEKIITDNRVLFHRMANTLNYLDIKTVVVSCGTCYDQ 1159 QTVLPPGY+CCGYPQRG GDF K +K+ITDNRVLFHR++ TLNYLDI+TVVVSCGTCYDQ Sbjct: 1082 QTVLPPGYMCCGYPQRGNGDFSKADKMITDNRVLFHRVSTTLNYLDIRTVVVSCGTCYDQ 1141 Query: 1160 LQGYEFEKIFPGCRIIDIHEYLLEKGVKLEGVTGTRYMYHDPCHSPMKQQDPLKTVNSLI 1219 LQ Y+FE+IFPGCRI+DIHE+L+E+GV+L G TG RYMYHDPCH+P+K DP++ VN+LI Sbjct: 1142 LQDYKFEEIFPGCRIVDIHEFLMERGVRLAGATGVRYMYHDPCHTPIKTHDPMQVVNTLI 1201 Query: 1220 TTIDAQKIEKNDRCCGESGTFGVSRPDVSTQVRFRKEEEMRKGSDKVRADGFTGDVKILT 1279 ++EKN+RCCGESGTF +SRPDV+TQVRFRKEEEMR G+ K+RADGF G+VKILT Sbjct: 1202 NEQVNGRVEKNERCCGESGTFAISRPDVATQVRFRKEEEMRAGAQKLRADGFQGEVKILT 1261 Query: 1280 SCPSCFQGLSRYNEDAGTTADYIVVEMARHLLGENWMPEYVERANNGGIERILV 1333 SCPSC QGL RYN+DA T ADYIV+E+ARH+LGE W+PEYV+ AN GGIER+LV Sbjct: 1262 SCPSCMQGLKRYNDDAATEADYIVIEIARHVLGERWLPEYVKAANAGGIERVLV 1315 Lambda K H 0.319 0.136 0.402 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 1 Number of Hits to DB: 4213 Number of extensions: 159 Number of successful extensions: 4 Number of sequences better than 1.0e-02: 1 Number of HSP's gapped: 1 Number of HSP's successfully gapped: 1 Length of query: 1333 Length of database: 1315 Length adjustment: 49 Effective length of query: 1284 Effective length of database: 1266 Effective search space: 1625544 Effective search space used: 1625544 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 59 (27.3 bits)
This GapMind analysis is from Apr 10 2024. The underlying query database was built on Apr 09 2024.
Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.
A candidate for a step is "high confidence" if either:
Otherwise, a candidate is "medium confidence" if either:
Other blast hits with at least 50% coverage are "low confidence."
Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:
GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
For more information, see:
If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know
by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory