Align Exo-alpha-(1->6)-L-arabinopyranosidase; APY; Beta-D-galactopyranosidase; EC 3.2.1.- (characterized)
to candidate Ga0059261_0769 Ga0059261_0769 Beta-glucosidase-related glycosidases
Query= SwissProt::E7CY69 (757 letters) >FitnessBrowser__Korea:Ga0059261_0769 Length = 838 Score = 285 bits (728), Expect = 9e-81 Identities = 164/400 (41%), Positives = 238/400 (59%), Gaps = 34/400 (8%) Query: 12 LTLEEKASLTSGGDAWHLQGVESKGIPSYMITDGPHGLRKSLASSAGETDLDDSVPATCF 71 LTL+EK L +G + L + GIP+ +TDGP G+R D PAT F Sbjct: 44 LTLDEKILLLAGESSMALNPIPRLGIPAVKMTDGPTGVRSP-----------DGKPATVF 92 Query: 72 PPAAGLSSSWNPELIHKVGEAMAEECIQEKVAVILGPGVNIKRNPLGGRCFEYWSEDPYL 131 P L++SWNP+L VG A+A E V+L P VNI R P GR FE +SEDP+L Sbjct: 93 PVGVALAASWNPDLAATVGAAVARETKAHGADVLLAPTVNIVRTPRWGRNFETYSEDPWL 152 Query: 132 AGHEAIGIVEGVQSKGVGTSLKHFAANNQETDRLRVDARISPRALREIYFPAFEHIVKKA 191 +G +G V GVQ +G+G SLKHFA NNQE+ R VD+ + R L EIY PAFE +V++A Sbjct: 153 SGRMGLGYVRGVQGEGIGVSLKHFAVNNQESYRFVVDSIVDQRTLHEIYLPAFEMVVREA 212 Query: 192 QPWTIMCSYNRINGVHSAQNHWLLTDVLRDEWGFDGIVMSDWGADHDRGASLNAGLNLEM 251 PW++M SYN+ING ++A+N WLLTD+L+ EWG+ G V+SDWGA H ++NAG++LEM Sbjct: 213 DPWSVMASYNKINGTYAAENRWLLTDLLKKEWGYRGFVVSDWGATHTTAEAVNAGMDLEM 272 Query: 252 --PPSYTDDQIVYAVRDGLITPAQLDRMAQGMIDLVNKTRAAMSIDNYRFDVDA-----H 304 PP + D++ AV +G ++ AQ+D A+ M+ L+ ++ I+ +A Sbjct: 273 PGPPKHFGDKLKAAVAEGKVSTAQIDDNARRMVRLIVRSGV---IERGASRPEAATPLRQ 329 Query: 305 DEVAHQAAIESIVMLKNDDAILPLNAGPVANPSATPQKIAVIGEFARTPRYQGGGSSHIT 364 +A +AA E++V+LKN + +LP + + + +AVIG A R QGGGSS++ Sbjct: 330 SAIARKAAEEAMVLLKN-NGVLPFD--------RSIRSLAVIGPNADVVRMQGGGSSNVV 380 Query: 365 PTKMTSFLDTL--AERGIKADFAPGFTLDLEPADPALESE 402 P + + LD L A G++ + G +D E PA +++ Sbjct: 381 PFETQTPLDALRAALPGVRITYEKG--VDNEETPPAADAK 418 Score = 183 bits (465), Expect = 3e-50 Identities = 109/302 (36%), Positives = 156/302 (51%), Gaps = 14/302 (4%) Query: 380 IKADFAPGFT------LDLEPADPALESEAVETAKNADVVLMFLGLPEAVESEGFDRDTL 433 I+ D+A G T + P+ ++ AV AK AD ++ +G E EG+DR + Sbjct: 542 IRIDYANGGTPYENLSFGVREPQPSFDA-AVAAAKGADAAIVIVGSSSTTEGEGYDRANI 600 Query: 434 DMPAKQIALLEQVAAANQNVVVVLSNGSVITVAPWAKNAKGILESWLLGQSGGPALADVI 493 D+P +Q L+E +AAAN VV++ G+ +T+ PW + A I+ WL G+ G ALADV+ Sbjct: 601 DLPGEQNRLVEAIAAANPKTAVVVNAGAAMTM-PWHEKAPAIVSMWLPGEGGAAALADVL 659 Query: 494 FGQVSPSGKLAQSIPLDINDDPSMLNWPGEEGHVDYGEGVFAGYRYYDTYGKAVDYPFGY 553 G+V+PSGKL + P DD + L + Y EG+ GYR Y G +PFG+ Sbjct: 660 TGKVNPSGKLPVTFPQRSGDDLADL----KTSKSAYSEGLLVGYRGYQARGVKPLFPFGH 715 Query: 554 GLSYATFEIT--GVAVAKTGANTATVTATVTNTSDVDAAETVQVYVVPGKADVARPKHEL 611 GLSY +F + V +GA TV TV NT E VQ+YV P + P L Sbjct: 716 GLSYTSFAYSPLNVPARASGAKPVTVKLTVRNTGKRTGQEVVQLYVEPVEPMAGDPPRTL 775 Query: 612 KGFTKAFLKAGESKTVAIDLDERAFAYWSEKYNDWHVEAGEYAIEVGVSSRDIADTVAVA 671 + F K + AG + V + LD RAF+++ K W V G Y + G SS DI + Sbjct: 776 RAFQKVAIPAGGKREVTLTLDPRAFSFYDVKAGGWRVRPGAYRVLAGSSSEDIRQAGEIR 835 Query: 672 LD 673 +D Sbjct: 836 VD 837 Lambda K H 0.315 0.132 0.393 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 1 Number of Hits to DB: 1564 Number of extensions: 73 Number of successful extensions: 8 Number of sequences better than 1.0e-02: 1 Number of HSP's gapped: 2 Number of HSP's successfully gapped: 2 Length of query: 757 Length of database: 838 Length adjustment: 41 Effective length of query: 716 Effective length of database: 797 Effective search space: 570652 Effective search space used: 570652 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.0 bits) S2: 55 (25.8 bits)
This GapMind analysis is from Sep 17 2021. The underlying query database was built on Sep 17 2021.
Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.
A candidate for a step is "high confidence" if either:
Otherwise, a candidate is "medium confidence" if either:
Other blast hits with at least 50% coverage are "low confidence."
Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:
GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
For more information, see:
If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know
by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory