Align Glycine betaine/proline/choline/ectoine transporter VP1456 (characterized)
to candidate WP_019895193.1 A377_RS0105425 BCCT family transporter
Query= SwissProt::Q87PP5 (562 letters) >NCBI__GCF_000384235.1:WP_019895193.1 Length = 543 Score = 358 bits (918), Expect = e-103 Identities = 196/499 (39%), Positives = 299/499 (59%), Gaps = 16/499 (3%) Query: 67 NPVFGISAGLVVFCLI---SLLLVEPVTARDALNGIKNGIIEQFDAFFMWSTNFFLLFAV 123 NP+ +SA +V + S E AL G N +E + ++ F L F + Sbjct: 14 NPLLTVSALFLVVLSVMAGSFYTDESALLISALRGALNPFLEWY---YVVLVAFLLFFMI 70 Query: 124 GLLFSPLGKIRLGGKEATPDHSTVSWLSMLFAAGMGIGLLFWSVAEPTAYFTDWWGTPLN 183 L +RLG P+ + SW+SMLFAAG G+G+LFWSVA+P F + ++ Sbjct: 71 WLGVGRYKNVRLGRDFEKPEFTFFSWVSMLFAAGTGVGILFWSVAQPILQFRE--NPFID 128 Query: 184 AEAYSADAKSLAMGATMFHWGVHGWSIYALVALALAFFAFNKGLPLSLRAAFYPIFGDRA 243 A ADA +A+ T FHWG++GW+I++ VALA+ +FAF LPL++R+ YPI +R Sbjct: 129 VAATDADAAVVALRLTFFHWGINGWAIFSFVALAMGYFAFRHDLPLTVRSVLYPILKERV 188 Query: 244 WGWLGHVIDILAVLSTLFGLATSLGLGAQQATSGINHVFGLNGGIGTQMVVIAFVTFIAV 303 G G ++D+LAV +T+FG+AT+LGLG QQ +G+ VFG Q++VI + IA Sbjct: 189 HGRWGDLVDLLAVFATIFGIATTLGLGVQQMNAGLESVFGWETTTQLQLIVIGVIMTIAT 248 Query: 304 LSVVRGIDGGVKLLSNVNMIVAFALLIFI-TFITFDTAMGSLVDTTMAYIQNIIPLS-NP 361 +S+V G+ GV+LLS +N ++ A ++FI T +G V+ T YIQN++ ++ + Sbjct: 249 VSIVSGVSKGVRLLSEMNFWLSIAAVVFILTLGPTQYLIGLTVEATGDYIQNLLRMTFHT 308 Query: 362 HGREDETWMHGWTVFYWAWWVSWSPFVGMFIARVSKGRTVREFLFAVIVIPTLVTLVWMS 421 + W WTVF+W WW++WSPFVGMFIAR+S+GRT REF+ V+++PTL+T+VW+ Sbjct: 309 NVTHQTEWQADWTVFFWGWWLAWSPFVGMFIARISRGRTFREFVMGVLLVPTLITIVWIG 368 Query: 422 VFGGIALDQVVNKVGELGANGLTDISLTLFHVYDVLP---YSSVISILSIVLILVFFITS 478 +FGG AL + G + A ++S L+ + L +++S L IVLI F ITS Sbjct: 369 LFGGTALFHELFGGGGVVAAVENNVSTALYVTIERLDLGWIGTMVSGLLIVLIGTFLITS 428 Query: 479 SDSGSLVIDSITAGGKIDAPVPQRIFWACIEGSIAAVMLWVGGKEALQALQSGVVATGLP 538 +++G+LV+++I +GG + P RI W G + A +L +GG L+ LQS V+A +P Sbjct: 429 ANAGTLVVNTILSGGDSNPPTLHRIIWGIFLGGLTATLLVLGG---LETLQSAVIAAAVP 485 Query: 539 FTFVLLLMCVSLVKGLRTE 557 F+ ++++M V LVK L E Sbjct: 486 FSVIVVMMIVGLVKALEDE 504 Lambda K H 0.327 0.141 0.445 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 1 Number of Hits to DB: 932 Number of extensions: 49 Number of successful extensions: 5 Number of sequences better than 1.0e-02: 1 Number of HSP's gapped: 1 Number of HSP's successfully gapped: 1 Length of query: 562 Length of database: 543 Length adjustment: 36 Effective length of query: 526 Effective length of database: 507 Effective search space: 266682 Effective search space used: 266682 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.7 bits) S2: 53 (25.0 bits)
This GapMind analysis is from Sep 24 2021. The underlying query database was built on Sep 17 2021.
Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.
A candidate for a step is "high confidence" if either:
Otherwise, a candidate is "medium confidence" if either:
Other blast hits with at least 50% coverage are "low confidence."
Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:
GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
For more information, see:
If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know
by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory