Align Bifunctional aspartokinase/homoserine dehydrogenase 1, chloroplastic; AK-HD 1; AK-HSDH 1; Beta-aspartyl phosphate homoserine 1; EC 2.7.2.4; EC 1.1.1.3 (characterized)
to candidate WP_083491847.1 ABB28_RS00495 bifunctional aspartate kinase/homoserine dehydrogenase I
Query= SwissProt::Q9SA18 (911 letters) >NCBI__GCF_001431535.1:WP_083491847.1 Length = 837 Score = 648 bits (1671), Expect = 0.0 Identities = 365/824 (44%), Positives = 514/824 (62%), Gaps = 13/824 (1%) Query: 86 VHKFGGTCVGNSERIKDVAAVVVKDDSERKLVVVSAMSKVTDMMYDLIHRAESRDDSYLS 145 VHKFGGT V +++ + VA ++ ++ VVSAM VTD + L A + D + Sbjct: 25 VHKFGGTSVADADCYRHVARLLQARPEALQVTVVSAMKGVTDALIALARLAAAGDAQWRE 84 Query: 146 ALSGVLEKHRATAVDLLDGDELSSFLARLNDDINNLKAMLRAIYIAGHATESFSDFVVGH 205 + +HR A LL G+ + + L+ + L +L A+ + G E D V G Sbjct: 85 DWHDLRARHRGAAASLL-GEHAGASVEWLDARFDELAELLAALAVIGGLPEEVLDRVQGL 143 Query: 206 GELWSAQMLAAVVRKSGLDCTWMDARDVLVVIPTSSNQVDPDFVESEKRLEKWFTQNSAK 265 GE+ SA +L + SG C +DAR+VLVV V+ D+ S +RL W N + Sbjct: 144 GEVCSAHLLGQHFQASGEPCAVLDAREVLVV-EHGELGVNVDWETSAQRLHAWRQANVQQ 202 Query: 266 IIIATGFIASTPQNIPTTLKRDGSDFSAAIMSALFRSHQLTIWTDVDGVYSADPRKVSEA 325 I+ TGFIA ++ TTL R+GSD+S AI +ALF + +L IWTDVDGV SADPR V EA Sbjct: 203 RIVVTGFIARDRRDRITTLGRNGSDYSGAIFAALFNADELQIWTDVDGVLSADPRLVPEA 262 Query: 326 VVLKTLSYQEAWEMSYFGANVLHPRTIIPVMKYDIPIVIRNIFNLSAPGTMICRQIDDED 385 V L+ LSY EA E++YFGA V+HP+T+ P ++ +PI+IRN F PGT I + Sbjct: 263 VQLEALSYDEACELAYFGAKVVHPQTMSPAIQRGLPILIRNTFQPEHPGTRITAERS--- 319 Query: 386 GFKLDAPVKGFATIDNLALVNVEGTGMAGVPGTASAIFSAVKEVGANVIMISQASSEHSV 445 + P+KG LA++N+EGTG+ GVPGTA +FSA+++ +V+MISQ SSEHS+ Sbjct: 320 ---VRGPIKGLTLSAELAVLNLEGTGLIGVPGTAERVFSALRQAQVSVVMISQGSSEHSI 376 Query: 446 CFAVPEKEVKAVSEALNSRFRQALAGGRLSQIEIIPNCSILAAVGQKMASTPGVSATFFN 505 C V E + +AL F LA G++ ++++ S+LAAVG MA PGV+A F Sbjct: 377 CCVVRGNESERGRDALLQAFAHELAVGQVQRVQLRGGVSVLAAVGDGMAGQPGVAARLFE 436 Query: 506 ALAKANINIRAIAQGCSEFNITVVVKREDCIRALRAVHSRFYLSRTTLAVGIIGPGLIGG 565 +L +A +NI AIAQG SE NI+V V D RALRA H+ F+LS T AVG+IGPG +G Sbjct: 437 SLGRAQVNILAIAQGSSERNISVAVDSGDATRALRAAHAGFWLSPQTFAVGVIGPGNVGA 496 Query: 566 TLLDQIRDQAAVLKEEFKIDLRVIGITGSSKMLMSESGIDLSRWRELMKEEGEKADMEKF 625 LLDQ++ L + IDLR+ + M ++ + WR + + +D++ F Sbjct: 497 ALLDQLQAAQPQLLGKANIDLRLRAVASRRGMHLAPRALQ-GPWRAALADAASASDLDAF 555 Query: 626 TQYVKGNHFIPNSVMVDCTADADIASCYYDWLLRGIHVVTPNKKANSGPLDQYLKIRDLQ 685 T ++ H +P++V++DC+ A++A Y WL GIHVVTPNK+A SGPL +Y IR Sbjct: 556 TAHLLSAH-LPHAVVIDCSGSAEVAGRYEGWLAAGIHVVTPNKQAGSGPLARYQSIRAAA 614 Query: 686 RKSYTHYFYEATVGAGLPIISTLRGLLETGDKILRIEGIFSGTLSYLFNNFVGTRSFSEV 745 S + YEATVGAGLP+I+TLR L++TGD++L IEGIFSGTL++LFN F G++ FS + Sbjct: 615 AASGARFRYEATVGAGLPVITTLRDLVDTGDEVLSIEGIFSGTLAWLFNRFDGSQPFSAL 674 Query: 746 VAEAKQAGFTEPDPRDDLSGTDVARKVTILARESGLKLDLEGLPVQNLVPKPLQACASAE 805 V +A+ G+TEPDPRDDLSG DVARK+ ILARE+G L LE + V++LVP+ L+A S + Sbjct: 675 VEQARGMGYTEPDPRDDLSGIDVARKLVILAREAGRDLSLEQVQVESLVPEALRA-GSVD 733 Query: 806 EFMEKLPQFDEELSKQREEAEAAGEVLRYVGVVDAVEKKGTVELKRYKKDHPFAQLSGAD 865 FM++L D+ L ++ ++A A G VLRYV + A + +V L DH FA L D Sbjct: 734 AFMQRLGDNDDALLQRLQQARARGAVLRYVAALSA--QGASVGLVEIPLDHAFANLKLTD 791 Query: 866 NIIAFTTKRYKEQPLIVRGPGAGAQVTAGGIFSDILRLAFYLGA 909 N++ F T+RY + PL+V+GPGAG +VTA G+F+D+LR+A GA Sbjct: 792 NVVQFRTRRYCDNPLVVQGPGAGPEVTAAGVFADLLRVAAGEGA 835 Lambda K H 0.319 0.134 0.385 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 1 Number of Hits to DB: 1479 Number of extensions: 55 Number of successful extensions: 9 Number of sequences better than 1.0e-02: 1 Number of HSP's gapped: 1 Number of HSP's successfully gapped: 1 Length of query: 911 Length of database: 837 Length adjustment: 42 Effective length of query: 869 Effective length of database: 795 Effective search space: 690855 Effective search space used: 690855 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 56 (26.2 bits)
This GapMind analysis is from Apr 10 2024. The underlying query database was built on Apr 09 2024.
Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.
A candidate for a step is "high confidence" if either:
Otherwise, a candidate is "medium confidence" if either:
Other blast hits with at least 50% coverage are "low confidence."
Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:
GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
For more information, see:
If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know
by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory