Align carbamoyl-phosphate synthase (glutamine-hydrolysing) (EC 6.3.5.5) (characterized)
to candidate 7023738 Shewana3_0968 carbamoyl phosphate synthase large subunit (RefSeq)
Query= BRENDA::P00968 (1073 letters) >FitnessBrowser__ANA3:7023738 Length = 1074 Score = 1790 bits (4637), Expect = 0.0 Identities = 895/1072 (83%), Positives = 973/1072 (90%), Gaps = 1/1072 (0%) Query: 1 MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEM 60 MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEM Sbjct: 1 MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEM 60 Query: 61 ADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATA 120 ADATYIEPI WEVVR II KERPDA+LPTMGGQTALNCALELE +GVL EF V MIGATA Sbjct: 61 ADATYIEPIQWEVVRNIIAKERPDAILPTMGGQTALNCALELEAKGVLAEFNVEMIGATA 120 Query: 121 DAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGSGGG 180 DAIDKAEDR RFD AMK IGLE R+GIAH+MEEA V VGFPCIIRPSFTMGGSGGG Sbjct: 121 DAIDKAEDRSRFDKAMKSIGLECPRAGIAHSMEEAYGVLDLVGFPCIIRPSFTMGGSGGG 180 Query: 181 IAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAM 240 IAYN+EEFEEIC++GLDLSPTKELLIDESLIGWKEYEMEVVRD+NDNCIIVCSIENFD M Sbjct: 181 IAYNKEEFEEICSQGLDLSPTKELLIDESLIGWKEYEMEVVRDRNDNCIIVCSIENFDPM 240 Query: 241 GIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEM 300 G+HTGDSITVAPAQTLTDKEYQ+MRNASMAVLREIGVETGGSNVQF +NPK+GR+++IEM Sbjct: 241 GVHTGDSITVAPAQTLTDKEYQLMRNASMAVLREIGVETGGSNVQFGINPKDGRMVIIEM 300 Query: 301 NPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIP 360 NPRVSRSSALASKATGFPIAK+AAKLAVG+TLDELMNDITGGRTPASFEP+IDYVVTK+P Sbjct: 301 NPRVSRSSALASKATGFPIAKIAAKLAVGFTLDELMNDITGGRTPASFEPAIDYVVTKVP 360 Query: 361 RFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEA 420 RFNFEKFAG+NDRLTTQMKSVGEVMAIGRT QESLQKALRGLEV GFDP L +A Sbjct: 361 RFNFEKFAGSNDRLTTQMKSVGEVMAIGRTFQESLQKALRGLEVSRHGFDPITDLTKADA 420 Query: 421 LTKIRRELKDAGADRIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVG 480 L +IR ELK+ G DRIWYIADA RAGL++D +F LTNID WFLVQIEEL++LE +VAE G Sbjct: 421 LARIRLELKEPGCDRIWYIADAMRAGLTLDEIFRLTNIDPWFLVQIEELIKLEGQVAEGG 480 Query: 481 ITGLNADFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDT 540 + GLN + LR+LKRKGFADARLA + GV E E+RKLRD++D+HPVYKRVDTCAAEFATDT Sbjct: 481 LAGLNEELLRKLKRKGFADARLAAVLGVNETEVRKLRDRFDIHPVYKRVDTCAAEFATDT 540 Query: 541 AYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVN 600 AYMYSTYEEECEANPS DREKIMVLGGGPNRIGQGIEFDYCCVHA+LALREDGYETIMVN Sbjct: 541 AYMYSTYEEECEANPS-DREKIMVLGGGPNRIGQGIEFDYCCVHAALALREDGYETIMVN 599 Query: 601 CNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVP 660 CNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVP Sbjct: 600 CNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVP 659 Query: 661 VIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVL 720 +IGTSPDAIDRAEDRERFQ A++RL++KQP N TVT +E AV A+ IGYPLVVRPSYVL Sbjct: 660 IIGTSPDAIDRAEDRERFQQAIQRLEMKQPENDTVTTVEGAVIAAERIGYPLVVRPSYVL 719 Query: 721 GGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIME 780 GGRAMEIVYD+ DL RYF AVSVSN +PVLLDHFLDDA+EVD+DA+CDGE V+IG IME Sbjct: 720 GGRAMEIVYDQQDLLRYFNEAVSVSNASPVLLDHFLDDAIEVDIDAVCDGETVVIGAIME 779 Query: 781 HIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLI 840 HIEQAGVHSGDS CSLP YTLSQ IQD MR QV+KLA EL V GLMNVQFAVKNNE+Y+I Sbjct: 780 HIEQAGVHSGDSGCSLPPYTLSQAIQDEMRVQVRKLAMELGVVGLMNVQFAVKNNEIYMI 839 Query: 841 EVNPRAARTVPFVSKATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKF 900 EVNPRAARTVPFVSKATGVPLAK+AARVMAG+SL Q T+EVIPP+YSVKEVVLPFNKF Sbjct: 840 EVNPRAARTVPFVSKATGVPLAKIAARVMAGQSLKAQNFTQEVIPPFYSVKEVVLPFNKF 899 Query: 901 PGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDL 960 PGVDPLLGPEMRSTGEVMGVG TFAEA+AKAQLG+ S + K GRALLSVR DK+RV DL Sbjct: 900 PGVDPLLGPEMRSTGEVMGVGDTFAEAYAKAQLGATSEVPKSGRALLSVRNSDKKRVADL 959 Query: 961 AAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGR 1020 AAKL++ G+++DATHGTA++LGEAGINPRLVNKVHEGRPHI DRIKNGEYTYI+NTT GR Sbjct: 960 AAKLIELGYQIDATHGTAVILGEAGINPRLVNKVHEGRPHILDRIKNGEYTYIVNTTEGR 1019 Query: 1021 RAIEDSRVIRRSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQI 1072 +AIEDSR +RR AL+YKV+Y TT+N FAT MA AD V SVQE+H ++ Sbjct: 1020 QAIEDSRQLRRGALRYKVNYTTTMNAAFATCMAHAADDRTNVTSVQELHQRV 1071 Lambda K H 0.318 0.135 0.383 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 1 Number of Hits to DB: 3348 Number of extensions: 121 Number of successful extensions: 12 Number of sequences better than 1.0e-02: 1 Number of HSP's gapped: 1 Number of HSP's successfully gapped: 1 Length of query: 1073 Length of database: 1074 Length adjustment: 45 Effective length of query: 1028 Effective length of database: 1029 Effective search space: 1057812 Effective search space used: 1057812 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 58 (26.9 bits)
Align candidate 7023738 Shewana3_0968 (carbamoyl phosphate synthase large subunit (RefSeq))
to HMM TIGR01369 (carB: carbamoyl-phosphate synthase, large subunit (EC 6.3.5.5))
# hmmsearch :: search profile(s) against a sequence database # HMMER 3.3.1 (Jul 2020); http://hmmer.org/ # Copyright (C) 2020 Howard Hughes Medical Institute. # Freely distributed under the BSD open source license. # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - # query HMM file: ../tmp/path.aa/TIGR01369.hmm # target sequence database: /tmp/gapView.24233.genome.faa # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: TIGR01369 [M=1052] Accession: TIGR01369 Description: CPSaseII_lrg: carbamoyl-phosphate synthase, large subunit Scores for complete sequences (score includes all domains): --- full sequence --- --- best 1 domain --- -#dom- E-value score bias E-value score bias exp N Sequence Description ------- ------ ----- ------- ------ ----- ---- -- -------- ----------- 0 1574.5 0.0 0 1574.3 0.0 1.0 1 lcl|FitnessBrowser__ANA3:7023738 Shewana3_0968 carbamoyl phosphat Domain annotation for each sequence (and alignments): >> lcl|FitnessBrowser__ANA3:7023738 Shewana3_0968 carbamoyl phosphate synthase large subunit (RefSeq) # score bias c-Evalue i-Evalue hmmfrom hmm to alifrom ali to envfrom env to acc --- ------ ----- --------- --------- ------- ------- ------- ------- ------- ------- ---- 1 ! 1574.3 0.0 0 0 1 1051 [. 2 1052 .. 2 1053 .. 0.98 Alignments for each domain: == domain 1 score: 1574.3 bits; conditional E-value: 0 TIGR01369 1 pkredikkvlviGsGpivigqAaEFDYsGsqalkalkeegievvLvnsniAtvmtdeeladkvYiePltveavek 75 pkr+dik++l++G+GpivigqA+EFDYsG+qa+kal+eeg++v+Lvnsn+At+mtd+e+ad++YieP+++e+v++ lcl|FitnessBrowser__ANA3:7023738 2 PKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIQWEVVRN 76 689************************************************************************ PP TIGR01369 76 iiekErpDailltlGGqtaLnlaveleekGvLekygvkllGtkveaikkaedRekFkealkeineevakseives 150 ii kErpDail+t+GGqtaLn+a+ele kGvL++++v+++G++ +ai+kaedR +F++a+k i++e +++ i++s lcl|FitnessBrowser__ANA3:7023738 77 IIAKERPDAILPTMGGQTALNCALELEAKGVLAEFNVEMIGATADAIDKAEDRSRFDKAMKSIGLECPRAGIAHS 151 *************************************************************************** PP TIGR01369 151 veealeaaeeigyPvivRaaftlgGtGsgiaeneeelkelvekalkaspikqvlvekslagwkEiEyEvvRDskd 225 +eea + + +g+P+i+R++ft+gG+G+gia+n+ee++e+++++l++sp+k++l+++sl gwkE+E+EvvRD++d lcl|FitnessBrowser__ANA3:7023738 152 MEEAYGVLDLVGFPCIIRPSFTMGGSGGGIAYNKEEFEEICSQGLDLSPTKELLIDESLIGWKEYEMEVVRDRND 226 *************************************************************************** PP TIGR01369 226 nciivcniEnlDplGvHtGdsivvaPsqtLtdkeyqllRdaslkiirelgvege.cnvqfaldPeskryvviEvn 299 nciivc+iEn+Dp+GvHtGdsi+vaP+qtLtdkeyql+R+as++++re+gve++ +nvqf+++P++ r+v+iE+n lcl|FitnessBrowser__ANA3:7023738 227 NCIIVCSIENFDPMGVHTGDSITVAPAQTLTDKEYQLMRNASMAVLREIGVETGgSNVQFGINPKDGRMVIIEMN 301 ****************************************************988******************** PP TIGR01369 300 pRvsRssALAskAtGyPiAkvaaklavGysLdelkndvtk.etvAsfEPslDYvvvkiPrwdldkfekvdrklgt 373 pRvsRssALAskAtG+PiAk+aaklavG++Ldel+nd+t+ +t+AsfEP++DYvv+k+Pr++++kf++ +++l+t lcl|FitnessBrowser__ANA3:7023738 302 PRVSRSSALASKATGFPIAKIAAKLAVGFTLDELMNDITGgRTPASFEPAIDYVVTKVPRFNFEKFAGSNDRLTT 376 ***************************************879********************************* PP TIGR01369 374 qmksvGEvmaigrtfeealqkalrsleekllglklkekeae..sdeeleealkkpndrRlfaiaealrrgvsvee 446 qmksvGEvmaigrtf+e+lqkalr le + +g++ ++ ++ + +++ +lk+p +R+++ia+a+r+g++++e lcl|FitnessBrowser__ANA3:7023738 377 QMKSVGEVMAIGRTFQESLQKALRGLEVSRHGFDPITDLTKadALARIRLELKEPGCDRIWYIADAMRAGLTLDE 451 ********************************87654444300445677899*********************** PP TIGR01369 447 vyeltkidrffleklkklvelekeleeeklkelkkellkkakklGfsdeqiaklvkvseaevrklrkelgivpvv 521 +++lt+id +fl ++++l++le +++e l l++ell+k+k++Gf+d+++a++++v+e+evrklr++ +i+pv+ lcl|FitnessBrowser__ANA3:7023738 452 IFRLTNIDPWFLVQIEELIKLEGQVAEGGLAGLNEELLRKLKRKGFADARLAAVLGVNETEVRKLRDRFDIHPVY 526 *************************************************************************** PP TIGR01369 522 krvDtvaaEfeaktpYlYstyeeekddvevtekkkvlvlGsGpiRigqgvEFDycavhavlalreagyktiliny 596 krvDt+aaEf ++t+Y+Ystyeee+ +++ ++++k++vlG+Gp+Rigqg+EFDyc+vha+lalre gy+ti++n+ lcl|FitnessBrowser__ANA3:7023738 527 KRVDTCAAEFATDTAYMYSTYEEEC-EANPSDREKIMVLGGGPNRIGQGIEFDYCCVHAALALREDGYETIMVNC 600 ************************5.55566667***************************************** PP TIGR01369 597 nPEtvstDydiadrLyFeeltvedvldiiekekvegvivqlgGqtalnlakeleeagvkilGtsaesidraEdRe 671 nPEtvstDyd++drLyFe++t+edvl+i++ ek++gvivq+gGqt+l+la++le+agv+i+Gts+++idraEdRe lcl|FitnessBrowser__ANA3:7023738 601 NPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPIIGTSPDAIDRAEDRE 675 *************************************************************************** PP TIGR01369 672 kFsklldelgikqpkgkeatsveeakeiakeigyPvlvRpsyvlgGrameiveneeeleryleeavevskekPvl 746 +F++++++l++kqp++ ++t+ve a+ +a++igyP++vRpsyvlgGrameiv+++++l ry++eav+vs+ +Pvl lcl|FitnessBrowser__ANA3:7023738 676 RFQQAIQRLEMKQPENDTVTTVEGAVIAAERIGYPLVVRPSYVLGGRAMEIVYDQQDLLRYFNEAVSVSNASPVL 750 *************************************************************************** PP TIGR01369 747 idkyledavEvdvDavadgeevliagileHiEeaGvHsGDstlvlppqklseevkkkikeivkkiakelkvkGll 821 +d++l+da+Evd+Dav+dge+v+i +i+eHiE+aGvHsGDs ++lpp +ls+ ++++++ +v+k+a el v+Gl+ lcl|FitnessBrowser__ANA3:7023738 751 LDHFLDDAIEVDIDAVCDGETVVIGAIMEHIEQAGVHSGDSGCSLPPYTLSQAIQDEMRVQVRKLAMELGVVGLM 825 *************************************************************************** PP TIGR01369 822 niqfvvkdeevyviEvnvRasRtvPfvskalgvplvklavkvllgkkleelekgvkkekksklvavkaavfsfsk 896 n+qf+vk++e+y+iEvn+Ra+RtvPfvska+gvpl+k+a++v++g++l+ ++ ++e + +++vk++v++f+k lcl|FitnessBrowser__ANA3:7023738 826 NVQFAVKNNEIYMIEVNPRAARTVPFVSKATGVPLAKIAARVMAGQSLKAQN--FTQEVIPPFYSVKEVVLPFNK 898 **************************************************55..6999***************** PP TIGR01369 897 lagvdvvlgpemkstGEvmgigrdleeallkallaskakikkkgsvllsvkdkdkeellelakklaekglkvyat 971 + gvd++lgpem+stGEvmg+g++++ea++ka+l ++++++k g++llsv+++dk+++ +la+kl e+g+++ at lcl|FitnessBrowser__ANA3:7023738 899 FPGVDPLLGPEMRSTGEVMGVGDTFAEAYAKAQLGATSEVPKSGRALLSVRNSDKKRVADLAAKLIELGYQIDAT 973 *************************************************************************** PP TIGR01369 972 egtakvleeagikaevvlkvseeaekilellkeeeielvinltskkkkaaekgykirreaveykvplvteletae 1046 +gta +l eagi+ ++v+kv+e +++il+ +k++e+++++n+t+ +++a e++ ++rr a++ykv++ t++++a lcl|FitnessBrowser__ANA3:7023738 974 HGTAVILGEAGINPRLVNKVHEGRPHILDRIKNGEYTYIVNTTE-GRQAIEDSRQLRRGALRYKVNYTTTMNAAF 1047 *****************************************997.88899999*******************999 PP TIGR01369 1047 allea 1051 a+++a lcl|FitnessBrowser__ANA3:7023738 1048 ATCMA 1052 88765 PP Internal pipeline statistics summary: ------------------------------------- Query model(s): 1 (1052 nodes) Target sequences: 1 (1074 residues searched) Passed MSV filter: 1 (1); expected 0.0 (0.02) Passed bias filter: 1 (1); expected 0.0 (0.02) Passed Vit filter: 1 (1); expected 0.0 (0.001) Passed Fwd filter: 1 (1); expected 0.0 (1e-05) Initial search space (Z): 1 [actual number of targets] Domain search space (domZ): 1 [number of targets reported over threshold] # CPU time: 0.07u 0.03s 00:00:00.10 Elapsed: 00:00:00.09 # Mc/sec: 11.40 // [ok]
This GapMind analysis is from Apr 09 2024. The underlying query database was built on Apr 09 2024.
Each pathway is defined by a set of rules based on individual steps or genes. Candidates for each step are identified by using ublast (a fast alternative to protein BLAST) against a database of manually-curated proteins (most of which are experimentally characterized) or by using HMMer with enzyme models (usually from TIGRFam). Ublast hits may be split across two different proteins.
A candidate for a step is "high confidence" if either:
Otherwise, a candidate is "medium confidence" if either:
Other blast hits with at least 50% coverage are "low confidence."
Steps with no high- or medium-confidence candidates may be considered "gaps." For the typical bacterium that can make all 20 amino acids, there are 1-2 gaps in amino acid biosynthesis pathways. For diverse bacteria and archaea that can utilize a carbon source, there is a complete high-confidence catabolic pathway (including a transporter) just 38% of the time, and there is a complete medium-confidence pathway 63% of the time. Gaps may be due to:
GapMind relies on the predicted proteins in the genome and does not search the six-frame translation. In most cases, you can search the six-frame translation by clicking on links to Curated BLAST for each step definition (in the per-step page).
For more information, see:
If you notice any errors or omissions in the step descriptions, or any questionable results, please let us know
by Morgan Price, Arkin group, Lawrence Berkeley National Laboratory