PaperBLAST – Find papers about a protein or its homologs

 

PaperBLAST

PaperBLAST Hits for GFF2930 (89 a.a., MIKTTVTISN...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmenbrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Fitness BLAST: loading...

Found 136 similar proteins in the literature:

RD2015_RS22180 HPr family phosphocarrier protein from Roseateles depolymerans
80% identity, 100% coverage

Rmet_0245 HPrNtr from Ralstonia metallidurans CH34
72% identity, 100% coverage

Reut_A0296 Phosphoryl transfer system, HPr from Ralstonia eutropha JMP134
72% identity, 100% coverage

phbH / AAA21977.1 protein H from Cupriavidus necator (see paper)
H16_A0325 Phosphocarrier protein (HPr) from Ralstonia eutropha H16
71% identity, 100% coverage

OFBG_01695 HPr family phosphocarrier protein from Oxalobacter formigenes OXCC13
66% identity, 100% coverage

NE2184 Phosphocarrier HPr protein from Nitrosomonas europaea ATCC 19718
70% identity, 100% coverage

NMCC_0141 sugar transport PTS system phosphocarrier protein HPR from Neisseria meningitidis 053442
60% identity, 100% coverage

NGK_2199 PtsH from Neisseria gonorrhoeae NCCP11945
NGO2037 PtsH from Neisseria gonorrhoeae FA 1090
60% identity, 100% coverage

NMB2045 phosphocarrier protein HPr from Neisseria meningitidis MC58
58% identity, 100% coverage

TASI_1133 HPr family phosphocarrier protein from Taylorella asinigenitalis MCE3
54% identity, 100% coverage

Q83DI7 Phosphocarrier protein HPr from Coxiella burnetii (strain RSA 493 / Nine Mile phase I)
51% identity, 98% coverage

Sde_3182 HNH nuclease from Saccharophagus degradans 2-40
54% identity, 92% coverage

SPV1_10234 hypothetical protein from Mariprofundus ferrooxydans PV-1
49% identity, 100% coverage

PP0948, PP_0948 phosphotransferase system, NPr protein from Pseudomonas putida KT2440
53% identity, 99% coverage

XF1403 phosphotransferase system HPr enzyme from Xylella fastidiosa 9a5c
48% identity, 92% coverage

PA14_57980 putative phosphoryl carrier protein from Pseudomonas aeruginosa UCBPP-PA14
Q9HVV2 Phosphocarrier protein HPr from Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1)
PA4466 probable phosphoryl carrier protein from Pseudomonas aeruginosa PAO1
51% identity, 99% coverage

CCNA_00241 phosphocarrier protein HPr from Caulobacter crescentus NA1000
CC0241 phosphocarrier protein HPr from Caulobacter crescentus CB15
52% identity, 95% coverage

XC_1305 phosphotransferase system HPr enzyme from Xanthomonas campestris pv. campestris str. 8004
48% identity, 98% coverage

TepiRe1_0766 HPr family phosphocarrier protein from Tepidanaerobacter acetatoxydans Re1
45% identity, 98% coverage

AMUC_RS09380 HPr family phosphocarrier protein from Akkermansia muciniphila
47% identity, 98% coverage

Pcar_1930 phosphocarrier protein HPr from Pelobacter carbinolicus str. DSM 2380
48% identity, 92% coverage

A1S_0589 phosphocarrier protein (HPr-like) from Acinetobacter baumannii ATCC 17978
44% identity, 100% coverage

Ethha_2064 HPr family phosphocarrier protein from Ethanoligenens harbinense YUAN-3
42% identity, 97% coverage

TP0589 phosphocarrier protein HPr (ptsH) from Treponema pallidum subsp. pallidum str. Nichols
44% identity, 98% coverage

GJQ69_02120 HPr family phosphocarrier protein from Caproicibacterium lactatifermentans
39% identity, 98% coverage

BMEI2031 PHOSPHOCARRIER PROTEIN HPR from Brucella melitensis 16M
42% identity, 86% coverage

H604_RS0100980 HPr family phosphocarrier protein from Ruminococcus gauvreauii DSM 19829
42% identity, 98% coverage

SPO0715 HPr family phosphocarrier protein from Ruegeria pomeroyi DSS-3
50% identity, 99% coverage

LEPBI_I1652 phosphocarrier protein HPr (histidine-containing protein) from Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'
44% identity, 80% coverage

Fisuc_1002 Phosphotransferase system, phosphocarrier protein HPr from Fibrobacter succinogenes subsp. succinogenes S85
43% identity, 97% coverage

VP2674 phosphocarrier protein NPr from Vibrio parahaemolyticus RIMD 2210633
44% identity, 92% coverage

BAW_12011 HPr family phosphocarrier protein from Brucella abortus
BAB1_2097 Phosphocarrier HPr protein:Histidine phosphorylation site in HPr protein from Brucella melitensis biovar Abortus 2308
41% identity, 83% coverage

msl5090 phosphocarrier protein HPr from Mesorhizobium loti MAFF303099
46% identity, 83% coverage

L21SP2_3294 HPr family phosphocarrier protein from Salinispira pacifica
33% identity, 98% coverage

Csac_1163 Phosphotransferase system, phosphocarrier protein HPr from Caldicellulosiruptor saccharolyticus DSM 8903
40% identity, 82% coverage

Npr / b3206 phosphorelay protein NPr from Escherichia coli K-12 substr. MG1655 (see 10 papers)
npr / AAB60167.1 NPr from Escherichia coli (see 3 papers)
b3206 phosphohistidinoprotein-hexose phosphotransferase component of N-regulated PTS system (Npr) from Escherichia coli str. K-12 substr. MG1655
NP_417673 phosphorelay protein NPr from Escherichia coli str. K-12 substr. MG1655
SF5M90T_3161 PTS phosphocarrier protein NPr from Shigella flexneri 5a str. M90T
39% identity, 96% coverage

CRH_BACSU / O06976 HPr-like protein Crh; Catabolite repression HPr from Bacillus subtilis (strain 168) (see 3 papers)
NP_391354 catabolite repression HPr-like protein from Bacillus subtilis subsp. subtilis str. 168
39% identity, 94% coverage

RBAM_RS16170 HPr family phosphocarrier protein from Bacillus velezensis FZB42
39% identity, 94% coverage

VC2533 phosphocarrier protein NPr from Vibrio cholerae O1 biovar eltor str. N16961
40% identity, 96% coverage

SMc02754 PUTATIVE PHOSPHOCARRIER HPR TRANSMEMBRANE PROTEIN from Sinorhizobium meliloti 1021
45% identity, 83% coverage

CS401_RS07595 HPr family phosphocarrier protein from Fusobacterium vincentii
41% identity, 98% coverage

blr8148 phosphocarrier protein HPr from Bradyrhizobium japonicum USDA 110
45% identity, 82% coverage

ECs4354 phosphotransferase system HPr enzyme from Escherichia coli O157:H7 str. Sakai
40% identity, 100% coverage

MHJ_0611 phosphocarrier protein HPr from Mycoplasma hyopneumoniae J
40% identity, 91% coverage

Z4879 putative phosphocarrier protein from Escherichia coli O157:H7 EDL933
40% identity, 100% coverage

FN1782 Phosphocarrier protein HPr from Fusobacterium nucleatum subsp. nucleatum ATCC 25586
40% identity, 98% coverage

MNF30_01490 HPr family phosphocarrier protein from Mycoplasma mycoides subsp. capri
42% identity, 87% coverage

CC0448 PTS system, fructose-specific EIIA/HPr/EI components from Caulobacter crescentus CB15
36% identity, 10% coverage

LA_2402 phosphocarrier protein Hpr from Leptospira interrogans serovar lai str. 56601
38% identity, 85% coverage

CDR20291_2644 HPr family phosphocarrier protein from Clostridioides difficile R20291
CD2756 PTS system, phosphocarrier protein from Clostridium difficile 630
CDR20291_2644 PTS system, phosphocarrier protein from Clostridium difficile R20291
40% identity, 97% coverage

B488_RS06655 HPr family phosphocarrier protein from Liberibacter crescens BT-1
41% identity, 88% coverage

ESA_03599 hypothetical protein from Enterobacter sakazakii ATCC BAA-894
ESA_03599 PTS phosphocarrier protein NPr from Cronobacter sakazakii ATCC BAA-894
39% identity, 96% coverage

LBL_1668 Phosphocarrier protein HPr-related from Leptospira borgpetersenii serovar Hardjo-bovis L550
39% identity, 81% coverage

BAS5001 phosphocarrier protein HPr from Bacillus anthracis str. Sterne
43% identity, 78% coverage

YE1207 PTS system, phosphocarrier protein from Yersinia enterocolitica subsp. enterocolitica 8081
35% identity, 93% coverage

Lreu_1325 phosphocarrier protein HPr from Lactobacillus reuteri DSM 20016
Lreu_1325 phosphocarrier protein HPr from Limosilactobacillus reuteri subsp. reuteri
40% identity, 87% coverage

Csac_2438 Phosphotransferase system, phosphocarrier protein HPr from Caldicellulosiruptor saccharolyticus DSM 8903
33% identity, 91% coverage

y0159 phosphocarrier protein HPr-like NPr from Yersinia pestis KIM
YPO3587 putative phosphocarrier protein from Yersinia pestis CO92
37% identity, 96% coverage

ORF00572 multiphosphoryl transfer protein, putative from Desulfovibrio vulgaris Hildenborough
39% identity, 10% coverage

LEUM_1780 Phosphotransferase system, HPr-related protein from Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293
50% identity, 70% coverage

OEOE_RS03075 phosphocarrier protein HPr from Oenococcus oeni PSU-1
42% identity, 87% coverage

ETAE_1132 phosphohistidinoprotein-hexose phosphotransferase component of PTS system from Edwardsiella tarda EIB202
37% identity, 91% coverage

PmVP161_0905 phosphocarrier protein Hpr from Pasteurella multocida
39% identity, 78% coverage

CBG46_00240 phosphocarrier protein Hpr from Actinobacillus succinogenes
37% identity, 91% coverage

ptsH / AAA16213.1 phosphocarrier protein from Mycoplasma capricolum (see paper)
NO343_00615 HPr family phosphocarrier protein from Mycoplasma capricolum subsp. capricolum
38% identity, 87% coverage

HI1713 phosphocarrier protein HPr (ptsH) from Haemophilus influenzae Rd KW20
38% identity, 78% coverage

Hpr / b2415 phosphocarrier protein HPr from Escherichia coli K-12 substr. MG1655 (see 31 papers)
ptsH / P0AA04 phosphocarrier protein HPr from Escherichia coli (strain K12) (see 31 papers)
1j6tB / P0AA04 Complex of enzyme iiamtl and the histidine-containing phosphocarrier protein hpr from escherichia coli nmr, restrained regularized mean structure (see paper)
ptsH phosphocarrier protein HPr; EC 2.7.11.- from Escherichia coli K12 (see 19 papers)
STM2431 PTS family, Hpr protein, phosphohistidinoprotein-hexose phosphotransferase from Salmonella typhimurium LT2
NP_416910 phosphocarrier protein HPr from Escherichia coli str. K-12 substr. MG1655
P0AA04 Phosphocarrier protein HPr from Escherichia coli (strain K12)
P0AA07 Phosphocarrier protein HPr from Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720)
NP_416910, b2415 phosphohistidinoprotein-hexose phosphotransferase component of PTS system (Hpr) from Escherichia coli str. K-12 substr. MG1655
ECs3287 PTS system protein HPr from Escherichia coli O157:H7 str. Sakai
SF5M90T_2404 phosphocarrier protein Hpr from Escherichia coli
35% identity, 93% coverage

lmo1002 PTS phosphocarrier protein Hpr (histidine containing protein) from Listeria monocytogenes EGD-e
LM6179_1319 phosphocarrier protein HPr from Listeria monocytogenes 6179
34% identity, 92% coverage

SG1701 PTS system phosphocarrier protein HPr from Sodalis glossinidius str. 'morsitans'
35% identity, 93% coverage

X276_20425 HPr family phosphocarrier protein from Clostridium beijerinckii NRRL B-598
Cbei_1219 phosphotransferase system, phosphocarrier protein HPr from Clostridium beijerincki NCIMB 8052
37% identity, 92% coverage

PMI1828 PTS system phosphocarrier protein from Proteus mirabilis HI4320
36% identity, 93% coverage

SAOUHSC_01028 phosphocarrier protein hpr, putative from Staphylococcus aureus subsp. aureus NCTC 8325
P99143 Phosphocarrier protein HPr from Staphylococcus aureus (strain N315)
SA0934 phophocarrier protein HPR, phosphohistidin-containing protein from Staphylococcus aureus subsp. aureus N315
SAV1083 phophocarrier protein Hpr from Staphylococcus aureus subsp. aureus Mu50
SAUSA300_0983 phosphocarrier protein HPr from Staphylococcus aureus subsp. aureus USA300_FPR3757
SACOL1091 phosphocarrier protein HPr from Staphylococcus aureus subsp. aureus COL
36% identity, 87% coverage

UC7_RS15600 phosphocarrier protein HPr from Enterococcus caccae ATCC BAA-1240
42% identity, 99% coverage

blr3574 ptsH from Bradyrhizobium japonicum USDA 110
38% identity, 82% coverage

PS417_23035 D-trehalose PTS system, I, HPr, and IIA components from Pseudomonas simiae WCS417
34% identity, 10% coverage

EF0709 phosphocarrier protein HPr from Enterococcus faecalis V583
H9Q64_RS06955, HMPREF9494_02513 phosphocarrier protein HPr from Enterococcus faecalis TX2137
42% identity, 88% coverage

YPTB2715 PTS system, phosphocarrier protein from Yersinia pseudotuberculosis IP 32953
YPO2993 PTS system, phosphocarrier protein from Yersinia pestis CO92
34% identity, 93% coverage

B5X54_RS01280 HPr family phosphocarrier protein from Caldicellulosiruptor bescii
33% identity, 93% coverage

PTHP_LYSSH / Q84F84 Phosphocarrier protein HPr; Histidine-containing protein from Lysinibacillus sphaericus (Bacillus sphaericus) (see paper)
38% identity, 87% coverage

EMQU_0954 phosphocarrier protein HPr from Enterococcus mundtii QU 25
42% identity, 88% coverage

ptsH / AAO61680.1 phosphocarrier protein HPr from Clostridium acetobutylicum (see paper)
CAC1820 Phosphocarrier Protein (Hpr) from Clostridium acetobutylicum ATCC 824
36% identity, 91% coverage

H16_A2203 HPr family phosphocarrier protein from Cupriavidus necator H16
33% identity, 90% coverage

Ccel_0806 HPr family phosphocarrier protein from Ruminiclostridium cellulolyticum H10
Ccel_0806 phosphocarrier, HPr family from Clostridium cellulolyticum H10
39% identity, 87% coverage

Q7NZE2 phosphoenolpyruvate--protein phosphotransferase from Chromobacterium violaceum (strain ATCC 12472 / DSM 30191 / JCM 1249 / CCUG 213 / NBRC 12614 / NCIMB 9131 / NCTC 9757 / MK)
36% identity, 10% coverage

LBA0639 phosphocarrier protein HPr from Lactobacillus acidophilus NCFM
39% identity, 87% coverage

PFL_4931 phosphoenolpyruvate-protein phosphotransferase from Pseudomonas fluorescens Pf-5
35% identity, 10% coverage

NJ56_RS11155 phosphocarrier protein Hpr from Yersinia ruckeri
34% identity, 93% coverage

Rmet_4664 phosphoryl transfer system protein HPr from Ralstonia metallidurans CH34
35% identity, 91% coverage

DHAM_PROSM / P0DN88 PEP-dependent dihydroxyacetone kinase, phosphoryl donor subunit DhaM; Dihydroxyacetone kinase subunit M; EC 2.7.1.121 from Providencia stuartii (strain MRSN 2154) (see 2 papers)
33% identity, 18% coverage

msl5292 phosphocarrier protein HP from Mesorhizobium loti MAFF303099
40% identity, 85% coverage

Dpo_5c02520 HPr family phosphocarrier protein from Desulfotignum phosphitoxidans DSM 13687
28% identity, 91% coverage

RL0032 putative phosphocarrier protein HPr from Rhizobium leguminosarum bv. viciae 3841
44% identity, 88% coverage

U876_06295 HPr family phosphocarrier protein from Aeromonas hydrophila NJ-35
34% identity, 93% coverage

H16_A2203 phosphotransferase system, HPr-related proteins from Ralstonia eutropha H16
32% identity, 86% coverage

CTK_C17590 HPr family phosphocarrier protein from Clostridium tyrobutyricum
34% identity, 92% coverage

HVO_2101 phosphocarrier protein HPr from Haloferax volcanii DS2
35% identity, 87% coverage

ptsH / AAC45390.1 HPr from Lactobacillus sakei (see paper)
O07125 Phosphocarrier protein HPr from Latilactobacillus sakei
LSA1463 Phosphocarrier protein HPr (Histidine-containing protein) from Lactobacillus sakei subsp. sakei 23K
43% identity, 73% coverage

PTHP_BACSU / P08877 Phosphocarrier protein HPr; Histidine-containing protein from Bacillus subtilis (strain 168) (see paper)
NP_389273 histidine-containing phosphocarrier protein of the phosphotransferase system (PTS) (HPr protein) from Bacillus subtilis subsp. subtilis str. 168
BSU13900 phosphocarrier protein HPr from Bacillus subtilis subsp. subtilis str. 168
36% identity, 87% coverage

RL2903 putative multiphosphoryl transfer/phosphocarrier protein from Rhizobium leguminosarum bv. viciae 3841
34% identity, 72% coverage

SACE_6246 phosphoenolpyruvate-protein phosphotransferase from Saccharopolyspora erythraea NRRL 2338
30% identity, 38% coverage

LF82_722 phosphoenolpyruvate--protein phosphotransferase from Escherichia coli LF82
28% identity, 10% coverage

CBO2398 phosphocarrier protein HPr from Clostridium botulinum A str. ATCC 3502
30% identity, 92% coverage

F502_04697 HPr family phosphocarrier protein from Clostridium pasteurianum DSM 525 = ATCC 6013
32% identity, 91% coverage

STY4004 putative PTS system phosphocarrier protein from Salmonella enterica subsp. enterica serovar Typhi str. CT18
STM3779 putative phosphotransferase system, HPr-related protein from Salmonella typhimurium LT2
t3739 putative phosphocarrier protein of PTS system from Salmonella enterica subsp. enterica serovar Typhi Ty2
31% identity, 98% coverage

L21SP2_2173 HPr family phosphocarrier protein from Salinispira pacifica
30% identity, 98% coverage

CPn0037 PTS Phosphocarrier Protein Hpr from Chlamydophila pneumoniae CWL029
28% identity, 80% coverage

VP0795 phosphocarrier protein HPr from Vibrio parahaemolyticus RIMD 2210633
Q87RJ9 Phosphocarrier protein HPr from Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633)
WU75_19555 HPr family phosphocarrier protein from Vibrio parahaemolyticus
33% identity, 92% coverage

A0J47_RS08260 fused PTS fructose transporter subunit IIA/HPr protein from Photobacterium damselae subsp. damselae
37% identity, 22% coverage

CPE1669 phosphocarrier protein from Clostridium perfringens str. 13
30% identity, 93% coverage

AO353_15995 trehalose-specific PTS system, I, HPr, and IIA components from Pseudomonas fluorescens FW300-N2E3
32% identity, 10% coverage

lp_1273 phosphocarrier protein Hpr from Lactobacillus plantarum WCFS1
40% identity, 73% coverage

TC0614 phosphocarrier protein HPr from Chlamydia muridarum Nigg
30% identity, 77% coverage

BCI_0069 phosphocarrier protein HPr from Baumannia cicadellinicola str. Hc (Homalodisca coagulata)
30% identity, 93% coverage

SSA_0772 Histidine-containing phosphocarrier protein of the PTS, putative from Streptococcus sanguinis SK36
44% identity, 62% coverage

VC0966 phosphocarrier protein HPr from Vibrio cholerae O1 biovar eltor str. N16961
31% identity, 93% coverage

B7HN27 Phosphocarrier protein HPr from Bacillus cereus (strain AH187)
33% identity, 90% coverage

VAS14_16916 HPr family phosphocarrier protein from Photobacterium angustum S14
VAS14_16916 phosphocarrier protein HPr from Vibrio angustum S14
33% identity, 91% coverage

SPD_1040 phosphocarrier protein HPr from Streptococcus pneumoniae D39
44% identity, 62% coverage

FN1794 Phosphocarrier protein HPr from Fusobacterium nucleatum subsp. nucleatum ATCC 25586
31% identity, 93% coverage

STER_1243, STER_RS06135, T303_07270 phosphocarrier protein HPr from Streptococcus thermophilus ASCC 1275
42% identity, 67% coverage

BBMN68_1152 HPr family phosphocarrier protein from Bifidobacterium longum subsp. longum BBMN68
36% identity, 74% coverage

M5005_Spy_1121 phosphocarrier protein HPr from Streptococcus pyogenes MGAS5005
44% identity, 62% coverage

Q9ZAD9 Phosphocarrier protein HPr from Lactococcus lactis subsp. cremoris
36% identity, 91% coverage

WP_010370017 phosphocarrier protein HPr from Weissella cibaria
42% identity, 64% coverage

DQM59_RS07180, SMU_674 phosphocarrier protein HPr from Streptococcus mutans
44% identity, 62% coverage

PTHP_HALVD / D4GYE3 Phosphocarrier protein HPr from Haloferax volcanii (strain ATCC 29605 / DSM 3757 / JCM 8879 / NBRC 14742 / NCIMB 2012 / VKM B-1768 / DS2) (Halobacterium volcanii) (see paper)
HVO_1497 phosphocarrier protein Hpr from Haloferax volcanii DS2
35% identity, 85% coverage

PTFAH_VIBCH / Q9KM70 Multiphosphoryl transfer protein; MTP from Vibrio cholerae serotype O1 (strain ATCC 39315 / El Tor Inaba N16961) (see 2 papers)
VCA0518 PTS system, fructose-specific IIA/FPR component from Vibrio cholerae O1 biovar eltor str. N16961
34% identity, 21% coverage

GJQ69_00385 HPr family phosphocarrier protein from Caproicibacterium lactatifermentans
33% identity, 98% coverage

M5595_19190 HPr family phosphocarrier protein from Eubacterium limosum
28% identity, 96% coverage

LSEI_1760 Phosphotransferase system, HPr-related protein from Lactobacillus casei ATCC 334
LGG_01821 PTS phosphocarrier protein Hpr (histidine containing protein) from Lactobacillus rhamnosus GG
LCAKO_1965, LGG_01821 phosphocarrier protein HPr from Lacticaseibacillus rhamnosus GG
39% identity, 75% coverage

PTHP_BACTI / Q9F166 Phosphocarrier protein HPr from Bacillus thuringiensis subsp. israelensis (see 2 papers)
BC4049 Phosphocarrier protein HPr from Bacillus cereus ATCC 14579
BAS3959 phosphocarrier protein HPr from Bacillus anthracis str. Sterne
BC_4049, BTF1_18560 phosphocarrier protein HPr from Bacillus thuringiensis HD-789
34% identity, 84% coverage

P23388 Multiphosphoryl transfer protein from Rhodobacter capsulatus
34% identity, 7% coverage

AH68_00945 HPr family phosphocarrier protein from Bifidobacterium catenulatum PV20-2
35% identity, 92% coverage

PTFAH_HAEIN / P44715 Multiphosphoryl transfer protein; MTP; Diphosphoryl transfer protein; DTP; Phosphotransferase FPr protein; Pseudo-HPr from Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) (see paper)
HI0448 PTS system, fructose-specific IIA/FPr component (fruB) from Haemophilus influenzae Rd KW20
33% identity, 17% coverage

A8AYH5 Phosphocarrier protein HPr from Streptococcus gordonii (strain Challis / ATCC 35105 / BCRC 15272 / CH1 / DL1 / V288)
SGO_1556 phosphocarrier protein HPr from Streptococcus gordonii str. Challis substr. CH1
SGO_RS07625 phosphocarrier protein HPr from Streptococcus gordonii str. Challis substr. CH1
45% identity, 57% coverage

PFLU5027 putative multiphosphoryl transfer protein from Pseudomonas fluorescens SBW25
34% identity, 8% coverage

New Search

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 793,807 different protein sequences to 1,259,118 scientific articles. Searches against EuropePMC were last performed on March 13 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.

PaperBLAST also incorporates manually-curated protein functions:

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

Changes to PaperBLAST since the paper was written:

Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's web site and is available in PubmedCentral or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keeseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory