PaperBLAST – Find papers about a protein or its homologs

 

PaperBLAST

PaperBLAST Hits for TCDB::Q9X053 Periplasmic binding protein/LacI transcriptional regulator, component of D-ribose porter (Nanavati et al., 2006). Induced by ribose (Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099)) (323 a.a., MKKSLFVVLV...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmenbrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Fitness BLAST: loading...

Found 250 similar proteins in the literature:

TC 3.A.1.2.19 / Q9X053 Periplasmic binding protein/LacI transcriptional regulator, component of D-ribose porter (Nanavati et al., 2006). Induced by ribose from Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099)
TM0958 ribose ABC transporter, periplasmic ribose-binding protein from Thermotoga maritima MSB8
TM_0958 D-ribose ABC transporter substrate-binding protein from Thermotoga maritima MSB8
100% identity, 100% coverage

CTN_1618 Ribose ABC transporter, periplasmic ribose-binding protein from Thermotoga neapolitana DSM 4359
100% identity, 99% coverage

2fn8A / Q9X053 Thermotoga maritima ribose binding protein ribose bound form (see paper)
100% identity, 90% coverage

TTE0206 Periplasmic sugar-binding proteins from Thermoanaerobacter tengcongensis MB4
42% identity, 97% coverage

2ioyA / Q8RD41 Crystal structure of thermoanaerobacter tengcongensis ribose binding protein (see paper)
45% identity, 87% coverage

APIBP_PARG4 / B1G898 D-apiose import binding protein; D-apiose binding SBP from Paraburkholderia graminis (strain ATCC 700544 / DSM 17151 / LMG 18924 / NCIMB 13744 / C4D1M) (see paper)
42% identity, 97% coverage

5ibqA / Q2JZQ5 Crystal structure of an abc solute binding protein from rhizobium etli cfn 42 (rhe_pf00037,target efi-511357) in complex with alpha-d-apiose
45% identity, 82% coverage

APIBP_RHIEC / Q2JZQ5 D-apiose import binding protein; D-apiose binding SBP from Rhizobium etli (strain ATCC 51251 / DSM 11541 / JCM 21823 / NBRC 15573 / CFN 42) (see paper)
45% identity, 82% coverage

AOT13_01795 ribose ABC transporter substrate-binding protein RbsB from Parageobacillus thermoglucosidasius
40% identity, 94% coverage

VPA1084 ribose ABC transporter, periplasmic D-ribose-binding protein from Vibrio parahaemolyticus RIMD 2210633
A79_4530 D-ribose-binding periplasmic protein from Vibrio parahaemolyticus AQ3810
40% identity, 96% coverage

DDA3937_RS00045 ribose ABC transporter substrate-binding protein RbsB from Dickeya dadantii 3937
40% identity, 87% coverage

B5S52_21960 ribose ABC transporter substrate-binding protein RbsB from Pectobacterium brasiliense
39% identity, 87% coverage

HM1_2420 ribose abc transporter, ribose-binding protein from Heliobacterium modesticaldum Ice1
38% identity, 95% coverage

ECA_RS00065 ribose ABC transporter substrate-binding protein RbsB from Pectobacterium atrosepticum SCRI1043
39% identity, 87% coverage

TC 3.A.1.2.11 / Q92WK4 EryG aka RB0335, component of The erythritol permease, EryEFG (Geddes et al., 2010) (probably orthologous to 3.A.1.2.16) from Rhizobium meliloti (Sinorhizobium meliloti) (see paper)
41% identity, 89% coverage

APIBP_RHIR8 / B9JK76 D-apiose import binding protein; D-apiose binding SBP from Rhizobium rhizogenes (strain K84 / ATCC BAA-868) (Agrobacterium radiobacter) (see paper)
41% identity, 90% coverage

Entcl_4402 ribose ABC transporter substrate-binding protein RbsB from [Enterobacter] lignolyticus SCF1
40% identity, 87% coverage

APIBP_ACTSZ / A6VKQ8 D-apiose import binding protein; D-apiose binding SBP from Actinobacillus succinogenes (strain ATCC 55618 / DSM 22257 / CCUG 43843 / 130Z) (see paper)
42% identity, 81% coverage

STM3884 ABC superfamily (peri_perm), D-ribose transport protein from Salmonella typhimurium LT2
P0A2C6 Ribose import binding protein RbsB from Salmonella typhi
40% identity, 87% coverage

Q8XAW6 D-ribose transporter subunit RbsB from Escherichia coli O157:H7
Z5252 D-ribose periplasmic binding protein from Escherichia coli O157:H7 EDL933
40% identity, 87% coverage

GM298_22990 ribose ABC transporter substrate-binding protein RbsB from Enterobacter sp. HSTU-ASh6
40% identity, 87% coverage

VCA0130 ribose ABC transporter, periplasmic D-ribose-binding protein from Vibrio cholerae O1 biovar eltor str. N16961
36% identity, 96% coverage

RbsB / b3751 ribose ABC transporter periplasmic binding protein from Escherichia coli K-12 substr. MG1655 (see 25 papers)
RbsB / P02925 ribose ABC transporter periplasmic binding protein from Escherichia coli (strain K12) (see 25 papers)
RBSB_ECOLI / P02925 Ribose import binding protein RbsB from Escherichia coli (strain K12) (see 5 papers)
TC 3.A.1.2.1 / P02925 RbsB aka RBSP aka PRLB aka B3751, component of Ribose porter from Escherichia coli (see 11 papers)
rbsB / GB|AAC76774.1 D-ribose-binding periplasmic protein; EC 3.6.3.17 from Escherichia coli K12 (see 11 papers)
b3751 D-ribose transporter subunit from Escherichia coli str. K-12 substr. MG1655
NP_418207 ribose ABC transporter periplasmic binding protein from Escherichia coli str. K-12 substr. MG1655
40% identity, 87% coverage

1dbpA / P02925 Identical mutations at corresponding positions in two homologous proteins with non-identical effects (see paper)
40% identity, 87% coverage

RBAM_RS16755 ribose ABC transporter substrate-binding protein RbsB from Bacillus velezensis FZB42
37% identity, 91% coverage

HMPREF0389_00295 ribose ABC transporter substrate-binding protein RbsB from Filifactor alocis ATCC 35896
41% identity, 88% coverage

RBSB_BACSU / P36949 Ribose import binding protein RbsB from Bacillus subtilis (strain 168) (see 2 papers)
BSU35960 ribose ABC transporter (ribose-binding lipoprotein) from Bacillus subtilis subsp. subtilis str. 168
36% identity, 91% coverage

TC 3.A.1.2.13 / A6VKT0 RbsB, component of The probable autoinducer-2 (AI-2;, a furanosyl borate diester: 3aS,6S,6aR)-2,2,6,6a-tetrahydroxy-3a-methyltetrahydrofuro[3,2-d][1,3,2]dioxaborolan-2-uide) uptake porter (Shao et al., 2007) (50-70% identical to RbsABC of E. coli; TC# 3.A.1.2.1) from Actinobacillus succinogenes (strain ATCC 55618 / 130Z)
37% identity, 95% coverage

NTHI0632 D-ribose transporter subunit RbsB from Haemophilus influenzae 86-028NP
38% identity, 95% coverage

4zjpA / A6VKT0 Structure of an abc-transporter solute binding protein (sbp_ipr025997) from actinobacillus succinogenes (asuc_0197, target efi-511067) with bound beta-d-ribopyranose
38% identity, 85% coverage

HI0504 D-ribose ABC transporter, periplasmic-binding protein (rbsB) from Haemophilus influenzae Rd KW20
38% identity, 95% coverage

TC 3.A.1.2.16 / Q1M4Q9 Periplasmic erythritol binding protein, component of The erythritol uptake permease, EryEFG (Yost et al., 2006) (probably orthologous to 3.A.1.2.11) from Rhizobium leguminosarum bv. viciae (strain 3841) (see paper)
pRL120200 putative substrate binding component of ABC transporter from Rhizobium leguminosarum bv. viciae 3841
38% identity, 89% coverage

VK055_1936 D-ribose ABC transporter substrate-binding protein from Klebsiella pneumoniae subsp. pneumoniae
37% identity, 92% coverage

eryG / Q2YIY2 putative erythritol ABC transporter substrate-binding protein from Brucella abortus (strain 2308) (see 3 papers)
BAW_20365 D-ribose ABC transporter substrate-binding protein from Brucella abortus
BAB2_0377 Periplasmic binding protein/LacI transcriptional regulator from Brucella melitensis biovar Abortus 2308
BruAb2_0373 RbsB-2, ribose ABC transporter, periplasmic D-ribose-binding protein from Brucella abortus biovar 1 str. 9-941
37% identity, 89% coverage

BRA0858 ribose ABC transporter, periplasmic D-ribose-binding protein from Brucella suis 1330
BCAN_B0875 rhizopine-binding protein precursor from Brucella canis ATCC 23365
37% identity, 89% coverage

LBA1481 D-ribose-binding protein precursor from Lactobacillus acidophilus NCFM
37% identity, 94% coverage

SXYL_01518 D-ribose ABC transporter substrate-binding protein from Staphylococcus xylosus
39% identity, 81% coverage

BA0669 ribose ABC transporter, ribose-binding protein from Bacillus anthracis str. Ames
36% identity, 96% coverage

Bbr_1417 D-ribose ABC transporter substrate-binding protein from Bifidobacterium breve UCC2003
34% identity, 98% coverage

7e7mC / Q8E283 Crystal structure analysis of the streptococcus agalactiae ribose binding protein rbsb
36% identity, 88% coverage

SAK_0166 ribose ABC transporter, ribose-binding protein from Streptococcus agalactiae A909
36% identity, 88% coverage

HMPREF0424_1336 periplasmic binding protein and sugar binding domain of the LacI family protein from Gardnerella vaginalis 409-05
38% identity, 88% coverage

SAG0114 ribose ABC transporter, periplasmic D-ribose-binding protein from Streptococcus agalactiae 2603V/R
36% identity, 88% coverage

SAN_0145 ribose ABC transporter, periplasmic D-ribose-binding protein from Streptococcus agalactiae COH1
35% identity, 88% coverage

gbs0113 Unknown from Streptococcus agalactiae NEM316
35% identity, 88% coverage

CKS_3355 D-ribose ABC transporter substrate-binding protein from Pantoea stewartii subsp. stewartii DC283
33% identity, 90% coverage

BMEII0435 D-RIBOSE-BINDING PERIPLASMIC PROTEIN PRECURSOR from Brucella melitensis 16M
37% identity, 85% coverage

LLNZ_RS04085 substrate-binding domain-containing protein from Lactococcus cremoris subsp. cremoris NZ9000
llmg_0789 ribose ABC transporter substrate binding protein RbsB from Lactococcus lactis subsp. cremoris MG1363
33% identity, 87% coverage

SCO2747 bifunctional carbohydrate binding and transport protein from Streptomyces coelicolor A3(2)
36% identity, 39% coverage

OKIT_0347 substrate-binding domain-containing protein from Oenococcus kitaharae DSM 17330
35% identity, 81% coverage

cg1413 secreted sugar-binding protein from Corynebacterium glutamicum ATCC 13032
34% identity, 85% coverage

MSMEG_3095 D-ribose-binding periplasmic protein from Mycobacterium smegmatis str. MC2 155
33% identity, 92% coverage

SMb20856 putative sugar uptake ABC transporter periplasmic solute-binding protein precursor from Sinorhizobium meliloti 1021
32% identity, 98% coverage

B2904_orf2673 sugar ABC transporter substrate-binding protein from Brachyspira pilosicoli B2904
30% identity, 98% coverage

KP1_1422 putative rhizopine uptake ABC transport system periplasmic solute-binding protein precursor from Klebsiella pneumoniae NTUH-K2044
30% identity, 89% coverage

4ry9A / A1WJM2 Crystal structure of carbohydrate transporter solute binding protein veis_2079 from verminephrobacter eiseniae ef01-2, target efi-511009, a complex with d-talitol
29% identity, 88% coverage

C9ZD81 Putative secreted solute binding protein from Streptomyces scabiei (strain 87.22)
31% identity, 81% coverage

A0R67_09330 sugar ABC transporter substrate-binding protein from Pasteurella multocida subsp. multocida
30% identity, 90% coverage

SMb21345 putative sugar uptake ABC transporter periplasmic solute-binding protein precursor from Sinorhizobium meliloti 1021
33% identity, 87% coverage

4irxA / A0A0H3C834 Crystal structure of caulobacter myo-inositol binding protein bound to myo-inositol (see paper)
30% identity, 90% coverage

CD0300 D-ribose ABC transporter, substrate-binding protein from Clostridium difficile 630
30% identity, 86% coverage

4yo7A / Q9KAG4 Crystal structure of an abc transporter solute binding protein (ipr025997) from bacillus halodurans c-125 (bh2323, target efi- 511484) with bound myo-inositol
31% identity, 89% coverage

GK1896 sugar ABC transporter (sugar-binding protein) from Geobacillus kaustophilus HTA426
30% identity, 93% coverage

RHE_CH01210 probable sugar ABC transporter, substrate-binding protein from Rhizobium etli CFN 42
34% identity, 79% coverage

PFLU3996 putative sugar-binding exported protein from Pseudomonas fluorescens SBW25
32% identity, 83% coverage

TC 3.A.1.2.21 / B8H228 Inositol ABC transporter, periplasmic inositol-binding protein IbpA, component of The myoinositol (high affinity)/ D-ribose (low affinity) transporter IatP/IatA/IbpA. The structure of IbpA with myoinositol bound has been solved from Caulobacter crescentus (strain NA1000 / CB15N)
Q9A9V2 Sugar ABC transporter, periplasmic sugar-binding protein from Caulobacter vibrioides (strain ATCC 19089 / CIP 103742 / CB 15)
CC0859, CC_0859 sugar ABC transporter, periplasmic sugar-binding protein from Caulobacter crescentus CB15
29% identity, 86% coverage

RALBP_PSEAE / Q9I2F8 D-ribose/D-allose-binding protein from Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) (see paper)
PA1946 binding protein component precursor of ABC ribose transporter from Pseudomonas aeruginosa PAO1
29% identity, 89% coverage

PSPTO_2367 ribose ABC transporter, periplasmic ribose-binding protein from Pseudomonas syringae pv. tomato str. DC3000
28% identity, 89% coverage

5dteB / A6VKG5 Crystal structure of an abc transporter periplasmic solute binding protein (ipr025997) from actinobacillus succinogenes 130z(asuc_0081, target efi-511065) with bound d-allose
31% identity, 80% coverage

YPTB3231 putative sugar ABC transporter periplasmic binding protein from Yersinia pseudotuberculosis IP 32953
28% identity, 94% coverage

Pf1N1B4_6035 D-ribose ABC transporter, substrate-binding component RbsB from Pseudomonas fluorescens FW300-N1B4
28% identity, 89% coverage

YPO2501 sugar binding protein precursor from Yersinia pestis CO92
YPK_1611 monosaccharide-transporting ATPase from Yersinia pseudotuberculosis YPIII
32% identity, 86% coverage

MSMEG_4172 D-ribose-binding periplasmic protein RbsB from Mycobacterium smegmatis str. MC2 155
29% identity, 76% coverage

y3346 periplasmic binding protein for ABC transporter from Yersinia pestis KIM
YPO0959 putative sugar ABC transporter periplasmic binding protein from Yersinia pestis CO92
27% identity, 94% coverage

HSERO_RS11480 D-ribose ABC transporter, substrate-binding component RbsB from Herbaspirillum seropedicae SmR1
29% identity, 93% coverage

Shewana3_2073 L-arabinose ABC transporter, substrate-binding component AraU from Shewanella sp. ANA-3
Shewana3_2073 periplasmic binding protein/LacI transcriptional regulator from Shewanella sp ANA-3
33% identity, 83% coverage

5ocpA / A0KWY4 The periplasmic binding protein component of the arabinose abc transporter from shewanella sp. Ana-3 bound to alpha and beta-l- arabinofuranose
33% identity, 83% coverage

Psyr_3265 Periplasmic binding protein/LacI transcriptional regulator from Pseudomonas syringae pv. syringae B728a
27% identity, 97% coverage

NP_437959 putative rhizopine uptake ABC transporter periplasmic solute-binding protein precursor from Sinorhizobium meliloti 1021
SMb20712 sugar ABC transporter substrate-binding protein from Sinorhizobium meliloti 1021
31% identity, 88% coverage

PFLU3117 putative sugar ABC transport system, lipoprotein from Pseudomonas fluorescens SBW25
29% identity, 99% coverage

PP_2454 ribose ABC transporter, periplasmic ribose-binding protein from Pseudomonas putida KT2440
27% identity, 87% coverage

ECOLIN_RS18690 substrate-binding domain-containing protein from Escherichia coli Nissle 1917
30% identity, 91% coverage

c4017 Putative ribose ABC transporter from Escherichia coli CFT073
30% identity, 91% coverage

SSA_0215 Periplasmic sugar-binding protein (ribose porter), putative from Streptococcus sanguinis SK36
27% identity, 88% coverage

Pden_4851 periplasmic binding protein/LacI transcriptional regulator from Paracoccus denitrificans PD1222
32% identity, 81% coverage

6hbdA / A0QT50 Crystal structure of msmeg_1712 from mycobacterium smegmatis in complex with beta-d-galactofuranose (see paper)
31% identity, 69% coverage

PSPTO_3049 sugar ABC transporter, periplasmic sugar-binding, putative from Pseudomonas syringae pv. tomato str. DC3000
34% identity, 81% coverage

MSMEG_1712 ABC transporter periplasmic-binding protein YtfQ from Mycobacterium smegmatis str. MC2 155
31% identity, 68% coverage

W1DEX1 Ribose ABC transport system, periplasmic ribose-binding protein RbsB (TC 3.A.1.2.1) from Klebsiella pneumoniae IS43
37% identity, 46% coverage

BPSL1793 putative sugar-binding exported protein from Burkholderia pseudomallei K96243
27% identity, 88% coverage

CAC1453 Ribose ABC transporter (ribose-binding periplasmic component) from Clostridium acetobutylicum ATCC 824
27% identity, 97% coverage

4rs3A / A0QYB3 Crystal structure of carbohydrate transporter a0qyb3 from mycobacterium smegmatis str. Mc2 155, target efi-510969, in complex with xylitol (see paper)
32% identity, 76% coverage

XYPA_MYCS2 / A0QYB3 Xylitol-binding protein from Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) (Mycobacterium smegmatis) (see paper)
MSMEG_3598 periplasmic sugar-binding proteins from Mycobacterium smegmatis str. MC2 155
32% identity, 70% coverage

THPA_MYCS2 / A0QYB5 D-threitol-binding protein from Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) (Mycobacterium smegmatis) (see paper)
MSMEG_3599 sugar-binding transcriptional regulator, LacI family protein from Mycobacterium smegmatis str. MC2 155
MSMEG_3599 sugar ABC transporter xylitol/D-threitol-binding protein ThpA from Mycolicibacterium smegmatis MC2 155
32% identity, 71% coverage

4rsmA / A0QYB5 Crystal structure of carbohydrate transporter msmeg_3599 from mycobacterium smegmatis str. Mc2 155, target efi-510970, in complex with d-threitol (see paper)
32% identity, 77% coverage

BCAL1657 putative ribose transport system, substrate-binding protein from Burkholderia cenocepacia J2315
28% identity, 88% coverage

SMb21377 putative sugar uptake ABC transporter periplasmic solute-binding protein precursor from Sinorhizobium meliloti 1021
30% identity, 81% coverage

PSPTO_3490 sugar ABC transporter, periplasmic sugar-binding protein from Pseudomonas syringae pv. tomato str. DC3000
26% identity, 98% coverage

YP_3140 putative periplasmic protein precursor from Yersinia pestis biovar Medievalis str. 91001
32% identity, 69% coverage

PS417_11885 Inositol transport system sugar-binding protein from Pseudomonas simiae WCS417
28% identity, 88% coverage

Z5689 putative periplasmic ribose-binding protein of ABC transport system from Escherichia coli O157:H7 EDL933
30% identity, 89% coverage

BPSS0140 putative sugar ABC transport system, lipoprotein from Burkholderia pseudomallei K96243
27% identity, 90% coverage

2vk2A / P39325 Crystal structure of a galactofuranose binding protein (see paper)
30% identity, 80% coverage

ECs5205 putative LACI-type transcriptional regulator from Escherichia coli O157:H7 str. Sakai
Z5838 putative LACI-type transcriptional regulator from Escherichia coli O157:H7 EDL933
30% identity, 80% coverage

YtfQ / b4227 galactofuranose ABC transporter periplasmic binding protein (EC 7.5.2.9) from Escherichia coli K-12 substr. MG1655 (see 3 papers)
YtfQ / P39325 galactofuranose ABC transporter periplasmic binding protein (EC 7.5.2.9) from Escherichia coli (strain K12) (see 3 papers)
YTFQ_ECOLI / P39325 Galactofuranose-binding protein YtfQ from Escherichia coli (strain K12) (see paper)
P39325 ABC-type D-galactofuranose transporter (EC 7.5.2.9) from Escherichia coli (see paper)
NP_418648 galactofuranose ABC transporter periplasmic binding protein from Escherichia coli str. K-12 substr. MG1655
b4227 predicted sugar transporter subunit: periplasmic-binding component of ABC superfamily from Escherichia coli str. K-12 substr. MG1655
30% identity, 80% coverage

Entcl_4082 ABC transporter substrate-binding protein from [Enterobacter] lignolyticus SCF1
30% identity, 89% coverage

AT5A_00215 galactofuranose ABC transporter, galactofuranose-binding protein YtfQ from Agrobacterium tumefaciens 5A
28% identity, 96% coverage

J2ZWP7 ABC-type sugar transport system, periplasmic component from Pseudomonas sp. GM48
27% identity, 88% coverage

B21_04060 galactofuranose ABC transporter substrate-binding protein YtfQ from Escherichia coli BL21(DE3)
30% identity, 80% coverage

SACE_0943 binding protein/LacI transcriptional regulator from Saccharopolyspora erythraea NRRL 2338
29% identity, 87% coverage

PS417_18405 D-ribose ABC transporter, substrate-binding component RbsB from Pseudomonas simiae WCS417
28% identity, 89% coverage

Entcl_4175 sugar ABC transporter substrate-binding protein from [Enterobacter] lignolyticus SCF1
27% identity, 95% coverage

C9ZDX0 Putative secreted solute-binding protein from Streptomyces scabiei (strain 87.22)
30% identity, 69% coverage

A0R773 Sugar ABC transporter substrate-binding protein from Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155)
MSMEG_6804 sugar ABC transporter substrate-binding protein from Mycobacterium smegmatis str. MC2 155
MSMEG_6804 substrate-binding domain-containing protein from Mycolicibacterium smegmatis MC2 155
29% identity, 76% coverage

PFLU2583 putative rhizopine-binding ABC transporter protein from Pseudomonas fluorescens SBW25
PFLU_2583 sugar ABC transporter substrate-binding protein from Pseudomonas [fluorescens] SBW25
27% identity, 96% coverage

PFLU3727 putative D-ribose-binding protein component of ABC transporter from Pseudomonas fluorescens SBW25
29% identity, 88% coverage

Q1MM20 Solute-binding component of ABC transporter from Rhizobium johnstonii (strain DSM 114642 / LMG 32736 / 3841)
RL0489 putative solute-binding component of ABC transporter from Rhizobium leguminosarum bv. viciae 3841
28% identity, 85% coverage

Atu0063 ABC transporter, substrate binding protein (sugar) from Agrobacterium tumefaciens str. C58 (Cereon)
28% identity, 78% coverage

CTN_0364 putative periplasmic binding protein from Thermotoga neapolitana DSM 4359
29% identity, 72% coverage

RL2376 putative periplasmic-binding protein component of ABC transporter from Rhizobium leguminosarum bv. viciae 3841
29% identity, 78% coverage

FTV88_0053 substrate-binding domain-containing protein from Heliorestis convoluta
29% identity, 83% coverage

RL4655 putative rhizopine-binding ABC transporter protein from Rhizobium leguminosarum bv. viciae 3841
28% identity, 88% coverage

CPE1627 probable ribose ABC transporter from Clostridium perfringens str. 13
36% identity, 41% coverage

RL4172 putative solute-binding component of ABC transporter from Rhizobium leguminosarum bv. viciae 3841
26% identity, 93% coverage

AlsB / b4088 D-allose ABC transporter periplasmic binding protein (EC 7.5.2.8) from Escherichia coli K-12 substr. MG1655 (see 5 papers)
AlsB / P39265 D-allose ABC transporter periplasmic binding protein (EC 7.5.2.8) from Escherichia coli (strain K12) (see 5 papers)
ALSB_ECOLI / P39265 D-allose-binding periplasmic protein; ALBP from Escherichia coli (strain K12) (see paper)
P39265 ABC-type D-allose transporter (EC 7.5.2.8) from Escherichia coli (see paper)
TC 3.A.1.2.6 / P39265 AlsB aka B4088, component of D-allose porter from Escherichia coli (see 6 papers)
alsB / GB|AAC77049.1 D-allose-binding periplasmic protein; EC 3.6.3.17 from Escherichia coli K12 (see 6 papers)
b4088 D-allose transporter subunit from Escherichia coli str. K-12 substr. MG1655
30% identity, 82% coverage

YPK_3045 monosaccharide-transporting ATPase from Yersinia pseudotuberculosis YPIII
25% identity, 97% coverage

B9Q16_23105 substrate-binding domain-containing protein from Pantoea ananatis
25% identity, 87% coverage

Atu3063 ABC transporter, nucleotide binding/ATPase protein from Agrobacterium tumefaciens str. C58 (Cereon)
28% identity, 95% coverage

SMb20671 putative sugar uptake ABC transporter periplasmic solute-binding protein precursor from Sinorhizobium meliloti 1021
28% identity, 88% coverage

MSMEG_1374 ribose ABC transporter, periplasmic binding protein from Mycobacterium smegmatis str. MC2 155
27% identity, 81% coverage

YP1134 putative sugar-binding periplasmic protein from Yersinia pestis biovar Medievalis str. 91001
25% identity, 97% coverage

MHJ_0227 periplasmic sugar-binding proteins from Mycoplasma hyopneumoniae J
Q4AAA3 Periplasmic sugar-binding protein from Mesomycoplasma hyopneumoniae (strain J / ATCC 25934 / NCTC 10110)
mhp145 putative D-ribose-binding protein mutant from Mycoplasma hyopneumoniae 232
MHP7448_0234 periplasmic sugar-binding proteins from Mycoplasma hyopneumoniae 7448
27% identity, 73% coverage

H16_B1500 ABC-type transporter, periplasmic component: CUT2 family, fructose specific from Ralstonia eutropha H16
29% identity, 90% coverage

CAC1351 Periplasmic sugar-binding protein from Clostridium acetobutylicum ATCC 824
CEA_G1365 sugar ABC transporter substrate-binding protein from Clostridium acetobutylicum EA 2018
27% identity, 78% coverage

H16_B1500 ABC transporter substrate-binding protein from Cupriavidus necator H16
29% identity, 91% coverage

1gudA / P39265 Hinge-bending motion of d-allose binding protein from escherichia coli: three open conformations (see paper)
30% identity, 75% coverage

BCAL1548 putative sugar ABC transport system, lipoprotein from Burkholderia cenocepacia J2315
28% identity, 90% coverage

BCAN_A0557 hypothetical protein from Brucella canis ATCC 23365
25% identity, 88% coverage

SMb21587 putative sugar uptake ABC transporter periplasmic solute-binding protein precursor from Sinorhizobium meliloti 1021
28% identity, 86% coverage

SMc02171 PUTATIVE PERIPLASMIC BINDING ABC TRANSPORTER PROTEIN from Sinorhizobium meliloti 1021
26% identity, 93% coverage

BAB1_1648 Periplasmic binding protein/LacI transcriptional regulator from Brucella melitensis biovar Abortus 2308
30% identity, 86% coverage

A9CF36 ABC transporter, substrate binding protein (Sugar) from Agrobacterium fabrum (strain C58 / ATCC 33970)
29% identity, 80% coverage

HSERO_RS03635 D-mannose ABC transporter, substrate-binding component from Herbaspirillum seropedicae SmR1
28% identity, 90% coverage

C289_0603 sugar-binding protein from Anoxybacillus ayderensis
26% identity, 95% coverage

FRCB_RHIML / Q9F9B2 Fructose import binding protein FrcB from Rhizobium meliloti (Ensifer meliloti) (Sinorhizobium meliloti) (see paper)
TC 3.A.1.2.7 / Q9F9B2 FrcB, component of Fructose/mannose/ribose porter from Rhizobium meliloti (Sinorhizobium meliloti) (see paper)
26% identity, 93% coverage

BR0544 ribose ABC transporter, periplasmic D-ribose-binding protein from Brucella suis 1330
25% identity, 88% coverage

mlr7582 sugar ABC transporter, periplasmic sugar-binding protein from Mesorhizobium loti MAFF303099
28% identity, 73% coverage

PsgRace4_17419 substrate-binding domain-containing protein from Pseudomonas savastanoi pv. glycinea str. race 4
30% identity, 70% coverage

Pden_4771 ABC ribose transporter, periplasmic solute-binding protein from Paracoccus denitrificans PD1222
25% identity, 78% coverage

BOV_0546 ribose ABC transporter, periplasmic D-ribose-binding protein from Brucella ovis ATCC 25840
25% identity, 88% coverage

BCAM0766 D-ribose-binding periplasmic protein precursor from Burkholderia cenocepacia J2315
29% identity, 65% coverage

RbsR / VIMSS66308 RbsR regulator of Ribose utilization, effector Ribose (repressor) from Bacillus halodurans C-125
28% identity, 69% coverage

Q8YIQ0 D-ribose-binding periplasmic protein from Brucella melitensis biotype 1 (strain ATCC 23456 / CCUG 17765 / NCTC 10094 / 16M)
BMEI0393 D-RIBOSE-BINDING PERIPLASMIC PROTEIN PRECURSOR from Brucella melitensis 16M
31% identity, 69% coverage

BOV_1574 sugar-binding transcriptional regulator, LacI family from Brucella ovis ATCC 25840
31% identity, 68% coverage

BCAN_A1667 rhizopine-binding protein precursor from Brucella canis ATCC 23365
BR1630 sugar-binding transcriptional regulator, LacI family from Brucella suis 1330
BME_RS01965 substrate-binding domain-containing protein from Brucella melitensis bv. 1 str. 16M
31% identity, 68% coverage

SCAB_5351 putative substrate-binding component of ABC transporter from Streptomyces scabiei 87.22
C9YUG2 Putative substrate-binding component of ABC transporter from Streptomyces scabiei (strain 87.22)
28% identity, 84% coverage

PS417_12055 xylitol ABC transporter, substrate-binding component from Pseudomonas simiae WCS417
28% identity, 71% coverage

Q2YMQ4 Periplasmic binding protein/LacI transcriptional regulator from Brucella abortus (strain 2308)
Q57EI7 RbsB-1, ribose ABC transporter from Brucella abortus biovar 1 (strain 9-941)
25% identity, 88% coverage

BMEI1390 D-RIBOSE-BINDING PERIPLASMIC PROTEIN PRECURSOR from Brucella melitensis 16M
25% identity, 88% coverage

MSMEG_3999 ABC transporter periplasmic-binding protein YphF from Mycobacterium smegmatis str. MC2 155
26% identity, 93% coverage

BT1754 two-component system sensor histidine kinase/response regulator, hybrid ('one component system') from Bacteroides thetaiotaomicron VPI-5482
27% identity, 29% coverage

RR42_RS03370 fructose ABC transporter, substrate-binding component (FrcB) from Cupriavidus basilensis FW507-4G11
28% identity, 90% coverage

2x7xA / Q8A6X1 Fructose binding periplasmic domain of hybrid two component system bt1754 (see paper)
30% identity, 60% coverage

CMM_0423 putative sugar ABC transporter, binding protein from Clavibacter michiganensis subsp. michiganensis NCPPB 382
28% identity, 70% coverage

Q98CU7 Mll4996 protein from Mesorhizobium japonicum (strain LMG 29417 / CECT 9101 / MAFF 303099)
28% identity, 93% coverage

BACOVA_04496 hypothetical protein from Bacteroides ovatus ATCC 8483
30% identity, 20% coverage

blr3200 ABC transporter sugar-binding protein from Bradyrhizobium japonicum USDA 110
27% identity, 72% coverage

SMb20072 putative rhizopine binding protein from Sinorhizobium meliloti 1021
27% identity, 95% coverage

Ac3H11_3035 Fructose ABC transporter, substrate-binding component FrcB from Acidovorax sp. GW101-3H11
27% identity, 89% coverage

YPO0860 sugar-binding periplasmic protein from Yersinia pestis CO92
28% identity, 89% coverage

SPRI_RS32325 sugar ABC transporter substrate-binding protein from Streptomyces pristinaespiralis
26% identity, 85% coverage

4rxmA Crystal structure of periplasmic abc transporter solute binding protein a7jw62 from mannheimia haemolytica phl213, target efi-511105, in complex with myo-inositol
26% identity, 77% coverage

BPSS0257 putative ribose ABC transport system, substrate-binding exported protein from Burkholderia pseudomallei K96243
27% identity, 70% coverage

PP_2758 ribose ABC transporter, periplasmic ribose-binding protein, putative from Pseudomonas putida KT2440
26% identity, 83% coverage

YphF / b2548 putative ABC transporter periplasmic binding protein YphF from Escherichia coli K-12 substr. MG1655 (see 4 papers)
P77269 ABC transporter periplasmic-binding protein YphF from Escherichia coli (strain K12)
NP_417043 putative ABC transporter periplasmic binding protein YphF from Escherichia coli str. K-12 substr. MG1655
27% identity, 74% coverage

C289_0763 sugar ABC transporter substrate-binding protein from Anoxybacillus ayderensis
28% identity, 70% coverage

A0A140N593 Periplasmic binding protein/LacI transcriptional regulator from Escherichia coli (strain B / BL21-DE3)
27% identity, 74% coverage

c3070 ABC transporter Periplasmic binding protein yphF precursor from Escherichia coli CFT073
27% identity, 74% coverage

FRUE_BIFLO / Q8G848 Fructose import binding protein FruE from Bifidobacterium longum (strain NCC 2705) (see paper)
BLD_0045 ABC-type sugar transport system periplasmic component from Bifidobacterium longum DJO10A
BL0033 probable solute binding protein of ABC transporter system possibly for sugars from Bifidobacterium longum NCC2705
28% identity, 74% coverage

HSERO_RS05260 ABC transporter for L-fucose, substrate-binding component from Herbaspirillum seropedicae SmR1
28% identity, 74% coverage

CTN_0777 Periplasmic binding protein/LacI transcriptional regulator precursor from Thermotoga neapolitana DSM 4359
TRQ7_RS05225 sugar-binding protein from Thermotoga sp. RQ7
26% identity, 83% coverage

CTC_00907 ABC transporter substrate-binding protein from Clostridium tetani E88
Q896U1 D-ribose-binding periplasmic protein from Clostridium tetani (strain Massachusetts / E88)
26% identity, 70% coverage

pRL90085 putative periplasmic substrate binding component of ABC transporter from Rhizobium leguminosarum bv. viciae 3841
26% identity, 87% coverage

4wutA / B9K0B2 Crystal structure of an abc transporter solute binding protein (ipr025997) from agrobacterium vitis (avi_5133, target efi-511220) with bound d-fucose
25% identity, 88% coverage

AW20_2081 ribose operon transcriptional repressor RbsR from Bacillus anthracis str. Sterne
BAS0631 ribose operon repressor from Bacillus anthracis str. Sterne
27% identity, 68% coverage

TC 3.A.1.2.20 / G4FGN5 LacI family transcriptional regulator, component of Glucose porter. Also bind xylose (Boucher and Noll 2011). Induced by glucose (Frock et al. 2012). Directly regulated by glucose-responsive regulator GluR from Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099)
Tmari_1858 sugar-binding protein from Thermotoga maritima MSB8
26% identity, 84% coverage

ML0398 putative D-ribose-binding protein from Mycobacterium leprae TN
25% identity, 90% coverage

KPN_04210 putative LACI-type transcriptional regulator from Klebsiella pneumoniae subsp. pneumoniae MGH 78578
27% identity, 80% coverage

B9K0Q5 ABC-type D-galactose transporter (EC 7.5.2.11) from Agrobacterium vitis (see paper)
Avi_5305 ABC transporter, binding protein from Agrobacterium vitis S4
21% identity, 91% coverage

RbsR / VIMSS358011 RbsR regulator of Ribose utilization, effector Ribose (repressor) from Bacillus cereus ATCC 14579
BC0659 Ribose operon repressor from Bacillus cereus ATCC 14579
25% identity, 86% coverage

EF0172 sugar-binding transcriptional regulator, LacI family from Enterococcus faecalis V583
27% identity, 67% coverage

RT761_02219 sugar ABC transporter substrate-binding protein from Atribacter laminatus
27% identity, 98% coverage

5hqjA / B1G1H7 Crystal structure of abc transporter solute binding protein b1g1h7 from burkholderia graminis c4d1m, target efi-511179, in complex with d-arabinose
26% identity, 87% coverage

B1G1H7 Periplasmic binding protein/LacI transcriptional regulator from Paraburkholderia graminis (strain ATCC 700544 / DSM 17151 / LMG 18924 / NCIMB 13744 / C4D1M)
26% identity, 87% coverage

BBR47_06790 putative ABC transporter substrate binding protein from Brevibacillus brevis NBRC 100599
26% identity, 80% coverage

CTN_0576 Sugar ABC transporter, periplasmic sugar-binding protein from Thermotoga neapolitana DSM 4359
27% identity, 95% coverage

4wt7A / B9K0E0 Crystal structure of an abc transporter solute binding protein (ipr025997) from agrobacterium vitis (avi_5165, target efi-511223) with bound allitol
27% identity, 89% coverage

BCAL1431 putative ribose ABC transport system, substrate-binding exported protein from Burkholderia cenocepacia J2315
25% identity, 72% coverage

CMM_2842 putative sugar ABC transporter, substrate binding protein from Clavibacter michiganensis subsp. michiganensis NCPPB 382
25% identity, 69% coverage

2h3hA / Q9WXW9 Crystal structure of the liganded form of thermotoga maritima glucose binding protein (see paper)
27% identity, 87% coverage

TM0114 sugar ABC transporter, periplasmic sugar-binding protein from Thermotoga maritima MSB8
27% identity, 84% coverage

6gt9A / W8QN64 Crystal structure of ganp, a glucose-galactose binding protein from geobacillus stearothermophilus, in complex with galactose
24% identity, 89% coverage

4y9tA / B9K0Q5 Crystal structure of an abc transporter solute binding protein (ipr025997) from agrobacterium vitis s4 (avi_5305, target efi-511224) with bound alpha-d-glucosamine (see paper)
22% identity, 81% coverage

Gocc_2298 sugar ABC transporter substrate-binding protein from Gaiella occulta
24% identity, 81% coverage

BC2960 Sugar-binding protein from Bacillus cereus ATCC 14579
24% identity, 79% coverage

Atu3487 ABC transporter, substrate binding protein (sugar) from Agrobacterium tumefaciens str. C58 (Cereon)
25% identity, 78% coverage

pRL110413 putative substrate binding protein involved in competition for nodulation from Rhizobium leguminosarum bv. viciae 3841
26% identity, 69% coverage

ruthe_01818 LacI family DNA-binding transcriptional regulator from Rubellimicrobium thermophilum DSM 16684
25% identity, 64% coverage

TC 3.A.1.2.9 / Q7BSH5 RhaS, component of Rhamnose porter (Richardson et al., 2004) (Transport activity is dependent on rhamnokinase (RhaK; AAQ92412) activity (Richardson and Oresnik, 2007) This could be an example of group translocation!) from Rhizobium leguminosarum (biovar trifolii) (see paper)
25% identity, 70% coverage

Avi_5339 ABC transporter substrate binding protein (ribose) from Agrobacterium vitis S4
25% identity, 71% coverage

5dkvA / B9K0T2 Crystal structure of an abc transporter solute binding protein from agrobacterium vitis(avis_5339, target efi-511225) bound with alpha-d- tagatopyranose
25% identity, 72% coverage

SMc02324 PUTATIVE PERIPLASMIC BINDING ABC TRANSPORTER PROTEIN from Sinorhizobium meliloti 1021
23% identity, 74% coverage

Gocc_3071 sugar ABC transporter substrate-binding protein from Gaiella occulta
25% identity, 79% coverage

Swol_0423 putative sugar ABC transporter, substrate-binding protein from Syntrophomonas wolfei subsp. wolfei str. Goettingen
26% identity, 89% coverage

orf7 sugar-binding sensor histidine kinase/response r egulator hybrid from Gramella forsetii KT0803
25% identity, 21% coverage

A6T653 Periplasmic binding protein/LacI transcriptional regulator from Klebsiella pneumoniae subsp. pneumoniae (strain ATCC 700721 / MGH 78578)
KPN_00624 putative periplasmic binding protein/LacI transcriptional regulator from Klebsiella pneumoniae subsp. pneumoniae MGH 78578
28% identity, 68% coverage

VK055_1912 sugar ABC transporter substrate-binding protein from Klebsiella pneumoniae subsp. pneumoniae
28% identity, 68% coverage

2rjoA / B2TEP5 Crystal structure of twin-arginine translocation pathway signal protein from burkholderia phytofirmans
25% identity, 74% coverage

Atu3821 ABC transporter, substrate binding protein (ribose) from Agrobacterium tumefaciens str. C58 (Cereon)
24% identity, 97% coverage

YPTS_1642 hypothetical protein from Yersinia pseudotuberculosis PB1/+
27% identity, 68% coverage

BOV_A0938 ribose ABC transporter, periplasmic D-ribose-binding protein from Brucella ovis ATCC 25840
25% identity, 84% coverage

BRA0996 ribose ABC transporter, periplasmic D-ribose-binding protein from Brucella suis 1330
BCAN_B1015 rhizopine-binding protein precursor from Brucella canis ATCC 23365
25% identity, 78% coverage

NJ56_RS14540 HTH-type transcriptional repressor PurR from Yersinia ruckeri
23% identity, 82% coverage

4pz0A / A0A6H3AKG3 The crystal structure of a solute binding protein from bacillus anthracis str. Ames in complex with quorum-sensing signal autoinducer-2 (ai-2)
23% identity, 72% coverage

RbsR / VIMSS7180207 RbsR regulator of Ribose utilization, effector Ribose (repressor) from Ralstonia pickettii 12J
22% identity, 64% coverage

ESA_RS03670 sugar ABC transporter substrate-binding protein from Cronobacter sakazakii ATCC BAA-894
27% identity, 68% coverage

8fxuA / D9TSJ1 Thermoanaerobacter thermosaccharolyticum periplasmic glucose-binding protein glucose complex: badan conjugate attached at f17c (see paper)
27% identity, 67% coverage

A4XG54 Periplasmic binding protein/LacI transcriptional regulator from Caldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903 / Tp8T 6331)
Csac_0242 periplasmic binding protein/LacI transcriptional regulator from Caldicellulosiruptor saccharolyticus DSM 8903
25% identity, 72% coverage

O68456 D-xylose-binding protein from Thermoanaerobacter ethanolicus
27% identity, 66% coverage

Cbei_2377 periplasmic binding protein/LacI transcriptional regulator from Clostridium beijerincki NCIMB 8052
A6LW07 Periplasmic binding protein/LacI transcriptional regulator from Clostridium beijerinckii (strain ATCC 51743 / NCIMB 8052)
22% identity, 95% coverage

Tcur_1732 transcriptional regulator, LacI family from Thermomonospora curvata DSM 43183
25% identity, 59% coverage

EXUR_BACSU / Q9JMQ1 Probable HTH-type transcriptional repressor ExuR; Probable galacturonate locus repressor from Bacillus subtilis (strain 168) (see paper)
BSU12370 transcriptional regulator (LacI family) from Bacillus subtilis subsp. subtilis str. 168
27% identity, 53% coverage

C0C300 Periplasmic binding protein domain-containing protein from [Clostridium] hylemonae DSM 15053
24% identity, 69% coverage

Cthe_2446 substrate-binding domain-containing protein from Acetivibrio thermocellus DSM 1313
Cthe_2446 ABC-type sugar transport system periplasmic component-like protein from Clostridium thermocellum ATCC 27405
27% identity, 67% coverage

Pat9b_3290 sugar ABC transporter substrate-binding protein from Pantoea sp. At-9b
28% identity, 60% coverage

CPE1341 probable galactoside ABC transporter from Clostridium perfringens str. 13
26% identity, 66% coverage

pRL100415 putative substrate-binding component of ABC transporter from Rhizobium leguminosarum bv. viciae 3841
25% identity, 77% coverage

YPTB1325 ABC transporter, periplasmic sugar-binding protein from Yersinia pseudotuberculosis IP 32953
23% identity, 81% coverage

ManR2 / VIMSS6936097 ManR2 regulator of Mannose utilization; Mannosides utilization, effector Mannose (repressor) from Shewanella amazonensis SB2B
24% identity, 66% coverage

TC 3.A.1.2.24 / O50503 Solute-binding protein, component of XylFGH downstream of characterized transcriptional regulator, ROK7B7 (Sco6008); XylF (Sco6009); XylG (Sco6010); XylH (Sco6011)) from Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145)
SCO6009 solute-binding protein from Streptomyces coelicolor A3(2)
26% identity, 68% coverage

UriR / VIMSS300374 UriR regulator of Pyrimidine utilization; Uridine utilization, effector New effector (repressor) from Corynebacterium efficiens YS-314
24% identity, 67% coverage

3ksmA / Q2S7D2 Crystal structure of abc-type sugar transport system, periplasmic component from hahella chejuensis
28% identity, 68% coverage

OA04_14870 sugar ABC transporter substrate-binding protein from Pectobacterium versatile
24% identity, 67% coverage

ExuR / VIMSS3427925 ExuR regulator of Glucuronate utilization; Galacturonate utilization, effector Glucuronate; Galacturonate (repressor) from Bacillus pumilus SAFR-032
26% identity, 49% coverage

RbsR / VIMSS2859848 RbsR regulator of Ribose utilization, effector Ribose (repressor) from Psychromonas sp. CNPT3
22% identity, 73% coverage

RHE_RS30060 substrate-binding domain-containing protein from Rhizobium etli CFN 42
25% identity, 49% coverage

ScrR / VIMSS1100825 ScrR regulator of Sucrose utilization, effector Sucrose-6-phosphate (repressor) from Lactobacillus sakei subsp. sakei 23K
22% identity, 80% coverage

RbsR / VIMSS547489 RbsR regulator of Ribose utilization, effector Ribose (repressor) from Photorhabdus luminescens TTO1
22% identity, 85% coverage

Q2S7D2 ABC-type sugar transport system, periplasmic component from Hahella chejuensis (strain KCTC 2396)
28% identity, 68% coverage

5ulbA / A1JTX8 Crystal structure of sugar abc transporter from yersinia enterocolitica subsp. Enterocolitica 8081
26% identity, 73% coverage

New Search

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 793,807 different protein sequences to 1,259,118 scientific articles. Searches against EuropePMC were last performed on March 13 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.

PaperBLAST also incorporates manually-curated protein functions:

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

Changes to PaperBLAST since the paper was written:

Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's web site and is available in PubmedCentral or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keeseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory