PaperBLAST – Find papers about a protein or its homologs

 

PaperBLAST

PaperBLAST Hits for reanno::pseudo6_N2E2:Pf6N2E2_5402 ABC transporter for D-Alanine, periplasmic substrate-binding component (Pseudomonas fluorescens FW300-N2E2) (343 a.a., MKLLKSTLAV...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmenbrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Fitness BLAST: loading...

Found 253 similar proteins in the literature:

Pf6N2E2_5402 ABC transporter for D-Alanine, periplasmic substrate-binding component from Pseudomonas fluorescens FW300-N2E2
100% identity, 100% coverage

PFLU_1000 amino acid ABC transporter substrate-binding protein from Pseudomonas [fluorescens] SBW25
93% identity, 100% coverage

Q4KHV4 General L-amino acid ABC transporter, periplasmic L-amino acid-binding protein AapJ from Pseudomonas fluorescens (strain ATCC BAA-477 / NRRL B-23932 / Pf-5)
90% identity, 100% coverage

Psyr_1072 extracellular solute-binding protein, family 3 from Pseudomonas syringae pv. syringae B728a
86% identity, 100% coverage

PSPTO_1255 amino acid ABC transporter, periplasmic amino acid-binding protein from Pseudomonas syringae pv. tomato str. DC3000
85% identity, 100% coverage

PP1297 general amino acid ABC transporter, periplasmic binding protein from Pseudomonas putida KT2440
PP_1297 amino acid ABC transporter substrate-binding protein from Pseudomonas putida KT2440
86% identity, 100% coverage

Pput_4428 extracellular solute-binding protein from Pseudomonas putida F1
86% identity, 100% coverage

L321_23611 amino acid ABC transporter substrate-binding protein from Pseudomonas plecoglossicida NB2011
86% identity, 100% coverage

W6QUE9 Putative amino-acid ABC transporter-binding protein from Ectopseudomonas oleovorans (strain CECT 5344)
79% identity, 100% coverage

PA3858 probable amino acid-binding protein from Pseudomonas aeruginosa PAO1
64% identity, 100% coverage

PA14_14100 Putative amino-acid ABC transporter binding protei from Pseudomonas aeruginosa UCBPP-PA14
64% identity, 100% coverage

EAMY_0266 putative ABC transport system, periplasmic component from Erwinia amylovora CFBP1430
66% identity, 100% coverage

VV1_2703 ABC-type amino acid transport, signal transduction systems, periplasmic component/domain from Vibrio vulnificus CMCP6
59% identity, 100% coverage

VpaChn25_1613, WU75_14760 amino acid ABC transporter substrate-binding protein from Vibrio parahaemolyticus
Q87P98 Amino acid ABC transporter, periplasmic amino acid-binding protein from Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633)
VP1620 amino acid ABC transporter, periplasmic amino acid-binding protein from Vibrio parahaemolyticus RIMD 2210633
59% identity, 99% coverage

BAU10_07410 amino acid ABC transporter substrate-binding protein from Vibrio alginolyticus
58% identity, 99% coverage

AAPJ_RHIJ3 / Q52812 General L-amino acid-binding periplasmic protein AapJ from Rhizobium johnstonii (strain DSM 114642 / LMG 32736 / 3841) (Rhizobium leguminosarum bv. viciae) (see 2 papers)
TC 3.A.1.3.8 / Q52812 AapJ, component of General L-amino acid porter; transports basic and acidic amino acids preferentially, but also transports aliphatic amino acids (catalyzes both uptake and efflux) from Rhizobium leguminosarum (biovar viciae) (see 2 papers)
aapJ / CAA57933.1 general amino acid ABC type transporter from Rhizobium leguminosarum (see paper)
RL2204 General L-amino acid substrate binding protein from Rhizobium leguminosarum bv. viciae 3841
58% identity, 98% coverage

VC1362 amino acid ABC transporter, periplasmic amino acid-binding protein from Vibrio cholerae O1 biovar eltor str. N16961
VCV52_1340 amino acid ABC transporter, periplasmic amino acid-binding protein from Vibrio cholerae V52
61% identity, 92% coverage

SMc02118 ABC transporter for L-Glutamine, L-Histidine, and other L-amino acids, periplasmic substrate-binding component from Sinorhizobium meliloti 1021
Q92Q71 Probable general L-amino acid-binding periplasmic ABC transporter from Rhizobium meliloti (strain 1021)
SMc02118 PROBABLE GENERAL L-AMINO ACID-BINDING PERIPLASMIC ABC TRANSPORTER PROTEIN from Sinorhizobium meliloti 1021
55% identity, 99% coverage

VSAL_I2057 amino acid ABC transporter substrate-binding protein from Aliivibrio salmonicida LFI1238
VSAL_I2057 general L-amino acid-binding periplasmic protein precursor from Vibrio salmonicida LFI1238
57% identity, 100% coverage

BCAN_A0756 lysine-arginine-ornithine-binding periplasmic protein from Brucella canis ATCC 23365
BR0741 amino acid ABC transporter, periplasmic amino acid-binding protein from Brucella suis 1330
BOV_0736 amino acid ABC transporter, periplasmic amino acid-binding protein from Brucella ovis ATCC 25840
58% identity, 99% coverage

BME_RS06090 amino acid ABC transporter substrate-binding protein from Brucella melitensis bv. 1 str. 16M
Q8YGE8 General l-amino acid-binding periplasmic protein aapj from Brucella melitensis biotype 1 (strain ATCC 23456 / CCUG 17765 / NCTC 10094 / 16M)
BMEI1211 GENERAL L-AMINO ACID-BINDING PERIPLASMIC PROTEIN AAPJ PRECURSOR from Brucella melitensis 16M
58% identity, 99% coverage

4z9nB / A0A0M3KL33 Abc transporter / periplasmic binding protein from brucella ovis with glutathione bound
60% identity, 92% coverage

BRA0948 amino acid ABC transporter, periplasmic amino acid-binding protein from Brucella suis 1330
58% identity, 98% coverage

BCAN_B0969 amino-acid ABC transporter-binding protein yhdW precursor from Brucella canis ATCC 23365
58% identity, 98% coverage

BOV_A0890 amino acid ABC transporter, periplasmic amino acid-binding protein from Brucella ovis ATCC 25840
58% identity, 98% coverage

BMEII0349 GENERAL L-AMINO ACID-BINDING PERIPLASMIC PROTEIN AAPJ PRECURSOR from Brucella melitensis 16M
59% identity, 94% coverage

BAB_RS27735 amino acid ABC transporter substrate-binding protein from Brucella abortus 2308
57% identity, 98% coverage

Atu1577 ABC transporter, substrate binding protein (amino acid) from Agrobacterium tumefaciens str. C58 (Cereon)
57% identity, 91% coverage

BP0558 amino acid-binding periplasmic protein from Bordetella pertussis Tohama I
Q7VS83 Amino acid-binding periplasmic protein from Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251)
56% identity, 93% coverage

I6I48_RS29915 amino acid ABC transporter substrate-binding protein from Achromobacter xylosoxidans
55% identity, 94% coverage

BP3831 putative ABC transporter periplasmic amino acid-binding protein from Bordetella pertussis Tohama I
Q7VSU1 ABC transporter periplasmic amino acid-binding protein from Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251)
53% identity, 100% coverage

BP1529 putative extracellular solute-binding protein from Bordetella pertussis Tohama I
56% identity, 92% coverage

RSP_1747 ABC glutamate/glutamine/aspartate/asparagine transporter, periplasmic substrate-binding protein from Rhodobacter sphaeroides 2.4.1
54% identity, 92% coverage

PB7211_1204 amino acid ABC transporter substrate-binding protein from Candidatus Pelagibacter sp. HTCC7211
53% identity, 99% coverage

SAR11_0953 ABC transporter from Candidatus Pelagibacter ubique HTCC1062
53% identity, 93% coverage

YP_165781 glutamate/glutamine/aspartate/asparagine ABC transporter, periplasmic substrate-binding protein from Silicibacter pomeroyi DSS-3
52% identity, 83% coverage

A1B061 L-aspartate-binding protein / L-glutamate-binding protein / L-glutamine-binding protein / L-asparagine-binding protein from Paracoccus denitrificans (strain Pd 1222)
52% identity, 93% coverage

CD16_RS00210 amino acid ABC transporter substrate-binding protein from Candidatus Liberibacter asiaticus
CLIBASIA_00265 cationic amino acid ABC transporter, periplasmic binding protein from Candidatus Liberibacter asiaticus str. psy62
46% identity, 99% coverage

bll2909 bll2909 from Bradyrhizobium japonicum USDA 110
52% identity, 92% coverage

Dshi_0318 cationic amino acid ABC transporter, periplasmic binding protein from Dinoroseobacter shibae DFL 12
54% identity, 92% coverage

ZP_02146891 glutamate/glutamine/aspartate/asparagine ABC transporter, periplasmic substrate-binding protein from Phaeobacter gallaeciensis BS107
52% identity, 92% coverage

TC 3.A.1.3.7 / Q52663 BztA, component of Glutamate/glutamine/aspartate/asparagine porter from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (see paper)
52% identity, 92% coverage

SPO2364 amino acid ABC transporter substrate-binding protein from Ruegeria pomeroyi DSS-3
50% identity, 92% coverage

Rru_A0779 extracellular solute-binding protein, family 3 from Rhodospirillum rubrum ATCC 11170
46% identity, 92% coverage

OA04_42530 amino acid ABC transporter substrate-binding protein from Pectobacterium versatile
44% identity, 97% coverage

Synpcc7942_0246 extracellular solute-binding protein, family 3 from Synechococcus elongatus PCC 7942
44% identity, 89% coverage

RPA2628 polar amino acid ABC transport substrate-binding protein, aapJ-2 from Rhodopseudomonas palustris CGA009
48% identity, 93% coverage

TC 3.A.1.3.18 / Q8YPM9 NatF, component of Acidic and neutral amino acid uptake transporter NatFGH/BgtA. BgtA is shared with BgtAB (see paper)
alr4164 periplasmic amino acid-binding protein of amino acid ABC transporter from Nostoc sp. PCC 7120
43% identity, 87% coverage

SYNW0840 ABC transporter, substrate binding protein for amino acids from Synechococcus sp. WH 8102
44% identity, 94% coverage

BL107_14770 extracellular solute-binding protein, family 3 from Synechococcus sp. BL107
45% identity, 84% coverage

EP10_002623 glutamate ABC transporter substrate-binding protein from Geobacillus icigianus
29% identity, 66% coverage

GALLO_1556 Putative ABC transporter, amino acid binding protein from Streptococcus gallolyticus UCN34
33% identity, 52% coverage

IUJ47_RS08610 transporter substrate-binding domain-containing protein from Enterococcus faecalis
32% identity, 54% coverage

Q836J2 Amino acid ABC transporter, amino acid-binding protein from Enterococcus faecalis (strain ATCC 700802 / V583)
EF1119 amino acid ABC transporter, amino acid-binding protein from Enterococcus faecalis V583
OG1RF_10897 transporter substrate-binding domain-containing protein from Enterococcus faecalis OG1RF
32% identity, 54% coverage

4zv1A An ancestral arginine-binding protein bound to arginine (see paper)
30% identity, 60% coverage

PA5082 probable binding protein component of ABC transporter from Pseudomonas aeruginosa PAO1
26% identity, 72% coverage

SAG0717 amino acid ABC transporter, amino acid-binding protein from Streptococcus agalactiae 2603V/R
30% identity, 51% coverage

5t0wA Crystal structure of the ancestral amino acid-binding protein anccdt- 1, a precursor of cyclohexadienyl dehydratase
28% identity, 70% coverage

BCAL1668 periplasmic solute-binding protein from Burkholderia cenocepacia J2315
27% identity, 71% coverage

OA04_45500 ABC transporter substrate-binding protein from Pectobacterium versatile
27% identity, 74% coverage

CJM1_0885 bifunctional adhesin/ABC transporter aspartate/glutamate-binding protein PEB1a from Campylobacter jejuni subsp. jejuni M1
CJJ81176_0928 amino acid ABC transporter, periplasmic amino acid-binding protein PEB1 from Campylobacter jejuni subsp. jejuni 81-176
C8J_0858 amino acid ABC transporter, periplasmic amino acid-binding protein PEB1 from Campylobacter jejuni subsp. jejuni 81116
27% identity, 69% coverage

Q1GBX1 Amino acid ABC transporter, substrate binding protein from Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC 11842 / DSM 20081 / BCRC 10696 / JCM 1002 / NBRC 13953 / NCIMB 11778 / NCTC 12712 / WDCM 00102 / Lb 14)
26% identity, 69% coverage

TC 3.A.1.3.16 / Q0P9X8 PEB1A, component of Uptake system for glutamate and aspartate from Campylobacter jejuni (see 2 papers)
peb1A / RF|YP_002344319.1 major cell-binding factor from Campylobacter jejuni (see 3 papers)
peb1 / AAA02919.1 major cell-binding factor from Campylobacter jejuni (see paper)
Cj0921c, NP_282073 probable ABC-type amino-acid transporter periplasmic solute-binding protein from Campylobacter jejuni subsp. jejuni NCTC 11168
YP_002344319 bifunctional adhesin/ABC transporter aspartate/glutamate-binding protein from Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819
27% identity, 69% coverage

STM0665 ABC superfamily (bind_prot), glutamate/aspartate transporter from Salmonella typhimurium LT2
25% identity, 85% coverage

SEN0634 ABC transporter periplasmic binding protein from Salmonella enterica subsp. enterica serovar Enteritidis str. P125109
25% identity, 85% coverage

Q0P9S0 Amino-acid transporter periplasmic solute-binding protein from Campylobacter jejuni subsp. jejuni serotype O:2 (strain ATCC 700819 / NCTC 11168)
Cj0982c putative amino-acid transporter periplasmic solute-binding protein from Campylobacter jejuni subsp. jejuni NCTC 11168
26% identity, 69% coverage

cjaA / CAJ20048.1 glutamine-binding protein from Campylobacter jejuni (see paper)
CJJ81176_1001 CjaA protein from Campylobacter jejuni subsp. jejuni 81-176
26% identity, 69% coverage

B6D87_RS01815 transporter substrate-binding domain-containing protein from Pseudomonas fragi
26% identity, 72% coverage

DVU2342, ORF02865 amino acid ABC transporter, periplasmic amino acid-binding protein from Desulfovibrio vulgaris Hildenborough
27% identity, 61% coverage

Q8Z8G8 ABC transporter periplasmic binding protein from Salmonella typhi
24% identity, 85% coverage

Entcl_3149 amino acid ABC transporter substrate-binding protein from [Enterobacter] lignolyticus SCF1
25% identity, 77% coverage

FORC47_RS03370 glutamine ABC transporter substrate-binding protein GlnH from Bacillus cereus
25% identity, 68% coverage

TM0593 amino acid ABC transporter, periplasmic amino acid-binding protein from Thermotoga maritima MSB8
Q9WZ62 Amino acid ABC transporter, periplasmic amino acid-binding protein from Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8)
26% identity, 68% coverage

Q9HWI6 Probable binding protein component of ABC transporter from Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1)
PA4195 probable binding protein component of ABC transporter from Pseudomonas aeruginosa PAO1
28% identity, 68% coverage

1xt8B / Q0P9S0 Crystal structure of cysteine-binding protein from campylobacter jejuni at 2.0 a resolution (see paper)
27% identity, 62% coverage

EAM_RS05505 amino acid ABC transporter substrate-binding protein from Erwinia amylovora ATCC 49946
24% identity, 78% coverage

SPO2658 amino acid ABC transporter substrate-binding protein from Ruegeria pomeroyi DSS-3
27% identity, 70% coverage

BTH_I2450 extracellular solute-binding protein from Burkholderia thailandensis E264
27% identity, 63% coverage

PMI0437 glutamate/aspartate ABC transporter, substrate-binding protein from Proteus mirabilis HI4320
25% identity, 76% coverage

C3L23_RS03070 transporter substrate-binding domain-containing protein from Nautilia sp. PV-1
26% identity, 66% coverage

LBA1046 glutamine ABC transporter substrate-binding protein from Lactobacillus acidophilus NCFM
25% identity, 76% coverage

2v25A / Q0P9X8 Structure of the campylobacter jejuni antigen peb1a, an aspartate and glutamate receptor with bound aspartate (see paper)
26% identity, 61% coverage

BPSL2924 glutamate/aspartate periplasmic binding protein precursor from Burkholderia pseudomallei K96243
25% identity, 76% coverage

APL_1694 antigenic protein, ABC transporter-like protein from Actinobacillus pleuropneumoniae L20
APP7_1755 antigenic protein, ABC transporter-like protein from Actinobacillus pleuropneumoniae serovar 7 str. AP76
26% identity, 71% coverage

HMU03500 putative amino-acid transporter periplasmic solute-binding protein from Helicobacter mustelae 12198
24% identity, 70% coverage

A9497_01790, STER_RS05530 transporter substrate-binding domain-containing protein from Streptococcus thermophilus
29% identity, 52% coverage

AKL23_RS05335 transporter substrate-binding domain-containing protein from Streptococcus thermophilus
29% identity, 52% coverage

HZ99_18940 transporter substrate-binding domain-containing protein from Pseudomonas fluorescens
28% identity, 62% coverage

APJL_1726 putative ABC transporter, periplasmic binding protein from Actinobacillus pleuropneumoniae serovar 3 str. JL03
26% identity, 71% coverage

2ia4B / A0A0H2UXX1 Crystal structure of novel amino acid binding protein from shigella flexneri
25% identity, 66% coverage

CLP_0371 cysteine ABC transporter substrate-binding protein from Clostridium butyricum E4 str. BoNT E BL5262
25% identity, 75% coverage

UTI89_C0651 glutamate/aspartate periplasmic binding protein precursor from Escherichia coli UTI89
25% identity, 66% coverage

ACINB_20500 amino acid ABC transporter substrate-binding protein from Acidovorax sp. NB1
26% identity, 68% coverage

HH1481 probable ABC-type amino-acid transporter periplasmic solute-binding protein from Helicobacter hepaticus ATCC 51449
27% identity, 68% coverage

YPTB3957 ABC transporter, periplasmic amino acid binding protein from Yersinia pseudotuberculosis IP 32953
YPO4111 putative periplasmic solute-binding protein from Yersinia pestis CO92
y4125 putative solute-binding periplasmic protein precursor for ABC transporter from Yersinia pestis KIM
26% identity, 67% coverage

X276_14095 cysteine ABC transporter substrate-binding protein from Clostridium beijerinckii NRRL B-598
26% identity, 63% coverage

6svfA / Q9WZ62 Crystal structure of the p235gk mutant of argbp from t. Maritima (see paper)
26% identity, 63% coverage

BURMUCGD2M_3196 glutamate/aspartate ABC transporter, periplasmic glutamate/aspartate-binding protein from Burkholderia multivorans CGD2M
Bmul_2714 glutamate/aspartate ABC transporter substrate-binding protein from Burkholderia multivorans
25% identity, 76% coverage

YPO2615 putative amino acid-binding protein precursor from Yersinia pestis CO92
y1189 solute-binding periplasmic protein of glutamate/aspartate ABC transporter from Yersinia pestis KIM
YPK_3010 extracellular solute-binding protein from Yersinia pseudotuberculosis YPIII
YPTB1108 ABC transporter, periplasmic glutamate/aspatate binding protein from Yersinia pseudotuberculosis IP 32953
23% identity, 83% coverage

ECs0694 putative periplasmic binding transport protein from Escherichia coli O157:H7 str. Sakai
25% identity, 66% coverage

HBZC1_05960 transporter substrate-binding domain-containing protein from Helicobacter bizzozeronii CIII-1
25% identity, 78% coverage

AO353_21710 ABC transporter for D-glucosamine, periplasmic substrate-binding component from Pseudomonas fluorescens FW300-N2E3
26% identity, 69% coverage

plu1307 Glutamate/aspartate transport system permease protein GltI from Photorhabdus luminescens subsp. laumondii TTO1
24% identity, 77% coverage

NGO2014 putative ABC transporter, periplasmic binding protein, amino acid from Neisseria gonorrhoeae FA 1090
25% identity, 70% coverage

SSUBM407_0596 glutamine-binding protein precursor from Streptococcus suis BM407
SSUSC84_RS06430 transporter substrate-binding domain-containing protein from Streptococcus suis SC84
26% identity, 62% coverage

A9762_17050 glutamate/aspartate ABC transporter substrate-binding protein from Pandoraea sp. ISTKB
26% identity, 67% coverage

8ovoA / P37902,P42212 X-ray structure of the sf-iglusnfr-s72a in complex with l-aspartate
25% identity, 45% coverage

Aave_4073 extracellular solute-binding protein, family 3 from Acidovorax avenae subsp. citrulli AAC00-1
27% identity, 64% coverage

I35_RS02685 glutamate/aspartate ABC transporter substrate-binding protein from Burkholderia cenocepacia H111
BCAL3358 periplasmic glutamate/aspartate-binding protein from Burkholderia cenocepacia J2315
26% identity, 70% coverage

DVU0752, ORF00176 amino acid ABC transporter, amino acid-binding protein from Desulfovibrio vulgaris Hildenborough
30% identity, 64% coverage

CCC13826_0664 surface antigen, CjaA from Campylobacter concisus 13826
25% identity, 69% coverage

H16_A0472 ABC-type transporter, periplasmic component: PAAT family from Ralstonia eutropha H16
26% identity, 68% coverage

alr3429 glutamine-binding protein of glutamine ABC transporter from Nostoc sp. PCC 7120
22% identity, 68% coverage

Q8PVM4 Glutamine-binding protein from Methanosarcina mazei (strain ATCC BAA-159 / DSM 3647 / Goe1 / Go1 / JCM 11833 / OCM 88)
MM1939 Glutamine-binding protein from Methanosarcina mazei Goe1
27% identity, 59% coverage

2yjpA / Q5F5B5 Crystal structure of the solute receptors for l-cysteine of neisseria gonorrhoeae (see paper)
25% identity, 63% coverage

CAC0380 Periplasmic amino acid-binding protein from Clostridium acetobutylicum ATCC 824
CA_C0380, CEA_G0390 ABC transporter substrate-binding protein from Clostridium acetobutylicum EA 2018
27% identity, 69% coverage

AO356_00480 ABC transporter for D-Glucosamine, periplasmic substrate-binding component from Pseudomonas fluorescens FW300-N2C3
25% identity, 72% coverage

SM12261_RS07180 transporter substrate-binding domain-containing protein from Streptococcus mitis NCTC 12261
28% identity, 61% coverage

PPYC1_02025 cysteine ABC transporter substrate-binding protein from Paenibacillus polymyxa
26% identity, 65% coverage

GltI / b0655 glutamate/aspartate ABC transporter periplasmic binding protein (EC 7.4.2.1) from Escherichia coli K-12 substr. MG1655 (see 4 papers)
GltI / P37902 glutamate/aspartate ABC transporter periplasmic binding protein (EC 7.4.2.1) from Escherichia coli (strain K12) (see 3 papers)
GLTI_ECOLI / P37902 Glutamate/aspartate import solute-binding protein from Escherichia coli (strain K12) (see 3 papers)
TC 3.A.1.3.4 / P37902 YBEJ aka GltI aka B0655, component of Glutamate/aspartate porter from Escherichia coli (see 6 papers)
gltI / GB|BAA35307.2 glutamate-aspartate periplasmic-binding protein from Escherichia coli K12 (see 6 papers)
b0655 glutamate and aspartate transporter subunit from Escherichia coli str. K-12 substr. MG1655
NP_415188 glutamate/aspartate ABC transporter periplasmic binding protein from Escherichia coli str. K-12 substr. MG1655
25% identity, 66% coverage

WS0279 BINDING COMPONENT OF ABC TRANSPORTER from Wolinella succinogenes DSM 1740
Q7MAG0 BINDING COMPONENT OF ABC TRANSPORTER from Wolinella succinogenes (strain ATCC 29543 / DSM 1740 / CCUG 13145 / JCM 31913 / LMG 7466 / NCTC 11488 / FDC 602W)
24% identity, 73% coverage

GLNH_BACSU / O34563 ABC transporter glutamine-binding protein GlnH from Bacillus subtilis (strain 168) (see paper)
26% identity, 61% coverage

SSA_1567 Polar amino acid ABC transporter, amino acid-binding protein, putative from Streptococcus sanguinis SK36
26% identity, 55% coverage

SP_0609 amino acid ABC transporter, amino acid-binding protein from Streptococcus pneumoniae TIGR4
28% identity, 57% coverage

SMb21135 putative amino acid uptake ABC transporter periplasmic solute-binding protein precursor from Sinorhizobium meliloti 1021
25% identity, 78% coverage

HP17_RS11910 transporter substrate-binding domain-containing protein from Helicobacter pylori NCTC 11637 = CCUG 17874 = ATCC 43504 = JCM
25% identity, 62% coverage

bll7600 ABC transporter amino-acid-binding protein from Bradyrhizobium japonicum USDA 110
25% identity, 75% coverage

SPD_0530 amino acid ABC transporter, amino acid-binding protein from Streptococcus pneumoniae D39
spr0534 ABC transporter substrate-binding protein - glutamine transport/Major cell binding factor precursor from Streptococcus pneumoniae R6
27% identity, 60% coverage

HMPREF0010_00975 cysteine ABC transporter substrate-binding protein from Acinetobacter baumannii ATCC 19606 = CIP 70.34 = JCM 6841
24% identity, 73% coverage

O25786 Glutamine ABC transporter, periplasmic glutamine-binding protein (GlnH) from Helicobacter pylori (strain ATCC 700392 / 26695)
HP1172 glutamine ABC transporter, periplasmic glutamine-binding protein (glnH) from Helicobacter pylori 26695
25% identity, 62% coverage

MSMEG_0787 Bacterial extracellular solute-binding protein, family protein 3 from Mycobacterium smegmatis str. MC2 155
26% identity, 63% coverage

H7F35_04365 ABC transporter substrate-binding protein from Variovorax sp. PAMC26660
26% identity, 73% coverage

SGO_0982 amino acid transport protein from Streptococcus gordonii str. Challis substr. CH1
25% identity, 71% coverage

AFA2_00700 ABC transporter substrate-binding protein from Alcaligenes faecalis subsp. faecalis NBRC 13111
27% identity, 74% coverage

LJ0752 glutamine ABC transporter solute-binding component from Lactobacillus johnsonii NCC 533
29% identity, 53% coverage

HPG27_1116 glutamine ABC transporter, periplasmic glutamine-binding protein from Helicobacter pylori G27
24% identity, 62% coverage

pRL120071 putative substrate-binding component of ABC transporter from Rhizobium leguminosarum bv. viciae 3841
23% identity, 85% coverage

BPSS0153 glutamate/aspartate periplasmic binding protein precursor from Burkholderia pseudomallei K96243
24% identity, 69% coverage

B6D87_RS09955 ABC transporter substrate-binding protein from Pseudomonas fragi
24% identity, 70% coverage

jhp1099 AMINO ACID ABC TRANSPORTER, BINDING PROTEIN PRECURSOR from Helicobacter pylori J99
24% identity, 62% coverage

Pf6N2E2_2053 ABC transporter for D-Glucosamine, periplasmic substrate-binding component from Pseudomonas fluorescens FW300-N2E2
24% identity, 66% coverage

AO353_16290 ABC transporter for L-aspartate, L-asparagine, L-glutamate, and L-glutamine, periplasmic substrate-binding component from Pseudomonas fluorescens FW300-N2E3
26% identity, 69% coverage

RSUY_41100, RSUY_RS20020 ABC transporter substrate-binding protein from Ralstonia solanacearum
26% identity, 74% coverage

9e2bC / A0A101DJ27 Structure of a solute binding protein from desulfonauticus sp. Bound to l-tryptophan
24% identity, 70% coverage

MAV_4750 Bacterial extracellular solute-binding protein, family protein 3 from Mycobacterium avium 104
26% identity, 53% coverage

bglu_1g05590 Glutamate/aspartate ABC transporter, periplasmic glutamate/aspartate-binding protein from Burkholderia glumae BGR1
25% identity, 71% coverage

AH67_06815 cysteine ABC transporter substrate-binding protein from Bifidobacterium pseudolongum PV8-2
26% identity, 66% coverage

TC 2.A.56.3.1 / P74223 GtrC aka GLNH aka SLL1104, component of Tripartite glutamate:Na+ symporter (see paper)
sll1104 glutamine-binding protein from Synechocystis sp. PCC 6803
24% identity, 76% coverage

TEL01S_RS02380 transporter substrate-binding domain-containing protein from Pseudothermotoga elfii DSM 9442 = NBRC 107921
25% identity, 71% coverage

Psyr_3908 extracellular solute-binding protein, family 3 from Pseudomonas syringae pv. syringae B728a
25% identity, 69% coverage

M9QLL2 Peb1A (Fragment) from Campylobacter jejuni
30% identity, 38% coverage

6h2tA / P96257 Glnh bound to glu, mycobacterium tuberculosis (see paper)
31% identity, 40% coverage

MAB_4223 Probable glutamine-binding protein GlnH from Mycobacterium abscessus ATCC 19977
26% identity, 58% coverage

Mb0419c PROBABLE GLUTAMINE-BINDING LIPOPROTEIN GLNH (GLNBP) from Mycobacterium bovis AF2122/97
P96257 Probable glutamine-binding lipoprotein GlnH (GLNBP) from Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv)
Rv0411c PROBABLE GLUTAMINE-BINDING LIPOPROTEIN GLNH (GLNBP) from Mycobacterium tuberculosis H37Rv
31% identity, 40% coverage

RSp0931 ABC transporter substrate-binding protein from Ralstonia pseudosolanacearum GMI1000
25% identity, 74% coverage

B2J7D8 Extracellular solute-binding protein, family 3 from Nostoc punctiforme (strain ATCC 29133 / PCC 73102)
30% identity, 18% coverage

PA2204 probable binding protein component of ABC transporter from Pseudomonas aeruginosa PAO1
Q9I1R3 Probable binding protein component of ABC transporter from Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1)
24% identity, 74% coverage

GASBP_PSEAE / Q9I402 L-glutamate/L-aspartate-binding protein from Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) (see paper)
TC 3.A.1.3.22 / Q9I402 Probable binding protein component of ABC transporter, component of Amino acid transporter, AatJMQP. Probably transports L-glutamic acid, D-glutamine acid, L-glutamine and N-acetyl L-glutamic acid (Johnson et al. 2008). Very similar to 3.A.1.3.19 of P. putida from Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 / 1C / PRS 101 / LMG 12228)
PA1342 probable binding protein component of ABC transporter from Pseudomonas aeruginosa PAO1
25% identity, 71% coverage

3k4uE / Q7MAG0 Crystal structure of putative binding component of abc transporter from wolinella succinogenes dsm 1740 complexed with lysine
25% identity, 65% coverage

BB0329 probable extracellular solute-binding protein from Bordetella bronchiseptica RB50
24% identity, 63% coverage

PA14_46910 putative binding protein component of ABC transporter from Pseudomonas aeruginosa UCBPP-PA14
25% identity, 71% coverage

PMI2898 amino acid ABC transporter, substrate-binding protein from Proteus mirabilis HI4320
PMI_RS14325 ABC transporter substrate-binding protein from Proteus mirabilis HI4320
25% identity, 66% coverage

PA14_36200 putative binding protein component of ABC transporter from Pseudomonas aeruginosa UCBPP-PA14
24% identity, 74% coverage

PMA4326_020240 glutamate/aspartate ABC transporter substrate-binding protein from Pseudomonas syringae pv. maculicola str. ES4326
25% identity, 63% coverage

Pf1N1B4_771 ABC transporter for L-asparagine and L-glutamate, periplasmic substrate-binding component from Pseudomonas fluorescens FW300-N1B4
25% identity, 63% coverage

cg3045 ABC-type amino acid transport system, secreted component from Corynebacterium glutamicum ATCC 13032
28% identity, 37% coverage

Q9HUA7 Probable binding protein component of ABC transporter from Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1)
PA5076 probable binding protein component of ABC transporter from Pseudomonas aeruginosa PAO1
26% identity, 68% coverage

PSPTO5180 cystine ABC transporter, periplasmic cystine binding protein from Pseudomonas syringae pv. tomato str. DC3000
24% identity, 66% coverage

5eyfB / Q3XZW5 Crystal structure of solute-binding protein from enterococcus faecium with bound glutamate
24% identity, 55% coverage

Blon_0747 extracellular solute-binding protein, family 3 from Bifidobacterium longum subsp. infantis ATCC 15697
26% identity, 66% coverage

SPO3040 transporter substrate-binding domain-containing protein from Ruegeria pomeroyi DSS-3
26% identity, 59% coverage

VT47_18645 glutamate/aspartate ABC transporter substrate-binding protein from Pseudomonas syringae CC1543
25% identity, 63% coverage

BLGT_RS07135 cysteine ABC transporter substrate-binding protein from Bifidobacterium longum subsp. longum GT15
26% identity, 66% coverage

Cbei_1049 extracellular solute-binding protein from Clostridium beijerincki NCIMB 8052
26% identity, 69% coverage

AH68_02785 cysteine ABC transporter substrate-binding protein from Bifidobacterium catenulatum PV20-2
25% identity, 72% coverage

PSPTO_4171 amino acid ABC transporter, periplasmic amino acid-binding protein from Pseudomonas syringae pv. tomato str. DC3000
25% identity, 63% coverage

D9Q9A4 Transporter substrate-binding domain-containing protein from Corynebacterium pseudotuberculosis (strain C231)
CpC231_0647 glutamate ABC transporter substrate-binding protein from Corynebacterium pseudotuberculosis C231
23% identity, 60% coverage

MSMEG_2727, MSMEI_2660 glutamate ABC transporter substrate-binding protein from Mycolicibacterium smegmatis MC2 155
A0QVX3 Glutamate binding protein from Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155)
MSMEG_2727 glutamate binding protein from Mycobacterium smegmatis str. MC2 155
23% identity, 56% coverage

PFLU_0246 cystine ABC transporter substrate-binding protein from Pseudomonas [fluorescens] SBW25
25% identity, 65% coverage

lp_2312 glutamine ABC transporter, substrate binding protein from Lactobacillus plantarum WCFS1
25% identity, 64% coverage

PP_1071 glutamate/aspartate ABC transporter substrate-binding protein from Pseudomonas putida KT2440
24% identity, 61% coverage

CA_C3620, CEA_G3627 ABC transporter substrate-binding protein from Clostridium acetobutylicum EA 2018
23% identity, 68% coverage

A9497_03785, AKL23_RS07395, STER_RS07565, T303_08720 cysteine ABC transporter substrate-binding protein from Streptococcus thermophilus LMD-9
25% identity, 63% coverage

TC 3.A.1.3.19 / Q88NY2 PP1071, component of Acidic amino acid uptake porter, AatJMQP from Pseudomonas putida (strain KT2440) (see paper)
PP1071 amino acid ABC transporter, periplasmic amino acid-binding protein from Pseudomonas putida KT2440
24% identity, 61% coverage

ABUW_2333, FQU82_01778 amino acid ABC transporter substrate-binding protein from Acinetobacter baumannii
D0C807 Glutamate-aspartate periplasmic-binding protein from Acinetobacter baumannii (strain ATCC 19606 / DSM 30007 / JCM 6841 / CCUG 19606 / CIP 70.34 / NBRC 109757 / NCIMB 12457 / NCTC 12156 / 81)
24% identity, 72% coverage

GLUB_CORGL / P48242 Glutamate-binding protein GluB; Glutamate uptake system protein GluB from Corynebacterium glutamicum (strain ATCC 13032 / DSM 20300 / JCM 1318 / BCRC 11384 / CCUG 27702 / LMG 3730 / NBRC 12168 / NCIMB 10025 / NRRL B-2784 / 534) (see 2 papers)
TC 3.A.1.3.9 / P48242 GluB aka CGL1951, component of Glutamate porter from Corynebacterium glutamicum (Brevibacterium flavum) (see 2 papers)
cg2137 glutamate secreted binding protein from Corynebacterium glutamicum ATCC 13032
NCgl1876 glutamate ABC transporter substrate-binding protein GluB from Corynebacterium glutamicum ATCC 13032
22% identity, 64% coverage

TcyJ / b1920 cystine ABC transporter periplasmic binding protein (EC 7.4.2.12) from Escherichia coli K-12 substr. MG1655 (see 12 papers)
tcyJ / P0AEM9 cystine ABC transporter periplasmic binding protein (EC 7.4.2.12) from Escherichia coli (strain K12) (see 13 papers)
TCYJ_ECOLI / P0AEM9 L-cystine-binding protein TcyJ; CBP; Protein FliY; Sulfate starvation-induced protein 7; SSI7 from Escherichia coli (strain K12) (see 6 papers)
TC 3.A.1.3.10 / P0AEM9 Cystine-binding periplasmic protein FLIY aka CysX aka B1920, component of Cystine/cysteine/diaminopimelate transporter, CysXYZ; these proteins are also designated FliY/YecS/YecC from Escherichia coli (see 7 papers)
NP_416430 cystine ABC transporter periplasmic binding protein from Escherichia coli str. K-12 substr. MG1655
P0AEN0 L-cystine-binding protein TcyJ from Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UPEC)
b1920 cystine transporter subunit from Escherichia coli str. K-12 substr. MG1655
25% identity, 70% coverage

CPR_1324 amino acid ABC transporter (binding protein) from Clostridium perfringens SM101
24% identity, 65% coverage

Pput_1112 extracellular solute-binding protein from Pseudomonas putida F1
24% identity, 61% coverage

SF5M90T_1910 cystine ABC transporter substrate-binding protein from Shigella flexneri 5a str. M90T
25% identity, 70% coverage

c2335 Cystine-binding periplasmic protein precursor from Escherichia coli CFT073
25% identity, 70% coverage

WIGMOR_RS02980 transporter substrate-binding domain-containing protein from Wigglesworthia glossinidia endosymbiont of Glossina morsitans
24% identity, 67% coverage

Z3010 putative periplasmic binding transport protein from Escherichia coli O157:H7 EDL933
24% identity, 70% coverage

Pput_0242 cystine transporter subunit from Pseudomonas putida F1
23% identity, 64% coverage

PP0227 cysteine ABC transporter, periplasmic cysteine-binding protein, putative from Pseudomonas putida KT2440
23% identity, 64% coverage

LSA_RS00930 amino acid ABC transporter substrate-binding protein from Fructilactobacillus sanfranciscensis TMW 1.1304
22% identity, 50% coverage

PVLB_05350 glutamate/aspartate ABC transporter substrate-binding protein from Pseudomonas sp. VLB120
24% identity, 61% coverage

PSPTO_1134 amino acid ABC transporter, periplasmic amino acid-binding protein from Pseudomonas syringae pv. tomato str. DC3000
24% identity, 67% coverage

E6B08_RS28125 transporter substrate-binding domain-containing protein from Pseudomonas putida
27% identity, 63% coverage

sll0064 unknown protein from Synechocystis sp. PCC 6803
26% identity, 69% coverage

Teth39_1765 extracellular solute-binding protein from Thermoanaerobacter ethanolicus ATCC 33223
28% identity, 45% coverage

SMb20263 putative ABC transporter periplasmic amino acid-binding protein from Sinorhizobium meliloti 1021
25% identity, 50% coverage

A9CGZ5 ABC transporter, substrate binding protein (Amino acid) from Agrobacterium fabrum (strain C58 / ATCC 33970)
Atu4678 ABC transporter, substrate binding protein (amino acid) from Agrobacterium tumefaciens str. C58 (Cereon)
26% identity, 54% coverage

B1745_05195 transporter substrate-binding domain-containing protein from Lactobacillus amylolyticus
23% identity, 82% coverage

sll0224 hypothetical protein from Synechocystis sp. PCC 6803
24% identity, 71% coverage

MMSR116_20450, MMSR116_RS20185 amino acid ABC transporter substrate-binding protein from Methylobacterium mesophilicum SR1.6/6
24% identity, 62% coverage

AH67_03690 glutamate ABC transporter substrate-binding protein from Bifidobacterium pseudolongum PV8-2
25% identity, 56% coverage

STM1954 putative periplasmic binding transport protein from Salmonella typhimurium LT2
NP_460907 putative periplasmic binding transport protein from Salmonella enterica subsp. enterica serovar Typhimurium str. LT2
24% identity, 70% coverage

Q03PN2 ABC-type amino acid transport/signal transduction system, periplasmic component/domain from Levilactobacillus brevis (strain ATCC 367 / BCRC 12310 / CIP 105137 / JCM 1170 / LMG 11437 / NCIMB 947 / NCTC 947)
26% identity, 62% coverage

PfGW456L13_4770 ABC transporter for L-Asparagine and possibly other L-amino acids, periplasmic substrate-binding component from Pseudomonas fluorescens GW456-L13
25% identity, 64% coverage

B0D71_08240 glutamate/aspartate ABC transporter substrate-binding protein from Pseudomonas laurylsulfativorans
25% identity, 64% coverage

AH68_03390 glutamate ABC transporter substrate-binding protein from Bifidobacterium catenulatum PV20-2
23% identity, 69% coverage

MMSR116_RS29475 transporter substrate-binding domain-containing protein from Methylobacterium mesophilicum SR1.6/6
24% identity, 54% coverage

LSA1497 Putative glutamine/glutamate ABC transporter, membrane-spanning/substrate-binding subunit precursor from Lactobacillus sakei subsp. sakei 23K
27% identity, 33% coverage

SENTW_1129 cystine ABC transporter substrate-binding protein from Salmonella enterica subsp. enterica serovar Weltevreden str.
23% identity, 70% coverage

BP951000_0988 amino acid ABC transporter substrate-binding protein from Brachyspira pilosicoli 95/1000
24% identity, 52% coverage

Blon_0710 extracellular solute-binding protein, family 3 from Bifidobacterium longum subsp. infantis ATCC 15697
24% identity, 69% coverage

HSISS4_01405 ABC transporter substrate-binding protein/permease from Streptococcus salivarius
29% identity, 30% coverage

BLGT_07630 glutamate ABC transporter substrate-binding protein from Bifidobacterium longum subsp. longum GT15
24% identity, 69% coverage

GSU0800 amino acid ABC transporter, periplasmic amino acid-binding protein from Geobacter sulfurreducens PCA
25% identity, 62% coverage

BCAS0291 periplasmic solute-binding protein from Burkholderia cenocepacia J2315
28% identity, 50% coverage

OA04_29520 transporter substrate-binding domain-containing protein from Pectobacterium versatile
23% identity, 74% coverage

OG1RF_10537 amino acid ABC transporter substrate-binding protein from Enterococcus faecalis OG1RF
26% identity, 58% coverage

A1S_1399 ArtI protein from Acinetobacter baumannii ATCC 17978
25% identity, 52% coverage

Swol_0316 extracellular solute-binding protein, family 3 from Syntrophomonas wolfei subsp. wolfei str. Goettingen
26% identity, 62% coverage

RHE_RS11720 transporter substrate-binding domain-containing protein from Rhizobium etli CFN 42
RHE_CH02293 probable amino acid ABC transporter, substrate-binding protein from Rhizobium etli CFN 42
25% identity, 60% coverage

PP_0282 ABC transporter substrate-binding protein from Pseudomonas putida KT2440
PP0282 amino acid ABC transporter, periplasmic amino acid-binding protein from Pseudomonas putida KT2440
26% identity, 66% coverage

lpg0491 amino acid (glutamine) ABC transporter, periplasmic amino acid binding protein from Legionella pneumophila subsp. pneumophila str. Philadelphia 1
24% identity, 43% coverage

SCO5776 glutamate binding protein from Streptomyces coelicolor A3(2)
O50494 Glutamate binding protein from Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145)
25% identity, 51% coverage

TC 3.A.1.3.21 / Q9I484 Amino acid ABC transporter periplasmic binding protein, component of Hydroxy L-proline uptake porter, HprABC from Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 / 1C / PRS 101 / LMG 12228)
PA1260 probable binding protein component of ABC transporter from Pseudomonas aeruginosa PAO1
25% identity, 52% coverage

Q98FA8 Amino acid ABC transporter, periplasmic amino acid-binding protein from Mesorhizobium japonicum (strain LMG 29417 / CECT 9101 / MAFF 303099)
23% identity, 71% coverage

A9497_03405 ABC transporter substrate-binding protein/permease from Streptococcus thermophilus
31% identity, 30% coverage

IUJ47_RS06845 amino acid ABC transporter substrate-binding protein/permease from Enterococcus faecalis
29% identity, 21% coverage

AKL23_RS07020 ABC transporter substrate-binding protein/permease from Streptococcus thermophilus
31% identity, 30% coverage

A1S_1490 glutamate/aspartate transport protein from Acinetobacter baumannii ATCC 17978
24% identity, 49% coverage

SPD_0615 transporter substrate-binding domain-containing protein from Streptococcus pneumoniae D39
26% identity, 37% coverage

BP0057 amino-acid ABC transporter binding protein precursor from Bordetella pertussis Tohama I
23% identity, 67% coverage

SP_0708 transporter substrate-binding domain-containing protein from Streptococcus pneumoniae TIGR4
26% identity, 37% coverage

4g4pA / Q837S0 Crystal structure of glutamine-binding protein from enterococcus faecalis at 1.5 a (see paper)
29% identity, 44% coverage

2ylnA / Q5F9M1 Crystal structure of the l-cystine solute receptor of neisseria gonorrhoeae in the closed conformation (see paper)
24% identity, 52% coverage

NGO0372 putative ABC transporter, periplasmic binding protein, amino acid from Neisseria gonorrhoeae FA 1090
24% identity, 52% coverage

Q5F9M1 Amino acid ABC transporter substrate-binding protein from Neisseria gonorrhoeae (strain ATCC 700825 / FA 1090)
24% identity, 52% coverage

Dtur_1051 extracellular solute-binding protein family 3 from Dictyoglomus turgidum DSM 6724
27% identity, 68% coverage

EF0761 amino acid ABC transporter, amino acid-binding/permease protein from Enterococcus faecalis V583
29% identity, 21% coverage

YPTB1718 putative cystine-binding periplasmic protein from Yersinia pseudotuberculosis IP 32953
26% identity, 56% coverage

SUB1152 glutamine ABC transporter, glutamine-binding protein/permease protein from Streptococcus uberis 0140J
31% identity, 21% coverage

PFLU0376 putative ABC transport system, exported protein from Pseudomonas fluorescens SBW25
22% identity, 69% coverage

ECs0946 arginine 3rd transport system periplasmic binding protein from Escherichia coli O157:H7 str. Sakai
24% identity, 67% coverage

PA14_47920 putative binding protein component of ABC transporter from Pseudomonas aeruginosa UCBPP-PA14
24% identity, 52% coverage

SSU1853 amino-acid ABC transporter extracellular-binding protein from Streptococcus suis P1/7
SSUBM407_1923 amino-acid ABC transporter extracellular-binding protein from Streptococcus suis BM407
23% identity, 61% coverage

New Search

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 793,807 different protein sequences to 1,259,118 scientific articles. Searches against EuropePMC were last performed on March 13 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.

PaperBLAST also incorporates manually-curated protein functions:

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

Changes to PaperBLAST since the paper was written:

Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's web site and is available in PubmedCentral or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keeseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory