PaperBLAST – Find papers about a protein or its homologs

 

PaperBLAST Hits for 76 a.a. (MVTPTKHAIG...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmembrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Found 251 similar proteins in the literature:

KP1_3804 putative cellobiose-specific PTS permease from Klebsiella pneumoniae NTUH-K2044
61% identity, 9% coverage

A6TBM5 Cellobiose-specific PTS permease from Klebsiella pneumoniae subsp. pneumoniae (strain ATCC 700721 / MGH 78578)
KPN_02578 putative cellobiose-specific PTS permease from Klebsiella pneumoniae subsp. pneumoniae MGH 78578
61% identity, 9% coverage

BWI76_RS19555 Beta-glucoside phosphotransferase system, IIA/IIB/IIC components from Klebsiella michiganensis M5al
61% identity, 9% coverage

SA0183 hypothetical protein from Staphylococcus aureus subsp. aureus N315
SAV0189 PTS enzyme II from Staphylococcus aureus subsp. aureus Mu50
55% identity, 10% coverage

SAS0164 glucose-specific PTS transporter protein, IIABC component from Staphylococcus aureus subsp. aureus MSSA476
SAUSA300_0191 PTS system, glucose-specific IIBC component domain protein from Staphylococcus aureus subsp. aureus USA300_FPR3757
MW0163 PTS enzyme II (EC 2.7.1.69), glucose-specific, factor IIA homologue from Staphylococcus aureus subsp. aureus MW2
SACOL0175 PTS system, IIABC components from Staphylococcus aureus subsp. aureus COL
55% identity, 10% coverage

RDJ18_RS00835 glucose-specific PTS transporter subunit IIBC from Staphylococcus aureus
SAR0190 glucose-specific PTS transporter protein, IIABC component from Staphylococcus aureus subsp. aureus MRSA252
55% identity, 10% coverage

Q831B4 PTS system, beta-glucoside-specific IIABC component from Enterococcus faecalis (strain ATCC 700802 / V583)
EF2598 PTS system, beta-glucoside-specific IIABC component from Enterococcus faecalis V583
60% identity, 9% coverage

SXYL_01421 PTS sugar transporter subunit IIA from Staphylococcus xylosus
50% identity, 43% coverage

LSEI_2700 Phosphotransferase system IIA component from Lactobacillus casei ATCC 334
51% identity, 45% coverage

Ent638_2740 beta-glucoside-specific PTS system components IIABC from Enterobacter sp. 638
56% identity, 9% coverage

Cbei_3273 PTS system, beta-glucoside-specific IIABC subunit from Clostridium beijerinckii NCIMB 8052
56% identity, 10% coverage

SA1255 PTS system, glucose-specific enzyme II, A component from Staphylococcus aureus subsp. aureus N315
SAV1422 glucose-specific enzyme II, PTS system A component from Staphylococcus aureus subsp. aureus Mu50
50% identity, 43% coverage

SH1484 phosphotransferase system enzyme IIA-like protein from Staphylococcus haemolyticus JCSC1435
47% identity, 43% coverage

PTW3C_BACSU / P39816 PTS system glucosamine-specific EIICBA component; EC 2.7.1.- from Bacillus subtilis (strain 168) (see 4 papers)
TC 4.A.1.1.6 / P39816 The glucosamine IICBA porter (GamP) (40% identical to 4.A.1.1.2) (Plumbridge 2015). The IIA domain in this protein can transfer the phosphoryl moiety to the maltose, N-acetylglucosamine, sucrose and trehalose PTS systems (MalP, NagP, SacP and TreP, respectively) from Bacillus subtilis (see 3 papers)
BSU02350 phosphotransferase system (PTS) glucosamine-specific enzyme IICBA component from Bacillus subtilis subsp. subtilis str. 168
47% identity, 12% coverage

LBA0609 PTS enzyme II, ABC component from Lactobacillus acidophilus NCFM
53% identity, 36% coverage

SPy2097 putative PTS system enzyme II from Streptococcus pyogenes M1 GAS
49% identity, 11% coverage

SAR1435 PTS system, glucose-specific IIA component from Staphylococcus aureus subsp. aureus MRSA252
49% identity, 43% coverage

EF_0270 beta-glucoside-specific PTS transporter subunit IIABC from Enterococcus faecalis V583
51% identity, 12% coverage

SPy1986 putative PTS system, enzyme II, A component from Streptococcus pyogenes M1 GAS
49% identity, 10% coverage

abgF / AAC05713.1 PTS-dependent enzyme II from Clostridium longisporum (see paper)
44% identity, 11% coverage

SAUSA300_1315 PTS system, glucose-specific IIA component from Staphylococcus aureus subsp. aureus USA300_FPR3757
SACOL1457 PTS system, IIA component from Staphylococcus aureus subsp. aureus COL
49% identity, 43% coverage

TC 4.A.1.1.12 / Q48WG5 Maltose/Maltotriose PTS transporter, MalT (Shelburne et al., 2008) 631 aas (68% identical to 4.A.1.1.11 from S. mutans) from Streptococcus pyogenes serotype M1 (see paper)
M5005_Spy_1692 PTS system, glucose-specific IIABC component from Streptococcus pyogenes MGAS5005
49% identity, 11% coverage

M28_Spy1768 PTS system, trehalose-specific IIBC component from Streptococcus pyogenes MGAS6180
55% identity, 9% coverage

llmg_1045 similar to PTS system, beta-glucosides specific enzyme IIABC from Lactococcus lactis subsp. cremoris MG1363
47% identity, 11% coverage

P42015 PTS system glucose-specific EIICBA component (Fragment) from Geobacillus stearothermophilus
47% identity, 22% coverage

SXYL_00528 beta-glucoside-specific PTS transporter subunit IIABC from Staphylococcus xylosus
47% identity, 10% coverage

ECA1870 PTS system, beta-glucoside-specific IIABC component from Erwinia carotovora subsp. atroseptica SCRI1043
55% identity, 9% coverage

CTK_C20580 glucose PTS transporter subunit IIA from Clostridium tyrobutyricum
50% identity, 9% coverage

SAG0192 PTS system, IIABC components from Streptococcus agalactiae 2603V/R
ID870_08455 PTS system trehalose-specific EIIBC component from Streptococcus agalactiae CJB111
53% identity, 9% coverage

SAK_0257 PTS system, trehalose-specific IIBCA component from Streptococcus agalactiae A909
53% identity, 9% coverage

SGO_1653 trehalose PTS enzyme II from Streptococcus gordonii str. Challis substr. CH1
52% identity, 9% coverage

SPSF3K_00506 PTS transporter subunit IIBC from Streptococcus parauberis
52% identity, 8% coverage

SXYL_00369 glucose-specific PTS transporter subunit IIBC from Staphylococcus xylosus
47% identity, 11% coverage

BC4050 PTS system, glucose-specific IIABC component from Bacillus cereus ATCC 14579
44% identity, 10% coverage

GBAA4269 PTS system, glucose-specific IIABC component from Bacillus anthracis str. 'Ames Ancestor'
44% identity, 10% coverage

EF1516 PTS system, IIABC components from Enterococcus faecalis V583
EF_RS07325 N-acetylglucosamine-specific PTS transporter subunit IIBC from Enterococcus faecalis V583
48% identity, 9% coverage

D8IBR5 PTS system, glucose subfamily, IIA subunit from Brachyspira pilosicoli (strain ATCC BAA-1826 / 95/1000)
51% identity, 9% coverage

SMU_2038, SMU_RS09325 PTS system trehalose-specific EIIBC component from Streptococcus mutans UA159
47% identity, 11% coverage

KSF55_11445 N-acetylglucosamine-specific PTS transporter subunit IIBC from Lactiplantibacillus pentosus
51% identity, 9% coverage

lp_0265 glucose PTS transporter subunit IIA from Lactiplantibacillus plantarum WCFS1
lp_0265 beta-glucosides PTS, EIIABC from Lactobacillus plantarum WCFS1
49% identity, 9% coverage

SSU0217 sugar phosphotransferase system (PTS), IIABC component from Streptococcus suis P1/7
54% identity, 8% coverage

CHF17_RS01560 PTS system trehalose-specific EIIBC component from Streptococcus agalactiae
gbs0189 Unknown from Streptococcus agalactiae NEM316
52% identity, 9% coverage

EMQU_2186 beta-glucoside-specific PTS transporter subunit IIABC from Enterococcus mundtii QU 25
51% identity, 10% coverage

SMU_980, SMU_RS04505 beta-glucoside-specific PTS transporter subunit IIABC from Streptococcus mutans UA159
55% identity, 9% coverage

UC7_RS14195 beta-glucoside-specific PTS transporter subunit IIABC from Enterococcus caccae ATCC BAA-1240
49% identity, 10% coverage

SSU05_0398 Phosphotransferase system IIC component, glucose/maltose/N-acetylglucosamine-specific from Streptococcus suis 05ZYH33
54% identity, 9% coverage

C289_1015 glucose-specific PTS transporter subunit IIBC from Anoxybacillus ayderensis
44% identity, 11% coverage

PTU3C_STACT / Q53922 PTS system glucoside-specific EIICBA component; EIICBA-Glc 2; EC 2.7.1.- from Staphylococcus carnosus (strain TM300) (see 2 papers)
TC 4.A.1.1.14 / Q53922 Glucose porter GlcB (IICBA). Glucose uptake is inhibited by methyl-α-D-glucoside, methyl-β-D-glucoside, p-nitrophenyl-α-D-glucoside, o-nitrophenyl-β-D-glucoside and salicin, but not by 2-deoxyglucose. Mannose and N-acetylglucosamine are not transported from Staphylococcus carnosus (strain TM300) (see 4 papers)
49% identity, 9% coverage

Spaf_1559 PTS system trehalose-specific EIIBC component from Streptococcus parasanguinis FW213
52% identity, 9% coverage

SSU0357 glucose-specific phosphotransferase system (PTS), IIABC component from Streptococcus suis P1/7
SSU98_0384 Phosphotransferase system IIC component, glucose/maltose/N-acetylglucosamine-specific from Streptococcus suis 98HAH33
SSUSC84_0343 putative glucose-specific phosphotransferase system (PTS), IIABC component from Streptococcus suis SC84
54% identity, 7% coverage

TC 4.A.1.1.11 / Q8DS05 The maltose/maltotriose porter, MalT (31% identical to 4.A.1.1.9) from Streptococcus mutans (see paper)
SMU_2047, SMU_RS09355 PTS transporter subunit IIBC from Streptococcus mutans UA159
54% identity, 7% coverage

SMUGS5_09220 PTS transporter subunit IIBC from Streptococcus mutans GS-5
54% identity, 7% coverage

BFP66_RS08570 beta-glucoside-specific PTS transporter subunit IIABC from Streptococcus suis
55% identity, 9% coverage

BCAL_RS06200 PTS system trehalose-specific EIIBC component from Bifidobacterium callitrichos DSM 23973
52% identity, 8% coverage

CD0388 PTS system, beta-glucoside-specific IIabc component from Clostridium difficile 630
51% identity, 9% coverage

CD3137 PTS system, IIabc component from Clostridium difficile 630
51% identity, 10% coverage

SSA_0379 PTS system, beta-glucoside-specific EII component, putative from Streptococcus sanguinis SK36
48% identity, 10% coverage

SXYL_00060 PTS cellobiose/arbutin/salicin transporter subunit IIBC from Staphylococcus xylosus
52% identity, 9% coverage

lp_2531 N-acetylglucosamine and glucose PTS, EIICBA from Lactobacillus plantarum WCFS1
lp_2531 N-acetylglucosamine-specific PTS transporter subunit IIBC from Lactiplantibacillus plantarum WCFS1
49% identity, 9% coverage

lmo0738 similar to phosphotransferase system (PTS) beta-glucoside-specific enzyme IIABC component from Listeria monocytogenes EGD-e
LMON_0743 beta-glucoside-specific PTS transporter subunit IIABC from Listeria monocytogenes EGD
56% identity, 8% coverage

str1734 sucrose-specific PTS permease, enzyme II from Streptococcus thermophilus CNRZ1066
52% identity, 9% coverage

SGO_0505 PTS system, IIBC component from Streptococcus gordonii str. Challis substr. CH1
54% identity, 7% coverage

STER_1710, STER_RS08355 sucrose-specific PTS transporter subunit IIBC from Streptococcus thermophilus LMD-9
52% identity, 9% coverage

stu1734 sucrose PTS component II from Streptococcus thermophilus LMG 18311
52% identity, 9% coverage

T303_09505 sucrose-specific PTS transporter subunit IIBC from Streptococcus thermophilus ASCC 1275
52% identity, 9% coverage

SGO_RS01385 beta-glucoside-specific PTS transporter subunit IIABC from Streptococcus gordonii str. Challis substr. CH1
SGO_0281 PTS system, beta-glucoside-specific IIABC component from Streptococcus gordonii str. Challis substr. CH1
48% identity, 10% coverage

CA_C0423 sucrose-specific PTS transporter subunit IIBC from Clostridium acetobutylicum ATCC 824
42% identity, 11% coverage

PTSA_BACSU / P50829 Phosphotransferase enzyme IIA component PtsA; PTS system EIIA component from Bacillus subtilis (strain 168) (see paper)
49% identity, 34% coverage

UH47_06750 glucose-specific PTS transporter subunit IIBC from Staphylococcus pseudintermedius
50% identity, 10% coverage

BJK46_009280 glucose-specific PTS transporter subunit IIBC from Staphylococcus pseudintermedius
50% identity, 10% coverage

Cbei_0751 PTS system, glucose subfamily, IIA subunit from Clostridium beijerinckii NCIMB 8052
43% identity, 11% coverage

SP_0577 PTS system, beta-glucosides-specific IIABC components from Streptococcus pneumoniae TIGR4
54% identity, 10% coverage

SPD_0502 PTS system, beta-glucosides-specific IIABC components from Streptococcus pneumoniae D39
spr0505 Phosphotransferase system sugar-specific EII component from Streptococcus pneumoniae R6
54% identity, 10% coverage

HSISS4_01641 sucrose-specific PTS transporter subunit IIBC from Streptococcus salivarius
50% identity, 9% coverage

TC 4.A.1.2.6 / Q9KJ80 β-glucoside (Aesculin/arbutin) porter, BglP from Streptococcus mutans (see paper)
53% identity, 9% coverage

BH0844 PTS system, glucose-specific enzyme II, A component from Bacillus halodurans C-125
44% identity, 11% coverage

PTG3C_STACT / Q57071 PTS system glucose-specific EIICBA component; EIICBA-Glc; EII-Glc; EIICBA-Glc 1; EC 2.7.1.199 from Staphylococcus carnosus (strain TM300) (see 2 papers)
TC 4.A.1.1.13 / Q57071 Glucose porter, GlcA (IICBA). Glucose uptake is inhibited by 2-deoxyglucose and methyl-β-D-glucoside from Staphylococcus carnosus (strain TM300) (see 4 papers)
46% identity, 10% coverage

LMOf2365_0030 PTS system, beta-glucoside-specific, IIABC component from Listeria monocytogenes str. 4b F2365
lmo0027 similar to PTS system, beta-glucosides specific enzyme IIABC from Listeria monocytogenes EGD-e
LMOf6854_0030 PTS system, beta-glucoside-specific, IIABC component from Listeria monocytogenes str. 1/2a F6854
LMRG_02456 PTS system, beta-glucoside-specific, IIABC component from Listeria monocytogenes 10403S
LM6179_0306, LMRG_RS00135 beta-glucoside-specific PTS transporter subunit IIABC from Listeria monocytogenes 10403S
46% identity, 10% coverage

CA_C0570 glucose PTS transporter subunit IIA from Clostridium acetobutylicum ATCC 824
CAC0570 PTS enzyme II, ABC component from Clostridium acetobutylicum ATCC 824
47% identity, 9% coverage

BSQ49_09740 beta-glucoside-specific PTS transporter subunit IIABC from Liquorilactobacillus hordei
43% identity, 12% coverage

UH47_06270 glucose-specific PTS transporter subunit IIBC from Staphylococcus pseudintermedius
38% identity, 10% coverage

Cbei_2833 PTS system, beta-glucoside-specific IIABC subunit from Clostridium beijerinckii NCIMB 8052
59% identity, 8% coverage

LP_RS14755 beta-glucoside-specific PTS transporter subunit IIABC from Lactiplantibacillus plantarum WCFS1
48% identity, 10% coverage

SAK_0915 PTS system, beta-glucoside-specific IIABC component from Streptococcus agalactiae A909
52% identity, 9% coverage

ID870_00140 PTS transporter subunit IIBC from Streptococcus agalactiae CJB111
SAK_1920 PTS system, glucose-specific IIABC component, putative from Streptococcus agalactiae A909
54% identity, 7% coverage

CD3097 PTS system, IIabc component from Clostridium difficile 630
54% identity, 8% coverage

RBAM_020380 YpqE from Bacillus amyloliquefaciens FZB42
58% identity, 29% coverage

gbs1946 Unknown from Streptococcus agalactiae NEM316
SAG1959 PTS system, IIABC components from Streptococcus agalactiae 2603V/R
54% identity, 7% coverage

SPD_0661 PTS system, IIABC components from Streptococcus pneumoniae D39
spr0668 PTS glucose-specific enzyme IIABC component from Streptococcus pneumoniae R6
52% identity, 7% coverage

SP_0758 PTS transporter subunit IIBC from Streptococcus pneumoniae TIGR4
52% identity, 7% coverage

SSA_1752 Phosphotransferase system, trehalose-specific IIBC component, putative from Streptococcus sanguinis SK36
56% identity, 8% coverage

ID870_01290 sucrose-specific PTS transporter subunit IIBC from Streptococcus agalactiae CJB111
46% identity, 11% coverage

CD2512 PTS system, IIa component from Clostridium difficile 630
48% identity, 35% coverage

gbs1734 Unknown from Streptococcus agalactiae NEM316
46% identity, 11% coverage

LLNZ_07350 glucose PTS transporter subunit IIA from Lactococcus cremoris subsp. cremoris NZ9000
llmg_1426 sucrose-specific PTS system IIBC component from Lactococcus lactis subsp. cremoris MG1363
56% identity, 8% coverage

SAG1690 PTS system, IIABC components from Streptococcus agalactiae 2603V/R
46% identity, 11% coverage

CHF17_RS08755 sucrose-specific PTS transporter subunit IIBC from Streptococcus agalactiae
46% identity, 11% coverage

WP_002581229 PTS sugar transporter subunit IIA from Clostridium butyricum
45% identity, 37% coverage

LP_RS13505 sucrose-specific PTS transporter subunit IIBC from Lactiplantibacillus plantarum WCFS1
lp_3219 sucrose PTS, EIIBCA from Lactobacillus plantarum WCFS1
50% identity, 9% coverage

PFREUD_01380 glucose PTS transporter subunit IIA from Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1
43% identity, 7% coverage

LPST_C2650 sucrose-specific PTS transporter subunit IIBC from Lactiplantibacillus plantarum ST-III
50% identity, 9% coverage

bglP / CAA84286.1 beta-glucoside permease from Bacillus subtilis (see paper)
57% identity, 8% coverage

TC 4.A.1.2.11 / P40739 Aryl β-glucoside porter, IIBCA (BglP; SytA) (35% identical to 4.A.1.2.2) from Bacillus subtilis (see 5 papers)
57% identity, 8% coverage

Q831R2 PTS system, IIA component from Enterococcus faecalis (strain ATCC 700802 / V583)
EF2438 PTS system, IIA component from Enterococcus faecalis V583
61% identity, 25% coverage

BMMGA3_RS01615 glucose-specific PTS transporter subunit IIBC from Bacillus methanolicus MGA3
46% identity, 10% coverage

SAK_1702 PTS system, sucrose-specific IIABC component from Streptococcus agalactiae A909
46% identity, 11% coverage

spr1699 Phosphotransferase system, trehalose-specific IIBC component from Streptococcus pneumoniae R6
58% identity, 7% coverage

SPD_1664 PTS system, trehalose-specific IIABC components from Streptococcus pneumoniae D39
58% identity, 8% coverage

SP_1884 trehalose PTS system, IIABC components from Streptococcus pneumoniae TIGR4
58% identity, 8% coverage

LGAS_1669 Trehalose PTS trehalose component IIBC from Lactobacillus gasseri ATCC 33323
51% identity, 8% coverage

TC 4.A.1.2.14 / ART98499 PTS beta-glucoside transporter, EIIBCA of 672 aas and 12 predicted TMSs from Lactobacillus gasseri
51% identity, 8% coverage

SPCG_RS09625 PTS system trehalose-specific EIIBC component from Streptococcus pneumoniae CGSP14
58% identity, 8% coverage

lmo2772 similar to beta-glucoside-specific enzyme IIABC from Listeria monocytogenes EGD-e
48% identity, 10% coverage

PECL_1777 N-acetylglucosamine-specific PTS transporter subunit IIBC from Pediococcus claussenii ATCC BAA-344
50% identity, 8% coverage

LMOf2365_2762 PTS system, beta-glucoside-specific, IIABC component from Listeria monocytogenes str. 4b F2365
48% identity, 10% coverage

LEUM_0901 Trehalose PTS trehalose component IIBC from Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293
39% identity, 12% coverage

LMOf2365_1056 PTS system, beta-glucoside-specific, IIABC component from Listeria monocytogenes str. 4b F2365
55% identity, 8% coverage

spr1566 Phosphotransferase system enzyme II from Streptococcus pneumoniae R6
49% identity, 8% coverage

SP_1722 sucrose-specific PTS transporter subunit IIBC from Streptococcus pneumoniae TIGR4
49% identity, 8% coverage

SPy_0572 beta-glucoside permease IIABC component from Streptococcus pyogenes M1 GAS
48% identity, 9% coverage

CD2666 PTS system, glucose-specific IIa component from Clostridium difficile 630
45% identity, 45% coverage

CD196_2507 PTS system, glucose-specific IIa component from Clostridium difficile CD196
CDR20291_2554 PTS system, glucose-specific IIa component from Clostridium difficile R20291
45% identity, 45% coverage

CDR20291_2969 PTS system, IIabc component from Clostridium difficile R20291
CDR20291_2969 beta-glucoside-specific PTS transporter subunit IIABC from Clostridioides difficile R20291
39% identity, 11% coverage

HGB56_08700 glycoside-pentoside-hexuronide (GPH):cation symporter from Lactiplantibacillus plantarum
41% identity, 12% coverage

lmo1035 similar to phosphotransferase system (PTS) beta-glucoside-specific enzyme IIABC from Listeria monocytogenes EGD-e
55% identity, 8% coverage

KSF55_15465 PTS sugar transporter subunit IIA from Lactiplantibacillus pentosus
42% identity, 11% coverage

lp_3486 sugar transport protein from Lactobacillus plantarum WCFS1
F9UUF2 PTS regulated carbohydrate transporter, GPH family, raffinose/melibiose/galactose (Can switch between symport (H+) and antiport (Lactose)) from Lactiplantibacillus plantarum (strain ATCC BAA-793 / NCIMB 8826 / WCFS1)
42% identity, 11% coverage

LSEI_0374 Phosphotransferase system IIA component from Lactobacillus casei ATCC 334
LSEI_0374 PTS sugar transporter subunit IIA from Lacticaseibacillus paracasei ATCC 334
40% identity, 46% coverage

PTG3C_BACSU / P20166 PTS system glucose-specific EIICBA component; EII-Glc/EIII-Glc; EIICBA-Glc; EIICBA-Glc 1; EC 2.7.1.199 from Bacillus subtilis (strain 168) (see 2 papers)
TC 4.A.1.1.9 / P20166 The glucose IICBA porter (PtsG) (44% identical to 4.A.1.1.1) from Bacillus subtilis (see 8 papers)
ptsG / GB|CAB13262.1 PTS system glucose-specific EIICBA component; EC 2.7.1.-; EC 2.7.1.69 from Bacillus subtilis (see 8 papers)
ptsG / CAA77803.1 IIGlc from Bacillus subtilis (see 3 papers)
BSU13890 phosphotransferase system (PTS) glucose-specific enzyme IICBA component from Bacillus subtilis subsp. subtilis str. 168
44% identity, 10% coverage

AO353_04460 N-acetylglucosamine-specific PTS system, I, HPr, and IIA components (nagF) from Pseudomonas fluorescens FW300-N2E3
52% identity, 6% coverage

LPL9_RS10760 PTS beta-glucoside transporter subunit IIBCA from Lacticaseibacillus paracasei
45% identity, 9% coverage

NJ56_RS11145 PTS glucose transporter subunit IIA from Yersinia ruckeri
46% identity, 37% coverage

LEUM_0507 Phosphotransferase system IIA component from Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293
41% identity, 43% coverage

CD0816 PTS system, IIABC component from Clostridium difficile 630
50% identity, 8% coverage

SGO_RS09095 sucrose-specific PTS transporter subunit IIBC from Streptococcus gordonii str. Challis substr. CH1
SGO_1857 PTS system, IIABC components from Streptococcus gordonii str. Challis substr. CH1
49% identity, 8% coverage

BC_5320 PTS sugar transporter subunit IIA from Bacillus cereus ATCC 14579
BC5320, NP_834982 PTS system, glucose-specific IIA component from Bacillus cereus ATCC 14579
41% identity, 42% coverage

BTF1_24980 PTS sugar transporter subunit IIA from Bacillus thuringiensis HD-789
41% identity, 42% coverage

lp_2969 N-acetylglucosamine PTS, EIICBA from Lactobacillus plantarum WCFS1
46% identity, 9% coverage

RUO99_14415 glycoside-pentoside-hexuronide (GPH):cation symporter from Lactiplantibacillus plantarum
42% identity, 11% coverage

CRIB_2018 PTS sugar transporter subunit IIA from Romboutsia ilealis
41% identity, 45% coverage

EfmE1162_1485 PTS transporter subunit IIBC from Enterococcus faecium E1162
40% identity, 10% coverage

YPO2995 PTS system, glucose-specific IIA component from Yersinia pestis CO92
y1485 PTS system, glucose-specific IIA component from Yersinia pestis KIM
YPTB2717 PTS system, glucose-specific IIA component, permease from Yersinia pseudotuberculosis IP 32953
44% identity, 37% coverage

SSA_0456 Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific, putative from Streptococcus sanguinis SK36
42% identity, 11% coverage

lp_3513 beta-glucosides PTS, EIIBCA from Lactobacillus plantarum WCFS1
F9UUH8 PTS system, beta-glucosides-specific EIIBCA component from Lactiplantibacillus plantarum (strain ATCC BAA-793 / NCIMB 8826 / WCFS1)
36% identity, 12% coverage

lmo1017 similar to phosphotransferase system glucose-specific enzyme IIA from Listeria monocytogenes EGD-e
52% identity, 34% coverage

LMOf2365_1038 PTS system, glucose-specific, IIA component, putative from Listeria monocytogenes str. 4b F2365
52% identity, 34% coverage

LM6179_1334 PTS sugar transporter subunit IIA from Listeria monocytogenes 6179
LMOf6854_1066 PTS system, glucose-specific, IIA component, putative from Listeria monocytogenes str. 1/2a F6854
52% identity, 34% coverage

A5I073 PTS system, beta-glucoside-specific IIabc component from Clostridium botulinum (strain Hall / ATCC 3502 / NCTC 13319 / Type A)
CBO0881 PTS system, beta-glucoside-specific IIabc component from Clostridium botulinum A str. ATCC 3502
47% identity, 10% coverage

CD630_04690 PTS beta-glucoside transporter subunit IIBCA from Clostridioides difficile 630
CD0469 PTS system, IIabc component from Clostridium difficile 630
41% identity, 11% coverage

SSU05_1490 Phosphotransferase system IIC component, glucose/maltose/N-acetylglucosamine-specific from Streptococcus suis 05ZYH33
SSU98_1501 Phosphotransferase system IIC component, glucose/maltose/N-acetylglucosamine-specific from Streptococcus suis 98HAH33
SSU1309 beta-glucoside-specific phosphotransferase system (PTS), IIABC component from Streptococcus suis P1/7
54% identity, 8% coverage

HI1711 PTS system, glucose-specific IIA component (crr) from Haemophilus influenzae Rd KW20
43% identity, 38% coverage

USA300HOU_0276 PTS family glucose/glucoside (glc) porter component IIABC from Staphylococcus aureus subsp. aureus USA300_TCH1516
SAUSA300_0259 PTS system, IIA component from Staphylococcus aureus subsp. aureus USA300_FPR3757
46% identity, 20% coverage

ETAE_1130 glucose-specific PTS system component from Edwardsiella tarda EIB202
46% identity, 37% coverage

Halsa_0150 PTS sugar transporter subunit IIA from Halanaerobium hydrogeniformans
42% identity, 38% coverage

PFLU5027 putative multiphosphoryl transfer protein from Pseudomonas fluorescens SBW25
46% identity, 6% coverage

EHLA_0193 PTS transporter subunit IIBC from Anaerobutyricum hallii
43% identity, 9% coverage

D9QA35 PTS transporter subunit EIIC from Corynebacterium pseudotuberculosis (strain C231)
CpC231_0932 glucose PTS transporter subunit IIA from Corynebacterium pseudotuberculosis C231
48% identity, 9% coverage

Q5A_018245 PTS glucose transporter subunit IIA from Serratia inhibens PRI-2C
44% identity, 37% coverage

SERP_RS10495 glucose-specific PTS transporter subunit IIBC from Staphylococcus epidermidis RP62A
Q5HL73 PTS system glucose-specific EIICBA component from Staphylococcus epidermidis (strain ATCC 35984 / DSM 28319 / BCRC 17069 / CCUG 31568 / BM 3577 / RP62A)
SERP2114 PTS system, IIABC components from Staphylococcus epidermidis RP62A
42% identity, 11% coverage

RDJ18_RS01200 glucose PTS transporter subunit IIA from Staphylococcus aureus
SAR0263 putative PTS transport system protein from Staphylococcus aureus subsp. aureus MRSA252
44% identity, 20% coverage

LP_RS00755, lp_0185 sucrose-specific PTS transporter subunit IIBC from Lactiplantibacillus plantarum WCFS1
lp_0185 sucrose PTS, EIIBCA from Lactobacillus plantarum WCFS1
45% identity, 8% coverage

P43470 PTS system sucrose-specific EIIBCA component from Pediococcus pentosaceus
45% identity, 8% coverage

OG1RF_11317 sucrose-specific PTS transporter subunit IIBC from Enterococcus faecalis OG1RF
37% identity, 12% coverage

U876_06285 PTS glucose transporter subunit IIA from Aeromonas hydrophila NJ-35
44% identity, 37% coverage

VP0793 PTS system, glucose-specific IIA component from Vibrio parahaemolyticus RIMD 2210633
44% identity, 37% coverage

LLKF_0663 PTS system sucrose-specific transporter subunit IIABC from Lactococcus lactis subsp. lactis KF147
40% identity, 11% coverage

AO356_17540 N-acetylglucosamine-specific PTS system, I, HPr, and IIA components (nagF) from Pseudomonas fluorescens FW300-N2C3
50% identity, 6% coverage

M7W_1488 PTS glucose transporter subunit IIA from Enterococcus faecium ATCC 8459 = NRRL B-2354
44% identity, 36% coverage

SA0255 hypothetical protein from Staphylococcus aureus subsp. aureus N315
44% identity, 20% coverage

BglS / b3722 β-glucoside specific PTS enzyme II / BglG kinase / BglG phosphatase (EC 2.7.1.199) from Escherichia coli K-12 substr. MG1655 (see paper)
bglF / P08722 β-glucoside specific PTS enzyme II / BglG kinase / BglG phosphatase (EC 2.7.1.199) from Escherichia coli (strain K12) (see 59 papers)
TC 4.A.1.2.2 / P08722 β-Glucoside (salicin, arbutin, cellobiose, etc) group translocator, BglF from Escherichia coli (see 6 papers)
NP_418178 beta-glucoside specific PTS enzyme II/BglG kinase/BglG phosphatase from Escherichia coli str. K-12 substr. MG1655
b3722 fused beta-glucoside-specific PTS enzymes: IIA component/IIB component/IIC component from Escherichia coli str. K-12 substr. MG1655
48% identity, 10% coverage

NWMN_0199 PTS system, IIA component from Staphylococcus aureus subsp. aureus str. Newman
SAOUHSC_00235 hypothetical protein from Staphylococcus aureus subsp. aureus NCTC 8325
44% identity, 20% coverage

lp_0264 glucose PTS transporter subunit IIA from Lactiplantibacillus plantarum WCFS1
F9UT61 PTS system, trehalose-specific IIBC component from Lactiplantibacillus plantarum (strain ATCC BAA-793 / NCIMB 8826 / WCFS1)
lp_0264 beta-glucosides PTS, EIIABC from Lactobacillus plantarum WCFS1
46% identity, 9% coverage

NJ74_RS04255 PTS beta-glucoside transporter subunit IIABC from Escherichia coli DH5α
48% identity, 10% coverage

Clocel_2778 glucose-specific PTS transporter subunit IIBC from Clostridium cellulovorans 743B
39% identity, 9% coverage

Q46072 protein-Npi-phosphohistidine-D-mannose phosphotransferase (EC 2.7.1.191); protein-Npi-phosphohistidine-D-glucose phosphotransferase (EC 2.7.1.199) from Corynebacterium glutamicum (see 2 papers)
ptsM / AAA53546.1 phosphoenolpyruvate sugar phosphotransferase from Corynebacterium glutamicum (see 2 papers)
NCgl1305 glucose PTS transporter subunit IIA from Corynebacterium glutamicum ATCC 13032
cg1537 glucose-specific enzyme II BC component of PTS from Corynebacterium glutamicum ATCC 13032
45% identity, 9% coverage

TC 4.A.3.2.5 / Q8L3C4 IIA aka Crr, component of The N,N'-diacetylchitobiose Enzyme II (Toratani et al., 2008) (>80% identical to the E. coli enzyme (4.A.3.2.1)) from Serratia marcescens (see paper)
43% identity, 37% coverage

NWMN_RS14005 glucose-specific PTS transporter subunit IIBC from Staphylococcus aureus subsp. aureus str. Newman
SAOUHSC_02848 PTS system glucose-specific IIABC component from Staphylococcus aureus subsp. aureus NCTC 8325
SAUSA300_2476 phosphotransferase system, glucose-specific IIABC component from Staphylococcus aureus subsp. aureus USA300_FPR3757
SACOL2552 PTS system, IIABC components from Staphylococcus aureus subsp. aureus COL
40% identity, 10% coverage

NO343_03030 PTS sugar transporter subunit IIA from Mycoplasma capricolum subsp. capricolum
52% identity, 34% coverage

SAR2618 PTS system, glucose-specific IIABC component from Staphylococcus aureus subsp. aureus MRSA252
40% identity, 10% coverage

KQ76_13275 glucose-specific PTS transporter subunit IIBC from Staphylococcus aureus
40% identity, 10% coverage

RDJ18_RS13495 glucose-specific PTS transporter subunit IIBC from Staphylococcus aureus
40% identity, 10% coverage

PTS3B_STRMU / P12655 PTS system sucrose-specific EIIBCA component; EIIBCA-Scr; EII-Scr; EC 2.7.1.211 from Streptococcus mutans serotype c (strain ATCC 700610 / UA159) (see 2 papers)
P12655 protein-Npi-phosphohistidine-sucrose phosphotransferase (EC 2.7.1.211) from Streptococcus mutans serotype c (see 3 papers)
scrA / GB|AAN59464.1 PTS system, sucrose-specific, EIIBCA component; EC 2.7.1.69 from Streptococcus mutans (see paper)
SMUGS5_08275, SMU_1841, SMU_RS08435 sucrose-specific PTS transporter subunit IIBC from Streptococcus mutans UA159
40% identity, 11% coverage

PBPRA0861 putative PTS system, glucose-specific IIA component from Photobacterium profundum SS9
43% identity, 37% coverage

KSF55_00925 sucrose-specific PTS transporter subunit IIBC from Lactiplantibacillus pentosus
45% identity, 8% coverage

lmo2787 beta-glucoside-specific phosphotransferase enzyme II ABC component from Listeria monocytogenes EGD-e
38% identity, 12% coverage

BB562_00810 sucrose-specific PTS transporter subunit IIBC from Lactiplantibacillus pentosus
45% identity, 8% coverage

CAC1354 Phosphotransferase system IIA component from Clostridium acetobutylicum ATCC 824
46% identity, 36% coverage

J3U91_00728 PTS sugar transporter subunit IIA from Oenococcus oeni
46% identity, 33% coverage

BPHYT_RS02740 N-acetylglucosamine-specific PTS system, I, HPr, and IIA components (nagF) from Burkholderia phytofirmans PsJN
48% identity, 6% coverage

MS1508 NagE protein from Mannheimia succiniciproducens MBEL55E
MS1508 PTS glucose transporter subunit IIA from [Mannheimia] succiniciproducens MBEL55E
40% identity, 38% coverage

TC 4.A.1.2.12 / Q8NMD6 The sucrose porter, PtsS (regulated by SugR which also regulates other enzymes II) from Corynebacterium glutamicum (Brevibacterium flavum) (see paper)
NCgl2553 sucrose-specific PTS transporter subunit IIBC from Corynebacterium glutamicum ATCC 13032
cg2925 enzyme II sucrose protein from Corynebacterium glutamicum ATCC 13032
48% identity, 8% coverage

TC 4.A.1.1.15 / Q9HXN5 N-Acetyl-D-Glucosamine phosphotransferase system transporter, component of N-acetyl glucosamine-specific PTS permease, GlcNAc IIBC/GlcNAc I-HPr-IIA from Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 / 1C / PRS 101 / LMG 12228)
PA3760 probable phosphotransferase protein from Pseudomonas aeruginosa PAO1
49% identity, 6% coverage

AH67_02320 glucose PTS transporter subunit IIA from Bifidobacterium pseudolongum PV8-2
44% identity, 8% coverage

EFA0067 PTS system, IIABC components from Enterococcus faecalis V583
47% identity, 9% coverage

PA14_15790 putative phosphoenolpyruvate-protein phosphotransferase from Pseudomonas aeruginosa UCBPP-PA14
49% identity, 6% coverage

MNF30_03865 PTS sugar transporter subunit IIA from Mycoplasma mycoides subsp. capri
50% identity, 34% coverage

CTK_C27920 PTS sugar transporter subunit IIA from Clostridium tyrobutyricum
41% identity, 35% coverage

Bbr_1879 PTS sugar transporter subunit IIA from Bifidobacterium breve UCC2003
48% identity, 29% coverage

LSA1792 Sucrose-specific phosphotransferase system, enzyme IIBCA from Lactobacillus sakei subsp. sakei 23K
LCA_RS08985 sucrose-specific PTS transporter subunit IIBC from Latilactobacillus sakei subsp. sakei 23K
40% identity, 11% coverage

AO353_15995 trehalose-specific PTS system, I, HPr, and IIA components from Pseudomonas fluorescens FW300-N2E3
44% identity, 8% coverage

Halsa_1861 PTS sugar transporter subunit IIA from Halanaerobium hydrogeniformans
40% identity, 36% coverage

VC0964 PTS system, glucose-specific IIA component from Vibrio cholerae O1 biovar eltor str. N16961
43% identity, 37% coverage

Lreu_1086 sugar (glycoside-Pentoside-hexuronide) transporter from Lactobacillus reuteri DSM 20016
Lreu_1086 PTS sugar transporter subunit IIA from Limosilactobacillus reuteri subsp. reuteri
38% identity, 11% coverage

GJQ69_01220 PTS transporter subunit IIABC from Caproicibacterium lactatifermentans
47% identity, 7% coverage

LSEI_0631 beta-glucoside-specific PTS system IIABC component from Lactobacillus casei ATCC 334
42% identity, 8% coverage

MW0241 ORFID:MW0241~hypothetical protein, similar to PTS beta-glucoside-specific enzyme II, ABC component from Staphylococcus aureus subsp. aureus MW2
42% identity, 20% coverage

LGG_00603 PTS system, beta-glucoside-specific IIABC component from Lactobacillus rhamnosus GG
44% identity, 8% coverage

STER_1367, STER_RS06730, T303_07870 PTS sugar transporter subunit IIA from Streptococcus thermophilus LMD-9
42% identity, 9% coverage

TC 2.A.2.2.1 / P23936 Lactose permease, LacS. Mediates uptake of β-galactooligosaccharides, lactitol, and a broad range of prebiotic β-galactosides that selectively stimulate beneficial gut microbiota from Streptococcus thermophilus (see 2 papers)
42% identity, 9% coverage

CC0448 PTS system, fructose-specific EIIA/HPr/EI components from Caulobacter crescentus CB15
46% identity, 6% coverage

Blon_2470 PTS system, glucose subfamily, IIA subunit from Bifidobacterium longum subsp. infantis ATCC 15697
46% identity, 29% coverage

Blon_2183 PTS system, glucose subfamily, IIA subunit from Bifidobacterium longum subsp. infantis ATCC 15697
43% identity, 8% coverage

LGAS_1778 Sucrose PTS, EIIBCA from Lactobacillus gasseri ATCC 33323
39% identity, 11% coverage

SAV2538 PTS system, glucose-specific II ABC component from Staphylococcus aureus subsp. aureus Mu50
SA2326 PTS system, glucose-specific IIABC component from Staphylococcus aureus subsp. aureus N315
39% identity, 10% coverage

TC 4.A.1.2.16 / ART98386 PTS beta-glucoside transporter, EIIBCA of 647 aas and 10 predicted TMSs from Lactobacillus gasseri
39% identity, 11% coverage

1glcF / P69783 Cation promoted association (cpa) of a regulatory and target protein is controlled by phosphorylation (see paper)
43% identity, 39% coverage

stu1398 lactose permease from Streptococcus thermophilus LMG 18311
str1398 sodium:beta-glucoside symporter from Streptococcus thermophilus CNRZ1066
42% identity, 9% coverage

TC 4.A.1.2.15 / ART98417 PTS beta-glucoside transporter, EIIBCA of 624 aas and 10 predicted TMSs from Lactobacillus gasseri
41% identity, 11% coverage

Crr / b2417 Enzyme IIAGlc (EC 2.7.1.199; EC 2.7.1.201; EC 2.7.1.192) from Escherichia coli K-12 substr. MG1655 (see 47 papers)
Crr / P69783 Enzyme IIAGlc (EC 2.7.1.199) from Escherichia coli (strain K12) (see 46 papers)
PTGA_ECOLI / P69783 PTS system glucose-specific EIIA component; EIIA-Glc; EIII-Glc; Glucose-specific phosphotransferase enzyme IIA component from Escherichia coli (strain K12) (see 6 papers)
P69783 protein-Npi-phosphohistidine-D-glucose phosphotransferase (EC 2.7.1.199) from Escherichia coli (see 2 papers)
TC 4.A.1.1.1 / P69783 Glucose-specific phosphotransferase enzyme IIA component PTGA aka CRR aka GSR aka IEX aka TGS aka TRED aka B2417, component of Glucose porter (PtsG; GlcA; Umg) (transports D-glucose and α-methyl-D-glucopyranoside) from Escherichia coli (see 16 papers)
crr / GB|AAC75470.1 glucose-specific phosphotransferase enzyme IIA component; EC 2.7.1.- from Escherichia coli K12 (see 16 papers)
b2417 glucose-specific PTS system enzyme IIA component from Escherichia coli str. K-12 substr. MG1655
NP_416912 Enzyme IIA(Glc) from Escherichia coli str. K-12 substr. MG1655
43% identity, 37% coverage

BCUN_1552 glucose PTS transporter subunit IIA from Bifidobacterium cuniculi
44% identity, 8% coverage

Bbr_1594 glucose PTS transporter subunit IIA from Bifidobacterium breve UCC2003
43% identity, 8% coverage

BAD_RS01940 glucose PTS transporter subunit IIA from Bifidobacterium adolescentis ATCC 15703
43% identity, 9% coverage

BBMN68_1665 glucose PTS transporter subunit IIA from Bifidobacterium longum subsp. longum BBMN68
43% identity, 7% coverage

PMI1830 PTS family enzyme IIA component from Proteus mirabilis HI4320
41% identity, 37% coverage

AH68_02050 glucose PTS transporter subunit IIA from Bifidobacterium catenulatum PV20-2
46% identity, 8% coverage

EF0958 PTS system, IIABC components from Enterococcus faecalis V583
43% identity, 8% coverage

OG1RF_10684 PTS transporter subunit IIBC from Enterococcus faecalis OG1RF
43% identity, 8% coverage

PS417_23035 D-trehalose PTS system, I, HPr, and IIA components from Pseudomonas simiae WCS417
40% identity, 7% coverage

PTGA_SALTY / P0A283 PTS system glucose-specific EIIA component; EIIA-Glc; EIII-Glc; Glucose-specific phosphotransferase enzyme IIA component from Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) (see 3 papers)
SPC_1226 glucose-specific PTS system enzyme IIA component from Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594
STM2433 PTS family, glucose-specific IIA component from Salmonella typhimurium LT2
41% identity, 37% coverage

ETAE_2616 PTS system, N-acetylglucosamine-specific IIBC subunit from Edwardsiella tarda EIB202
44% identity, 8% coverage

Dred_0332 PTS system, glucose subfamily, IIA subunit from Desulfotomaculum reducens MI-1
38% identity, 11% coverage

ECA_RS21710 beta-glucoside-specific PTS transporter subunit IIABC from Pectobacterium atrosepticum SCRI1043
51% identity, 7% coverage

CAC3427 PTS system, (possibly glucose-specific) IIA component from Clostridium acetobutylicum ATCC 824
40% identity, 36% coverage

ECs3289 glucose-specific PTS system IIA component from Escherichia coli O157:H7 str. Sakai
43% identity, 37% coverage

lp_3240 beta-glucosides PTS, EIIABC from Lactobacillus plantarum WCFS1
45% identity, 9% coverage

YPTB1120 pts system, N-acetylglucosamine-specific IIABC component from Yersinia pseudotuberculosis IP 32953
42% identity, 8% coverage

lp_3240 glucose PTS transporter subunit IIA from Lactiplantibacillus plantarum WCFS1
45% identity, 9% coverage

BSQ49_00775 sucrose-specific PTS transporter subunit IIBC from Liquorilactobacillus hordei
43% identity, 9% coverage

BB0559 PTS system, glucose-specific IIA component (crr) from Borrelia burgdorferi B31
35% identity, 33% coverage

BL1632 PtsG from Bifidobacterium longum NCC2705
41% identity, 7% coverage

PMI2226 PTS system, IIabc component from Proteus mirabilis HI4320
47% identity, 8% coverage

SG0859 PTS system N-acetylglucosamine-specific IIABC component from Sodalis glossinidius str. 'morsitans'
40% identity, 8% coverage

NH13_02050 sucrose-specific PTS transporter subunit IIBC from Lactobacillus acidophilus
51% identity, 8% coverage

LBA1705 PTS system IIBC component from Lactobacillus acidophilus NCFM
42% identity, 10% coverage

ESA_02545 beta-glucoside-specific PTS transporter subunit IIABC from Cronobacter sakazakii ATCC BAA-894
ESA_02545 hypothetical protein from Enterobacter sakazakii ATCC BAA-894
42% identity, 10% coverage

LBA0725 phosphotransferase system enzyme II from Lactobacillus acidophilus NCFM
40% identity, 9% coverage

P43466 Raffinose carrier protein from Pediococcus pentosaceus
49% identity, 8% coverage

B1745_01765 sucrose-specific PTS transporter subunit IIBC from Lactobacillus amylolyticus
49% identity, 8% coverage

Cbei_4533 PTS system, glucose subfamily, IIA subunit from Clostridium beijerincki NCIMB 8052
42% identity, 36% coverage

P45604 protein-Npi-phosphohistidine-N-acetyl-D-glucosamine phosphotransferase (EC 2.7.1.193) from Klebsiella pneumoniae (see paper)
42% identity, 8% coverage


For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 798,070 different protein sequences to 1,261,478 scientific articles. Searches against EuropePMC were last performed on May 12 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.
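The E-value cutoff can be illustrated with a minimal sketch. It assumes the hits are available in BLAST's standard 12-column tabular format (-outfmt 6); the function name, the returned fields, and the sort by bit score are illustrative assumptions, not PaperBLAST's own code.

```python
import csv
import io

# Standard column order of BLAST tabular output (-outfmt 6).
BLAST_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]

E_VALUE_CUTOFF = 0.001  # threshold stated in the text above


def filter_hits(tabular_text, cutoff=E_VALUE_CUTOFF):
    """Keep hits with E < cutoff; return them sorted by descending bit score.

    The sort order is an assumption made for this sketch.
    """
    reader = csv.DictReader(
        io.StringIO(tabular_text), fieldnames=BLAST_COLUMNS, delimiter="\t"
    )
    hits = []
    for row in reader:
        evalue = float(row["evalue"])
        if evalue < cutoff:
            hits.append({
                "subject": row["sseqid"],
                "identity": float(row["pident"]),
                "evalue": evalue,
                "bitscore": float(row["bitscore"]),
            })
    return sorted(hits, key=lambda hit: hit["bitscore"], reverse=True)


# Two made-up rows (hypothetical identifiers and scores) for illustration.
example = (
    "query\thitA\t43.0\t70\t40\t0\t1\t70\t5\t74\t2e-10\t55.1\n"
    "query\thitB\t30.0\t40\t28\t0\t1\t40\t9\t48\t0.5\t20.3\n"
)
kept = filter_hits(example)
```

Only the first made-up row passes the cutoff here, because its E-value (2e-10) is below 0.001 while the second row's (0.5) is not.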

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)
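A query of the form "locus_tag AND genus_name" could be assembled roughly as follows. This is a sketch under stated assumptions: the helper names are hypothetical, taking the first word of the organism name as the genus is a simplification, and the URL targets Europe PMC's public REST search endpoint rather than whatever interface PaperBLAST itself uses.

```python
from urllib.parse import urlencode

# Europe PMC's public REST search endpoint (assumed here for illustration).
EUROPEPMC_SEARCH = "https://www.ebi.ac.uk/europepmc/webservices/rest/search"


def build_query(locus_tag, organism):
    """Build a 'locus_tag AND genus_name' query string.

    The genus is taken as the first word of the organism name, with any
    square brackets removed (as in '[Mannheimia] succiniciproducens').
    """
    genus = organism.split()[0].strip("[]")
    return f"{locus_tag} AND {genus}"


def build_search_url(locus_tag, organism):
    """Return a search URL for the query; no network request is made."""
    params = {"query": build_query(locus_tag, organism), "format": "json"}
    return f"{EUROPEPMC_SEARCH}?{urlencode(params)}"


query = build_query("SA0183", "Staphylococcus aureus subsp. aureus N315")
url = build_search_url("SA0183", "Staphylococcus aureus subsp. aureus N315")
```

Requiring the genus alongside the locus tag narrows the match to papers that plausibly discuss that organism, which is the purpose the text gives for this query form.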

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.
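Snippet selection might look something like the sketch below. PaperBLAST's actual heuristics are not described here, so everything in it is an assumption: the text is split into sentences on terminal punctuation, and the first one or two sentences that contain the identifier as a whole token are returned.

```python
import re


def select_snippets(text, identifier, max_snippets=2):
    """Return up to max_snippets sentences that mention the identifier.

    The identifier must appear as a whole token, so 'SA0183' does not
    match inside 'SA01830'. Sentence splitting is deliberately naive.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    pattern = re.compile(
        r"(?<![A-Za-z0-9_])" + re.escape(identifier) + r"(?![A-Za-z0-9_])"
    )
    snippets = []
    for sentence in sentences:
        if pattern.search(sentence):
            snippets.append(sentence)
            if len(snippets) == max_snippets:
                break
    return snippets


# Made-up article text (hypothetical) for illustration.
article = (
    "We deleted SA0183 in strain N315. Growth was unaffected. "
    "SA01830 is unrelated. The SA0183 mutant lacked glucose uptake. "
    "SA0183 was also induced."
)
snippets = select_snippets(article, "SA0183")
```

The same function can be run on an abstract when the full text is unavailable; it returns an empty list if the identifier never appears, which matches the observation that abstracts rarely contain locus tags.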

PaperBLAST also incorporates manually curated protein functions from the curated databases listed above.

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

Changes to PaperBLAST since the paper was written:

Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's website and is available in PubMed Central or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifiers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances.
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory