PaperBLAST – Find papers about a protein or its homologs

 

PaperBLAST

PaperBLAST Hits for tr|Q9HUQ6|Q9HUQ6_PSEAE Probable short-chain dehydrogenase OS=Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) OX=208964 GN=PA4907 PE=1 SV=1 (253 a.a., MSSTLFITGA...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmenbrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Fitness BLAST: loading...

Found 252 similar proteins in the literature:

PA4907 probable short-chain dehydrogenase from Pseudomonas aeruginosa PAO1
100% identity, 100% coverage

PP_4862 oxidoreductase, short chain dehydrogenase/reductase family from Pseudomonas putida KT2440
80% identity, 99% coverage

VK055_1961 SDR family NAD(P)-dependent oxidoreductase from Klebsiella pneumoniae subsp. pneumoniae
62% identity, 99% coverage

isfD / D3U1D9 sulfoacetaldehyde reductase (NADPH) (EC 1.1.1.313) from Klebsiella oxytoca (see 2 papers)
ISFD_KLEOX / D3U1D9 Sulfoacetaldehyde reductase; Isethionate formation reductase; EC 1.1.1.313 from Klebsiella oxytoca (see 2 papers)
D3U1D9 sulfoacetaldehyde reductase (EC 1.1.1.313) from Klebsiella oxytoca (see 2 papers)
61% identity, 99% coverage

6ixjA / D3U1D9 The crystal structure of sulfoacetaldehyde reductase from klebsiella oxytoca (see paper)
61% identity, 99% coverage

ISFD2_CHRSD / Q1QU27 Sulfoacetaldehyde reductase 2; Isethionate formation reductase 2; EC 1.1.1.313 from Chromohalobacter salexigens (strain ATCC BAA-138 / DSM 3043 / CIP 106854 / NCIMB 13768 / 1H11) (see paper)
63% identity, 100% coverage

isfD / Q1R183 sulfoacetaldehyde reductase monomer (EC 1.1.1.313) from Chromohalobacter salexigens (strain ATCC BAA-138 / DSM 3043 / CIP 106854 / NCIMB 13768 / 1H11) (see paper)
ISFD_CHRSD / Q1R183 Sulfoacetaldehyde reductase; Isethionate formation reductase; EC 1.1.1.313 from Chromohalobacter salexigens (strain ATCC BAA-138 / DSM 3043 / CIP 106854 / NCIMB 13768 / 1H11) (see paper)
Q1R183 sulfoacetaldehyde reductase (EC 1.1.1.313) from Chromohalobacter salexigens (see paper)
Csal_0161 short-chain dehydrogenase/reductase SDR from Chromohalobacter salexigens DSM 3043
60% identity, 100% coverage

YPTB2195 probable oxidoreductase in dcp-nohA intergenic region from Yersinia pseudotuberculosis IP 32953
56% identity, 98% coverage

Q8U8I2 Serine 3-dehydrogenase from Agrobacterium fabrum (strain C58 / ATCC 33970)
57% identity, 100% coverage

Smlt0396 putative oxidoreductase from Stenotrophomonas maltophilia K279a
58% identity, 100% coverage

ZMO0226 short-chain dehydrogenase/reductase SDR from Zymomonas mobilis subsp. mobilis ZM4
59% identity, 99% coverage

SDH_RHIRD / Q9KWN1 Serine 3-dehydrogenase; EC 1.1.1.276 from Rhizobium radiobacter (Agrobacterium tumefaciens) (Agrobacterium radiobacter) (see paper)
56% identity, 100% coverage

plu2233 No description from Photorhabdus luminescens subsp. laumondii TTO1
54% identity, 98% coverage

CFF8240_0977 NADP-dependent l-serine/l-allo-threonine dehydrogenase ydfg from Campylobacter fetus subsp. fetus 82-40
52% identity, 100% coverage

BPSL2759 putative short-chain dehydrogenase from Burkholderia pseudomallei K96243
56% identity, 98% coverage

SEET0819_14220 bifunctional NADP-dependent 3-hydroxy acid dehydrogenase/3-hydroxypropionate dehydrogenase YdfG from Salmonella enterica subsp. enterica serovar Tennessee str.
54% identity, 98% coverage

c1965 Probable oxidoreductase ydfG from Escherichia coli CFT073
52% identity, 98% coverage

PMI1288 NADP-dependent L-serine/L-allo-threonine dehydrogenase from Proteus mirabilis HI4320
53% identity, 98% coverage

YdfG / b1539 3-hydroxy acid dehydrogenase YdfG (EC 1.1.1.298) from Escherichia coli K-12 substr. MG1655 (see 7 papers)
ydfG / P39831 3-hydroxy acid dehydrogenase YdfG (EC 1.1.1.298) from Escherichia coli (strain K12) (see 6 papers)
YDFG_ECOLI / P39831 NADP-dependent 3-hydroxy acid dehydrogenase YdfG; L-allo-threonine dehydrogenase; Malonic semialdehyde reductase; EC 1.1.1.381; EC 1.1.1.298 from Escherichia coli (strain K12) (see 2 papers)
P39831 3-hydroxy acid dehydrogenase (EC 1.1.1.381) from Escherichia coli (see paper)
3asvA The closed form of serine dehydrogenase complexed with NADP+ (see paper)
NP_416057 3-hydroxy acid dehydrogenase YdfG from Escherichia coli str. K-12 substr. MG1655
b1539 L-allo-threonine dehydrogenase, NAD(P)-binding from Escherichia coli str. K-12 substr. MG1655
52% identity, 98% coverage

HI1430 short chain dehydrogenase/reductase from Haemophilus influenzae Rd KW20
51% identity, 100% coverage

ETAE_1724 short-chain alcohol dehydrogenase of unknown specificity from Edwardsiella tarda EIB202
53% identity, 98% coverage

NGO1079 putative oxidoreductase from Neisseria gonorrhoeae FA 1090
51% identity, 90% coverage

DJ41_1604, KZA74_05010 SDR family oxidoreductase from Acinetobacter baumannii ATCC 19606 = CIP 70.34 = JCM 6841
52% identity, 94% coverage

gbs1158 unknown from Streptococcus agalactiae NEM316
49% identity, 98% coverage

CJJ81176_0850 oxidoreductase, short chain dehydrogenase/reductase family from Campylobacter jejuni subsp. jejuni 81-176
48% identity, 99% coverage

Cj0833c oxidoreductase from Campylobacter jejuni subsp. jejuni NCTC 11168
48% identity, 99% coverage

HP0357 short chain alcohol dehydrogenase from Helicobacter pylori 26695
49% identity, 98% coverage

APA386B_2052 SDR family NAD(P)-dependent oxidoreductase from Acetobacter pasteurianus 386B
49% identity, 97% coverage

all0475 probable short-chain dehydrogenase from Nostoc sp. PCC 7120
48% identity, 98% coverage

PP0488, PP_0488 oxidoreductase, short chain dehydrogenase/reductase family from Pseudomonas putida KT2440
47% identity, 98% coverage

PGN_0711 putative oxidoreductase short chain dehydrogenase/reductase family from Porphyromonas gingivalis ATCC 33277
48% identity, 97% coverage

PG0676 oxidoreductase, short chain dehydrogenase/reductase family from Porphyromonas gingivalis W83
48% identity, 97% coverage

CBU_1513 oxidoreductase, short chain dehydrogenase/reductase family from Coxiella burnetii RSA 493
Q83BJ5 Short chain dehydrogenase from Coxiella burnetii (strain RSA 493 / Nine Mile phase I)
41% identity, 93% coverage

PtVFX2014_13425 SDR family NAD(P)-dependent oxidoreductase from Legionella pneumophila
42% identity, 95% coverage

XP_001689640 uncharacterized protein from Chlamydomonas reinhardtii
43% identity, 95% coverage

YI13_SCHPO / Q9P7B4 NADP-dependent 3-hydroxy acid dehydrogenase; L-allo-threonine dehydrogenase; EC 1.1.1.381 from Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast) (see paper)
SPAC521.03 short chain dehydrogenase (predicted) from Schizosaccharomyces pombe
40% identity, 97% coverage

SP_1909 short chain dehydrogenase/reductase family oxidoreductase from Streptococcus pneumoniae TIGR4
42% identity, 100% coverage

SPD_RS09115 SDR family NAD(P)-dependent oxidoreductase from Streptococcus pneumoniae D39
41% identity, 100% coverage

M5005_Spy_0852 short chain dehydrogenase from Streptococcus pyogenes MGAS5005
48% identity, 75% coverage

SJAG_01690 NADP-dependent L-serine/L-allo-threonine dehydrogenase ydfG from Schizosaccharomyces japonicus yFS275
38% identity, 96% coverage

PAAG_02354 serine 3-dehydrogenase from Paracoccidioides lutzii Pb01
36% identity, 93% coverage

SJAG_01492 NADP-dependent L-serine/L-allo-threonine dehydrogenase ydfG from Schizosaccharomyces japonicus yFS275
35% identity, 96% coverage

Rv0484c PROBABLE SHORT-CHAIN TYPE OXIDOREDUCTASE from Mycobacterium tuberculosis H37Rv
40% identity, 95% coverage

NCgl2122 SDR family NAD(P)-dependent oxidoreductase from Corynebacterium glutamicum ATCC 13032
38% identity, 94% coverage

YM71_YEAST / Q05016 NADP-dependent 3-hydroxy acid dehydrogenase; L-allo-threonine dehydrogenase; EC 1.1.1.-; EC 1.1.1.381 from Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (see 3 papers)
Q05016 3-hydroxy acid dehydrogenase (EC 1.1.1.381) from Saccharomyces cerevisiae (see paper)
YMR226C NADP(+)-dependent dehydrogenase; acts on serine, L-allo-threonine, and other 3-hydroxy acids; green fluorescent protein fusion protein localizes to the cytoplasm and nucleus; may interact with ribosomes, based on co-purification experiments from Saccharomyces cerevisiae
34% identity, 91% coverage

3rkuA / Q05016 Substrate fingerprint and the structure of NADP+ dependent serine dehydrogenase from saccharomyces cerevisiae complexed with NADP+
34% identity, 90% coverage

SAS2370 putative short chain dehydrogenase from Staphylococcus aureus subsp. aureus MSSA476
33% identity, 93% coverage

CNC05450 oxidoreductase from Cryptococcus neoformans var. neoformans JEC21
35% identity, 79% coverage

SACOL2488 oxidoreductase, short-chain dehydrogenase/reductase family from Staphylococcus aureus subsp. aureus COL
Q2FE21 Uncharacterized oxidoreductase SAUSA300_2422 from Staphylococcus aureus (strain USA300)
SAUSA300_2422 oxidoreductase, short-chain dehydrogenase/reductase family from Staphylococcus aureus subsp. aureus USA300_FPR3757
NWMN_2375 NAD-dependent epimerase/dehydratase from Staphylococcus aureus subsp. aureus str. Newman
33% identity, 93% coverage

FRAAL4695 Putative oxidoreductase, short chain dehydrogenase family. from Frankia alni ACN14a
37% identity, 97% coverage

SAR2567 putative short chain dehydrogenase from Staphylococcus aureus subsp. aureus MRSA252
Q6GDV6 Uncharacterized oxidoreductase SAR2567 from Staphylococcus aureus (strain MRSA252)
33% identity, 93% coverage

Q7A3L9 Uncharacterized oxidoreductase SA2266 from Staphylococcus aureus (strain N315)
Q99RF5 Uncharacterized oxidoreductase SAV2478 from Staphylococcus aureus (strain Mu50 / ATCC 700699)
SA2266 hypothetical protein from Staphylococcus aureus subsp. aureus N315
SAV2478 similar to oxidoreductase from Staphylococcus aureus subsp. aureus Mu50
32% identity, 93% coverage

XP_002112422 uncharacterized protein from Trichoplax adhaerens
35% identity, 19% coverage

N2J7Q2 Oxidoreductase from Pseudomonas sp. HPB0071
36% identity, 95% coverage

SMa0320 putative from Sinorhizobium meliloti 1021
36% identity, 94% coverage

GOX1899 Putative oxidoreductase from Gluconobacter oxydans 621H
36% identity, 95% coverage

BCAS0172 putative dehydrogenase from Burkholderia cenocepacia J2315
35% identity, 96% coverage

MSMEG_5568 clavaldehyde dehydrogenase from Mycobacterium smegmatis str. MC2 155
33% identity, 95% coverage

2jahC / Q9LCV7 Biochemical and structural analysis of the clavulanic acid dehydeogenase (cad) from streptomyces clavuligerus (see paper)
37% identity, 94% coverage

BB341_RS07845, SCLAV_4190 SDR family oxidoreductase from Streptomyces clavuligerus
37% identity, 94% coverage

cad / Q9LCV7 clavulanate dehydrogenase subunit from Streptomyces clavuligerus (see paper)
37% identity, 94% coverage

Bm2014 oxidoreductase, short chain dehydrogenase/reductase family protein from Brugia malayi
33% identity, 19% coverage

Atu0857 oxidoreductase from Agrobacterium tumefaciens str. C58 (Cereon)
34% identity, 95% coverage

MSMEG_6859 oxidoreductase from Mycobacterium smegmatis str. MC2 155
36% identity, 94% coverage

2ehdB / Q5SK86 Crystal structure analysis of oxidoreductase
33% identity, 94% coverage

ZMO3_RS07160 SDR family oxidoreductase from Zymomonas mobilis subsp. mobilis
ZMO1576 short-chain dehydrogenase/reductase SDR from Zymomonas mobilis subsp. mobilis ZM4
32% identity, 96% coverage

Afu5g11240 oxidoreductase, short chain dehydrogenase/reductase family from Aspergillus fumigatus Af293
37% identity, 67% coverage

bll3640 short-chain dehydrogenase from Bradyrhizobium japonicum USDA 110
34% identity, 99% coverage

VPA1275 oxidoreductase, short-chain dehydrogenase/reductase family from Vibrio parahaemolyticus RIMD 2210633
34% identity, 96% coverage

A9762_05735 SDR family oxidoreductase from Pandoraea sp. ISTKB
35% identity, 94% coverage

AFUA_5G10120, Afu5g10120 NRPS-like enzyme, putative from Aspergillus fumigatus Af293
34% identity, 18% coverage

SMa2343 putative oxidoreductase from Sinorhizobium meliloti 1021
33% identity, 95% coverage

XF1744 oxidoreductase from Xylella fastidiosa 9a5c
32% identity, 95% coverage

VCA1057 oxidoreductase, short-chain dehydrogenase/reductase family from Vibrio cholerae O1 biovar eltor str. N16961
34% identity, 96% coverage

lp_2851 SDR family oxidoreductase from Lactiplantibacillus plantarum WCFS1
lp_2851 short-chain dehydrogenase/oxidoreductase from Lactobacillus plantarum WCFS1
33% identity, 96% coverage

ATR12_EMENI / A0A1U8QWA2 Glycine betaine reductase ATRR; Nonribosomal peptide synthetase-like protein ATRR; EC 1.2.1.-; EC 1.1.1.- from Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) (Aspergillus nidulans) (see paper)
33% identity, 18% coverage

ANOM_004242 NRPS-like enzyme from Aspergillus nomiae NRRL 13137
32% identity, 18% coverage

lmo0432 similar to oxidoreductase from Listeria monocytogenes EGD-e
31% identity, 96% coverage

HMPREF1120_00598 linear gramicidin synthetase subunit C from Exophiala dermatitidis NIH/UT8656
33% identity, 18% coverage

lin0452 similar to oxidoreductase from Listeria innocua Clip11262
30% identity, 96% coverage

Z0343 putative oxidoreductase from Escherichia coli O157:H7 EDL933
33% identity, 96% coverage

3p19A / Q9F172 Improved NADPH-dependent blue fluorescent protein (see paper)
32% identity, 96% coverage

An02g00210 uncharacterized protein from Aspergillus niger
31% identity, 18% coverage

FTT_1201c oxidoreductase, short-chain dehydrogenase family protein from Francisella tularensis subsp. tularensis SCHU S4
33% identity, 95% coverage

SCO0522 oxidoreductase from Streptomyces coelicolor A3(2)
34% identity, 92% coverage

PADG_02575 uncharacterized protein from Paracoccidioides brasiliensis Pb18
32% identity, 18% coverage

BBA_04028 nonribosomal peptide synthetase 10 from Beauveria bassiana ARSEF 2860
31% identity, 18% coverage

FGSG_06507 hypothetical protein from Fusarium graminearum PH-1
32% identity, 18% coverage

FFUJ_03506 related to alpha-aminoadipate reductase large subunit from Fusarium fujikuroi IMI 58289
32% identity, 18% coverage

PK81I_PSEFD / M2ZIX7 Short chain dehydrogenase MYCFIDRAFT_6125; PKS8-1 gene cluster protein MYCFIDRAFT_6125; EC 1.1.1.- from Pseudocercospora fijiensis (strain CIRAD86) (Black leaf streak disease fungus) (Mycosphaerella fijiensis) (see 2 papers)
35% identity, 60% coverage

alr2142 oxidoreductase from Nostoc sp. PCC 7120
32% identity, 95% coverage

slr0315 hypothetical protein from Synechocystis sp. PCC 6803
30% identity, 96% coverage

H9Q64_13605 SDR family oxidoreductase from Enterococcus faecalis
30% identity, 95% coverage

EF2956 oxidoreductase, short-chain dehydrogenase/reductase family from Enterococcus faecalis V583
30% identity, 95% coverage

AO090005000690 No description from Aspergillus oryzae RIB40
32% identity, 18% coverage

MAC_06316 putative NRPS-like protein biosynthetic cluster from Metarhizium acridum
30% identity, 18% coverage

Dgeo_0818 short-chain dehydrogenase/reductase SDR from Deinococcus geothermalis DSM 11300
32% identity, 92% coverage

Pc18g00380 uncharacterized protein from Penicillium rubens
30% identity, 18% coverage

MAA_07148 Acyl carrier protein-like protein from Metarhizium robertsii ARSEF 23
30% identity, 18% coverage

LOC113211558 3-oxoacyl-[acyl-carrier-protein] reductase FabG from Frankliniella occidentalis
32% identity, 85% coverage

SARO_RS03965 SDR family NAD(P)-dependent oxidoreductase from Novosphingobium aromaticivorans DSM 12444
Saro_0794 short-chain dehydrogenase/reductase SDR from Novosphingobium aromaticivorans DSM 12444
32% identity, 61% coverage

MUL_1543 ketoacyl reductase from Mycobacterium ulcerans Agy99
38% identity, 81% coverage

MMAR_2367 ketoacyl reductase from Mycobacterium marinum M
37% identity, 81% coverage

SMb20214 putative oxidoreductase protein from Sinorhizobium meliloti 1021
35% identity, 70% coverage

MAV_4791 dehydrogenase from Mycobacterium avium 104
MAV_RS23020 SDR family NAD(P)-dependent oxidoreductase from Mycobacterium avium 104
40% identity, 72% coverage

YP_177255 3-oxoacyl-[acyl-carrier-protein] reductase from Bacillus clausii KSM-K16
32% identity, 84% coverage

ACP_0942 clavaldehyde dehydrogenase from Acidobacterium capsulatum ATCC 51196
29% identity, 94% coverage

HVO_B0031 short-chain family oxidoreductase from Haloferax volcanii DS2
37% identity, 74% coverage

NP_001021765 DeHydrogenases, Short chain from Caenorhabditis elegans
29% identity, 91% coverage

YPO1452 putative short chain dehydrogenase from Yersinia pestis CO92
28% identity, 88% coverage

K7GE31 Dehydrogenase/reductase SDR family member 7B from Pelodiscus sinensis
34% identity, 61% coverage

F1LB38 3-oxoacyl-[acyl-carrier-protein reductase from Ascaris suum
33% identity, 78% coverage

PEPE_RS00250 SDR family oxidoreductase from Pediococcus pentosaceus ATCC 25745
27% identity, 96% coverage

CG31548 uncharacterized protein from Drosophila melanogaster
30% identity, 88% coverage

SDR-1 / D2WKD9 NADP+-dependent farnesol dehydrogenase 1 monomer (EC 1.1.1.216) from Aedes aegypti (see paper)
SDR1_AEDAE / D2WKD9 Farnesol dehydrogenase; NADP(+)-dependent farnesol dehydrogenase 1; AaSDR-1; EC 1.1.1.216 from Aedes aegypti (Yellowfever mosquito) (Culex aegypti) (see paper)
D2WKD9 farnesol dehydrogenase (NADP+) (EC 1.1.1.216) from Aedes aegypti (see paper)
27% identity, 94% coverage

D2WKE0 NADP+-dependent farnesol dehydrogenase 2 from Aedes aegypti
28% identity, 94% coverage

Q71R50 Dehydrogenase/reductase SDR family member 11 from Gallus gallus
28% identity, 93% coverage

SNOG_07126 hypothetical protein from Parastagonospora nodorum SN15
29% identity, 19% coverage

BOA17_BOTFB / A6SSW9 Oxidoreductase BOA17; Botcinic acid biosynthesis cluster B protein 17; EC 1.-.-.- from Botryotinia fuckeliana (strain B05.10) (Noble rot fungus) (Botrytis cinerea) (see 2 papers)
34% identity, 67% coverage

lmo1976 similar to oxidoreductase from Listeria monocytogenes EGD-e
31% identity, 90% coverage

RLV_2389 SDR family oxidoreductase from Rhizobium leguminosarum bv. viciae
29% identity, 93% coverage

DHR11_HUMAN / Q6UWP2 Dehydrogenase/reductase SDR family member 11; 17-beta-hydroxysteroid dehydrogenase; 3-beta-hydroxysteroid 3-dehydrogenase; Estradiol 17-beta-dehydrogenase; Short-chain dehydrogenase/reductase family 24C member 1; EC 1.1.1.270; EC 1.1.1.62 from Homo sapiens (Human) (see paper)
27% identity, 91% coverage

LOC101264635 11-beta-hydroxysteroid dehydrogenase A from Solanum lycopersicum
33% identity, 62% coverage

B0VT18 NADP-dependent 3-hydroxy acid dehydrogenase YdfG from Acinetobacter baumannii (strain SDF)
ABSDF0838 putative Oxidoreductase, short chain dehydrogenase/reductase family from Acinetobacter baumannii SDF
33% identity, 72% coverage

Q6IAN0 Dehydrogenase/reductase SDR family member 7B from Homo sapiens
NP_056325 dehydrogenase/reductase SDR family member 7B isoform 1 from Homo sapiens
33% identity, 65% coverage

GL50803_14626 Oxidoreductase, short chain dehydrogenase/reductase family from Giardia intestinalis
35% identity, 60% coverage

Q9X248 3-oxoacyl-[acyl-carrier-protein] reductase FabG from Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8)
29% identity, 86% coverage

AbA118F_3481 SDR family NAD(P)-dependent oxidoreductase from Acinetobacter baumannii
32% identity, 73% coverage

CG3699 uncharacterized protein from Drosophila melanogaster
28% identity, 92% coverage

1xg5C / Q6UWP2 Structure of human putative dehydrogenase mgc4172 in complex with nadp
26% identity, 91% coverage

Q10782 Possible ketoacyl reductase from Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv)
Rv1544 Possible ketoacyl reductase from Mycobacterium tuberculosis H37Rv
35% identity, 81% coverage

Saro_1875 short-chain dehydrogenase/reductase SDR from Novosphingobium aromaticivorans DSM 12444
SARO_RS09390 SDR family NAD(P)-dependent oxidoreductase from Novosphingobium aromaticivorans DSM 12444
35% identity, 66% coverage

ABO_1918 short-chain dehydrogenase/reductase family from Alcanivorax borkumensis SK2
35% identity, 63% coverage

BMD_2094 short-chain dehydrogenase from Bacillus megaterium DSM319
33% identity, 71% coverage

LNN31_11860 SDR family oxidoreductase from Acetobacterium wieringae
29% identity, 80% coverage

LA_2621 short-chain dehydrogenase from Leptospira interrogans serovar lai str. 56601
34% identity, 72% coverage

RL0644 putative ribitol 2-dehydrogenase from Rhizobium leguminosarum bv. viciae 3841
30% identity, 92% coverage

EA682_19600 SDR family oxidoreductase from Acinetobacter baumannii
29% identity, 91% coverage

Rv2509 / I6Y9I3 Rv2509 from Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) (see 2 papers)
Rv2509 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mycobacterium tuberculosis H37Rv
I6Y9I3 Probable short-chain type dehydrogenase/reductase from Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv)
Mb2537 PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mycobacterium bovis AF2122/97
38% identity, 69% coverage

E1VCL7 Sorbitol dehydrogenase from Halomonas elongata (strain ATCC 33173 / DSM 2581 / NBRC 15536 / NCIMB 2198 / 1H9)
31% identity, 70% coverage

tsaC / P94681 4-sulfobenzyl alcohol dehydrogenase subunit (EC 1.1.1.257) from Comamonas testosteroni (see paper)
TSAC_COMTE / P94681 4-formylbenzenesulfonate dehydrogenase TsaC1/TsaC2; Toluenesulfonate zinc-independent alcohol dehydrogenase TsaC; EC 1.2.1.62 from Comamonas testosteroni (Pseudomonas testosteroni) (see 2 papers)
28% identity, 89% coverage

Q3ZBV9 Dehydrogenase/reductase SDR family member 11 from Bos taurus
27% identity, 93% coverage

NP_001165583 dehydrogenase/reductase SDR family member 7B isoform 1 from Mus musculus
30% identity, 70% coverage

rbtD / P00335 ribitol dehydrogenase subunit (EC 1.1.1.56) from Klebsiella aerogenes (see paper)
P00335 Ribitol 2-dehydrogenase from Klebsiella aerogenes
31% identity, 93% coverage

Q99J47 Dehydrogenase/reductase SDR family member 7B from Mus musculus
30% identity, 68% coverage

AW20_3574 glucose 1-dehydrogenase from Bacillus anthracis str. Sterne
BAS4612 glucose 1-dehydrogenase from Bacillus anthracis str. Sterne
27% identity, 87% coverage

NP_572420 spidey from Drosophila melanogaster
32% identity, 62% coverage

DR_1938 oxidoreductase, short-chain dehydrogenase/reductase family from Deinococcus radiodurans R1
36% identity, 66% coverage

EHI_163540 b-keto acyl reductase, putative from Entamoeba histolytica HM-1:IMSS
27% identity, 71% coverage

Tfu_1472 short-chain dehydrogenase/reductase (SDR) family protein from Thermobifida fusca YX
28% identity, 85% coverage

7ejhA / Q6WVP7 Crystal structure of kred mutant-f147l/l153q/y190p/l199a/m205f/m206f and 2-hydroxyisoindoline-1,3-dione complex
29% identity, 86% coverage

BQ1177_RS00925 SDR family NAD(P)-dependent oxidoreductase from Ligilactobacillus salivarius
32% identity, 81% coverage

G3V978 Dehydrogenase/reductase 11 from Rattus norvegicus
26% identity, 72% coverage

Cobetia51163_080094 SDR family NAD(P)-dependent oxidoreductase from Cobetia sp. 5-11-6-3
34% identity, 72% coverage

RADH_LENKE / Q6WVP7 NADP-dependent (R)-specific alcohol dehydrogenase; (R)-specific ADH; Ketoreductase; KRED; EC 1.1.1.- from Lentilactobacillus kefiri (Lactobacillus kefiri)
29% identity, 86% coverage

Q3U0B3 Dehydrogenase/reductase SDR family member 11 from Mus musculus
27% identity, 91% coverage

HSD_PINMS / A0A140FAN3 11-beta-hydroxysteroid dehydrogenase; 17-beta-hydroxysteroid dehydrogenase; Steroleosin; EC 1.1.1.146; EC 1.1.1.62 from Pinus massoniana (Chinese red pine) (see paper)
33% identity, 61% coverage

BAB2_0975 DNA gyrase, subunit B:Short-chain dehydrogenase/reductase SDR:Glucose/ribitol dehydrogenase from Brucella melitensis biovar Abortus 2308
29% identity, 89% coverage

MAP1716 hypothetical protein from Mycobacterium avium subsp. paratuberculosis str. k10
32% identity, 86% coverage

A0A068FPP9 glucose 1-dehydrogenase [NAD(P)+] (EC 1.1.1.47) from Bacillus sp. (in: Bacteria) (see paper)
26% identity, 87% coverage

MA0479 cyclohexanol dehydrogenase from Methanosarcina acetivorans C2A
27% identity, 86% coverage

CG12171 uncharacterized protein from Drosophila melanogaster
30% identity, 88% coverage

BMEII0514 3-OXOACYL-(ACYL-CARRIER PROTEIN) REDUCTASE from Brucella melitensis 16M
28% identity, 89% coverage

CGS9114_RS09725 oxidoreductase from Corynebacterium glutamicum S9114
33% identity, 66% coverage

xecD / Q56840 2-(R)-hydroxypropyl-CoM dehydrogenase subunit (EC 1.1.1.268) from Xanthobacter autotrophicus (strain ATCC BAA-1158 / Py2) (see 5 papers)
HCDR1_XANP2 / Q56840 2-(R)-hydroxypropyl-CoM dehydrogenase; R-HPCDH; 2-[(R)-2-hydroxypropylthio]ethanesulfonate dehydrogenase; Aliphatic epoxide carboxylation component III; Epoxide carboxylase component III; RHPCDH1; EC 1.1.1.268 from Xanthobacter autotrophicus (strain ATCC BAA-1158 / Py2) (see 7 papers)
Q56840 2-(R)-hydroxypropyl-CoM dehydrogenase (EC 1.1.1.268) from Xanthobacter autotrophicus (see 4 papers)
2cfcA / Q56840 Structural basis for stereo selectivity in the (r)- and (s)- hydroxypropylethane thiosulfonate dehydrogenases (see paper)
30% identity, 75% coverage

BC351_17295 SDR family NAD(P)-dependent oxidoreductase from Paenibacillus ferrarius
31% identity, 73% coverage

RPA3631 putative glucose dehydrogenase from Rhodopseudomonas palustris CGA009
32% identity, 91% coverage

6vspB / H9XP47 Structure of serratia marcescens 2,3-butanediol dehydrogenase mutant q247a (see paper)
27% identity, 93% coverage

DH12A_DANRE / Q6P3L6 Very-long-chain 3-oxoacyl-CoA reductase-A; 17-beta-hydroxysteroid dehydrogenase 12-A; 17-beta-HSD 12-A; zf3.1; zfHSD17B12A; 3-ketoacyl-CoA reductase; KAR; Estradiol 17-beta-dehydrogenase 12-A; EC 1.1.1.330; EC 1.1.1.62 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
30% identity, 59% coverage

budC / H9XP47 meso-butanediol dehydrogenase [(R)-acetoin-forming] (EC 1.1.1.304; EC 1.1.1.76) from Serratia marcescens (see paper)
MBDH_SERMA / H9XP47 Meso-2,3-butanediol dehydrogenase; BDH; meso-2,3-BDH; (R,S)-butane-2,3-diol dehydrogenase; NAD(H)-dependent meso-2,3-BDH; SmBdh; EC 1.1.1.- from Serratia marcescens (see 3 papers)
AN479_RS23455 SDR family NAD(P)-dependent oxidoreductase from Serratia ureilytica
27% identity, 93% coverage

CHADH_ASPFM / D3J0Z1 Chanoclavine-I dehydrogenase; ChaDH; EC 1.1.1.332 from Aspergillus fumigatus (Neosartorya fumigata) (see paper)
EASD_ASPFU / Q4WZ66 Chanoclavine-I dehydrogenase easD; ChaDH; Ergot alkaloid synthesis protein A; EC 1.1.1.332 from Aspergillus fumigatus (strain ATCC MYA-4609 / CBS 101355 / FGSC A1100 / Af293) (Neosartorya fumigata) (see 8 papers)
D3J0Z1 chanoclavine-I dehydrogenase (EC 1.1.1.332) from Aspergillus fumigatus (see paper)
Q4WZ66 chanoclavine-I dehydrogenase (EC 1.1.1.332) from Aspergillus fumigatus (see paper)
AFUA_2G18000, Afu2g18000 short chain dehydrogenase/oxidoreductase CpoX2 from Aspergillus fumigatus Af293
27% identity, 91% coverage

G0HY85 acetoacetyl-CoA reductase (EC 1.1.1.36) from Haloarcula hispanica (see paper)
33% identity, 57% coverage

4bmvC / B9U359 Short-chain dehydrogenase from sphingobium yanoikuyae in complex with NADPH
37% identity, 72% coverage

6y0sAAA / Q84EX5 6y0sAAA (see paper)
27% identity, 91% coverage

CG7601 uncharacterized protein from Drosophila melanogaster
Q9Y140 Dehydrogenase/reductase SDR family protein 7-like from Drosophila melanogaster
29% identity, 67% coverage

BL01177 (S)-acetoin forming diacetyl reductase from Bacillus licheniformis DSM 13 = ATCC 14580
Q65J08 Diacetyl reductase [(S)-acetoin forming] from Bacillus licheniformis (strain ATCC 14580 / DSM 13 / JCM 2505 / CCUG 7422 / NBRC 12200 / NCIMB 9375 / NCTC 10341 / NRRL NRS-1264 / Gibson 46)
28% identity, 84% coverage

ACIAD1828 putative short-chain dehydrogenase/reductase SDR protein from Acinetobacter sp. ADP1
28% identity, 43% coverage

B7HR44 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase from Bacillus cereus (strain AH187)
28% identity, 91% coverage

CC2966, CC_2966 short chain dehydrogenase family protein from Caulobacter crescentus CB15
32% identity, 81% coverage

Cobetia52542_060129 SDR family NAD(P)-dependent oxidoreductase from Cobetia sp. 5-25-4-2
33% identity, 72% coverage

barS1 / Q9LBV3 A-factor type γ-butyrolactone 6-reductase (6S-forming) monomer (EC 1.1.1.413) from Streptomyces virginiae (see paper)
BARS1_STRVG / Q9LBV3 A-factor type gamma-butyrolactone 1'-reductase (1S-forming); 6-dehydro-VB-A reductase; Butyrolactone autoregulator synthesis; EC 1.1.1.413 from Streptomyces virginiae (Streptomyces cinnamonensis) (see 2 papers)
30% identity, 89% coverage

KP22_02890 SDR family oxidoreductase from Pectobacterium betavasculorum
34% identity, 71% coverage

Elen_0666 short-chain dehydrogenase/reductase SDR from Eggerthella lenta DSM 2243
35% identity, 55% coverage

BT_1911 7-alpha-hydroxysteroid dehydrogenase from Bacteroides thetaiotaomicron VPI-5482
28% identity, 88% coverage

3BHD2_EGGLE / C8WGQ3 3beta-hydroxysteroid dehydrogenase 2; 3beta-HSDH 2; 3beta-hydroxycholanate 3-dehydrogenase (NAD(+)) 2; NAD-dependent bile acid 3beta-dehydrogenase; EC 1.1.1.-; EC 1.1.1.391 from Eggerthella lenta (strain ATCC 25559 / DSM 2243 / CCUG 17323 / JCM 9979 / KCTC 3265 / NCTC 11813 / VPI 0255 / 1899 B) (Eubacterium lentum) (see paper)
C8WGQ3 3beta-hydroxycholanate 3-dehydrogenase (NAD+) (EC 1.1.1.391) from Eggerthella lenta (see paper)
Elen_1325 short-chain dehydrogenase/reductase SDR from Eggerthella lenta DSM 2243
29% identity, 90% coverage

Q84EX5 R-specific alcohol dehydrogenase from Levilactobacillus brevis
27% identity, 91% coverage

SLG_12640 SDR family oxidoreductase from Sphingobium sp. SYK-6
31% identity, 61% coverage

4qecA / I6ZQW6 Elxo with NADP bound (see paper)
25% identity, 89% coverage

YfeF / b2426 oxidoreductase UcpA (EC 1.1.1.304) from Escherichia coli K-12 substr. MG1655 (see 6 papers)
ucpA / P37440 oxidoreductase UcpA (EC 1.1.1.304) from Escherichia coli (strain K12) (see 6 papers)
b2426 putative oxidoreductase from Escherichia coli str. K-12 substr. MG1655
NP_416921 oxidoreductase UcpA from Escherichia coli str. K-12 substr. MG1655
P37440 Oxidoreductase UcpA from Escherichia coli (strain K12)
36% identity, 71% coverage

SGO_1096 acetoin dehydrogenase from Streptococcus gordonii str. Challis substr. CH1
A8AX75 Diacetyl reductase [(S)-acetoin forming] from Streptococcus gordonii (strain Challis / ATCC 35105 / BCRC 15272 / CH1 / DL1 / V288)
28% identity, 88% coverage

Z3691 short chain dehydrogenase from Escherichia coli O157:H7 EDL933
36% identity, 71% coverage

CPS_RS07445 SDR family NAD(P)-dependent oxidoreductase from Colwellia psychrerythraea 34H
28% identity, 89% coverage

1xkqA / Q9N5G4 Crystal structure of short-chain dehydrogenase/reductase of unknown function from caenorhabditis elegans with cofactor
28% identity, 83% coverage

LBL_1854 Short chain dehydrogenase from Leptospira borgpetersenii serovar Hardjo-bovis L550
33% identity, 72% coverage

BCAS0713 putative short-chain oxidoreductase from Burkholderia cenocepacia J2315
32% identity, 75% coverage

ABE785_01835 SDR family oxidoreductase from Levilactobacillus brevis
Q03TF9 Short-chain alcohol dehydrogenase from Levilactobacillus brevis (strain ATCC 367 / BCRC 12310 / CIP 105137 / JCM 1170 / LMG 11437 / NCIMB 947 / NCTC 947)
26% identity, 91% coverage

LOC101112590 estradiol 17-beta-dehydrogenase 12-B-like from Ovis aries
31% identity, 59% coverage

MSMEG_4722 short-chain dehydrogenase from Mycobacterium smegmatis str. MC2 155
35% identity, 69% coverage

MAB_1537c Putative short chain dehydrogenase/reductase from Mycobacterium abscessus ATCC 19977
39% identity, 67% coverage

GALLO_1207 putative short chain dehydrogenase from Streptococcus gallolyticus UCN34
33% identity, 65% coverage

CHCB2_SINCY / P0DXE1 Cis-4-hydroxycyclohexanecarboxylate dehydrogenase; Cis-4-hydroxyCHCA dehydrogenase; EC 1.1.1.438 from Sinomonas cyclohexanicum (Corynebacterium cyclohexanicum) (see paper)
29% identity, 91% coverage

swb2 / D2KTX1 3-hydroxykynurenate reductase/dehydratase from Streptomyces sp. SNA15896 (see paper)
35% identity, 74% coverage

VK055_1916 SDR family NAD(P)-dependent oxidoreductase from Klebsiella pneumoniae subsp. pneumoniae
32% identity, 86% coverage

ZMO0318 short-chain dehydrogenase/reductase SDR from Zymomonas mobilis subsp. mobilis ZM4
28% identity, 89% coverage

P39482 glucose 1-dehydrogenase [NAD(P)+] (EC 1.1.1.47) from Priestia megaterium (see paper)
27% identity, 86% coverage

VDAG_07187 3-oxoacyl-[acyl-carrier-protein] reductase from Verticillium dahliae VdLs.17
33% identity, 84% coverage

1g6kA / P40288 Crystal structure of glucose dehydrogenase mutant e96a complexed with NAD+
27% identity, 86% coverage

Q7Q6V7 AGAP005645-PA from Anopheles gambiae
25% identity, 94% coverage

CHLNCDRAFT_53139 hypothetical protein from Chlorella variabilis
41% identity, 41% coverage

Rv1050 PROBABLE OXIDOREDUCTASE from Mycobacterium tuberculosis H37Rv
NP_215566 oxidoreductase from Mycobacterium tuberculosis H37Rv
32% identity, 61% coverage

AT3G55310 binding / catalytic/ oxidoreductase from Arabidopsis thaliana
31% identity, 67% coverage

XP_006246534 dehydrogenase/reductase SDR family member 7B isoform X1 from Rattus norvegicus
29% identity, 70% coverage

LOC110736399 dehydrogenase/reductase SDR family member 7-like from Chenopodium quinoa
27% identity, 77% coverage

SF5M90T_2416 SDR family oxidoreductase UcpA from Shigella flexneri 5a str. M90T
36% identity, 69% coverage

A0R723 3-oxoacyl-[acyl-carrier-protein] reductase (EC 1.1.1.100); acetoacetyl-CoA reductase (EC 1.1.1.36) from Mycolicibacterium smegmatis (see paper)
MSMEG_6753 oxidoreductase, short chain dehydrogenase/reductase family protein from Mycobacterium smegmatis str. MC2 155
29% identity, 91% coverage

EHI_165070 short chain dehydrogenase family protein from Entamoeba histolytica HM-1:IMSS
29% identity, 70% coverage

5t2uA / A0R723 Short chain dehydrogenase/reductase family protein (see paper)
29% identity, 91% coverage

DRS7B_RAT / Q5RJY4 Dehydrogenase/reductase SDR family member 7B; Short-chain dehydrogenase/reductase family 32C member 1; Protein SDR32C1; EC 1.1.-.- from Rattus norvegicus (Rat) (see 2 papers)
29% identity, 68% coverage

V9NF79 diacetyl reductase [(R)-acetoin forming] (EC 1.1.1.303) from Paenibacillus polymyxa (see paper)
28% identity, 92% coverage

CG9150 uncharacterized protein from Drosophila melanogaster
25% identity, 94% coverage

BPHY_RS37740, Bphy_7637 SDR family oxidoreductase from Paraburkholderia phymatum STM815
27% identity, 93% coverage

BAB1_0483 Short-chain dehydrogenase/reductase SDR:Glucose/ribitol dehydrogenase from Brucella melitensis biovar Abortus 2308
34% identity, 73% coverage

7w61A Crystal structure of farnesol dehydrogenase from helicoverpa armigera (see paper)
25% identity, 94% coverage

PG2069 oxidoreductase, short chain dehydrogenase/reductase family from Porphyromonas gingivalis W83
HMPREF1322_RS07305 SDR family NAD(P)-dependent oxidoreductase from Porphyromonas gingivalis W50
29% identity, 87% coverage

HSDA_SESIN / Q93W57 11-beta-hydroxysteroid dehydrogenase A; 17-beta-hydroxysteroid dehydrogenase A; Seed oil body protein 2; Steroleosin-A; EC 1.1.1.-; EC 1.1.1.62 from Sesamum indicum (Oriental sesame) (Sesamum orientale) (see 3 papers)
32% identity, 55% coverage

AT3G55290 short-chain dehydrogenase/reductase (SDR) family protein from Arabidopsis thaliana
30% identity, 67% coverage

MT1283 oxidoreductase, short-chain dehydrogenase/reductase family from Mycobacterium tuberculosis CDC1551
Rv1245c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mycobacterium tuberculosis H37Rv
31% identity, 78% coverage

LOC103713735 11-beta-hydroxysteroid dehydrogenase A-like from Phoenix dactylifera
33% identity, 54% coverage

OP10G_1224 SDR family NAD(P)-dependent oxidoreductase from Fimbriimonas ginsengisoli Gsoil 348
34% identity, 74% coverage

NP_663399 cis-retinol/3alpha hydroxysterol short-chain dehydrogenase-like precursor from Mus musculus
34% identity, 57% coverage

6b9uA / Q2YJS1 Crystal structure of 3-ketoacyl-(acyl-carrier-protein) reductase from brucella melitensis complexed with nadh
29% identity, 89% coverage

HSDA_ARAHY / A7LB60 11-beta-hydroxysteroid dehydrogenase A; 17-beta-hydroxysteroid dehydrogenase A; Steroleosin-A; EC 1.1.1.146; EC 1.1.1.- from Arachis hypogaea (Peanut) (see paper)
30% identity, 57% coverage

PMI2168 probable short-chain dehydrogenase from Proteus mirabilis HI4320
28% identity, 87% coverage

EASD_ARTOC / C5FTN0 Chanoclavine-I dehydrogenase easD; ChaDH; Ergot alkaloid synthesis protein D; EC 1.1.1.332 from Arthroderma otae (strain ATCC MYA-4605 / CBS 113480) (Microsporum canis) (see paper)
29% identity, 91% coverage

BRADO6616 putative oxidoreductase; putative Glucose/ribitol oxidoreductase from Bradyrhizobium sp. ORS278
28% identity, 91% coverage

CV_1482 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase from Chromobacterium violaceum ATCC 12472
38% identity, 70% coverage

DHG_PRIMG / P40288 Glucose 1-dehydrogenase; EC 1.1.1.47 from Priestia megaterium (Bacillus megaterium) (see paper)
P40288 glucose 1-dehydrogenase [NAD(P)+] (EC 1.1.1.47) from Priestia megaterium (see 9 papers)
26% identity, 86% coverage

CC2337 oxidoreductase, short-chain dehydrogenase/reductase family from Caulobacter crescentus CB15
34% identity, 68% coverage

BTH_I1719 3-oxoacyl-(acyl-carrier-protein) reductase from Burkholderia thailandensis E264
29% identity, 92% coverage

LINX_SPHIU / D4Z260 2,5-dichloro-2,5-cyclohexadiene-1,4-diol dehydrogenase LinX; 2,5-DDOL dehydrogenase; EC 1.1.1.- from Sphingobium indicum (strain DSM 16413 / CCM 7287 / MTCC 6362 / UT26 / NBRC 101211 / UT26S) (Sphingobium japonicum) (see paper)
33% identity, 70% coverage

AT2G17845 short-chain dehydrogenase/reductase (SDR) family protein from Arabidopsis thaliana
31% identity, 60% coverage

LOC103706421 11-beta-hydroxysteroid dehydrogenase A-like from Phoenix dactylifera
33% identity, 54% coverage

VC1591 oxidoreductase, short-chain dehydrogenase/reductase family from Vibrio cholerae O1 biovar eltor str. N16961
30% identity, 88% coverage

CBG09704 Protein CBG09704 from Caenorhabditis briggsae
29% identity, 80% coverage

A6N8S2 glucose 1-dehydrogenase [NAD(P)+] (EC 1.1.1.47) from Bacillus subtilis (see paper)
26% identity, 85% coverage

C1O30_RS04475 SDR family oxidoreductase from Dickeya zeae
27% identity, 90% coverage

MSMEG_0737 dehydrogenase from Mycobacterium smegmatis str. MC2 155
36% identity, 75% coverage

PSPTO_4065 oxidoreductase, short-chain dehydrogenase/reductase family from Pseudomonas syringae pv. tomato str. DC3000
32% identity, 71% coverage

NP_001086077 estradiol 17-beta-dehydrogenase 12-B-like L homeolog from Xenopus laevis
29% identity, 57% coverage

XCC1018 3-oxoacyl-[ACP] reductase from Xanthomonas campestris pv. campestris str. ATCC 33913
28% identity, 92% coverage

New Search

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 793,807 different protein sequences to 1,259,118 scientific articles. Searches against EuropePMC were last performed on March 13 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.

PaperBLAST also incorporates manually-curated protein functions:

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

Changes to PaperBLAST since the paper was written:

Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's web site and is available in PubmedCentral or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keeseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory