PaperBLAST – Find papers about a protein or its homologs

 

PaperBLAST

PaperBLAST Hits for metacyc::SGL_RS08600-MONOMER glycerate dehydrogenase (EC 1.1.1.29) (Synechocystis sp. (strain PCC 6803 / Kazusa)) (554 a.a., MNLAWLQGLS...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmenbrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Fitness BLAST: loading...

Found 252 similar proteins in the literature:

hprA / P73821 glycerate dehydrogenase (EC 1.1.1.29) from Synechocystis sp. (strain PCC 6803 / Kazusa) (see paper)
sll1908 phosphoglycerate dehydrogenase from Synechocystis sp. PCC 6803
100% identity, 100% coverage

SYNPCC7002_A1246 D-3-phosphoglycerate dehydrogenase from Synechococcus sp. PCC 7002
78% identity, 95% coverage

alr1890 phosphoglycerate dehydrogenase from Nostoc sp. PCC 7120
79% identity, 95% coverage

GKIL_0932 phosphoglycerate dehydrogenase from Gloeobacter kilaueensis JS1
69% identity, 95% coverage

gvip294 D-3-phosphoglycerate dehydrogenase from Gloeobacter violaceus PCC 7421
68% identity, 95% coverage

MA0592 phosphoglycerate dehydrogenase from Methanosarcina acetivorans C2A
47% identity, 95% coverage

DET0599 D-3-phosphoglycerate dehydrogenase from Dehalococcoides ethenogenes 195
47% identity, 95% coverage

ZMO1685 D-3-phosphoglycerate dehydrogenase from Zymomonas mobilis subsp. mobilis ZM4
44% identity, 95% coverage

Athe_2125 phosphoglycerate dehydrogenase from Caldicellulosiruptor bescii DSM 6725
43% identity, 95% coverage

A7Z657 D-3-phosphoglycerate dehydrogenase from Bacillus velezensis (strain DSM 23117 / BGSC 10A6 / LMG 26770 / FZB42)
43% identity, 95% coverage

BSU23070 D-3-phosphoglycerate dehydrogenase from Bacillus subtilis subsp. subtilis str. 168
P35136 D-3-phosphoglycerate dehydrogenase from Bacillus subtilis (strain 168)
43% identity, 95% coverage

RLV_6165 phosphoglycerate dehydrogenase from Rhizobium leguminosarum bv. viciae
43% identity, 94% coverage

serA / GB|AAC83943.1 phosphoglycerate dehydrogenase; EC 1.1.1.95 from Bacillus subtilis (see paper)
serA / AAC83943.1 phosphoglycerate dehydrogenase from Bacillus subtilis (see 3 papers)
43% identity, 95% coverage

MMP1588 D-3-phosphoglycerate dehydrogenase from Methanococcus maripaludis S2
43% identity, 95% coverage

RSP_1352 D-3-phosphoglycerate dehydrogenase from Rhodobacter sphaeroides 2.4.1
44% identity, 90% coverage

SMc00641 PUTATIVE D-3-PHOSPHOGLYCERATE DEHYDROGENASE PROTEIN from Sinorhizobium meliloti 1021
Q92MA3 D-3-phosphoglycerate dehydrogenase from Rhizobium meliloti (strain 1021)
42% identity, 94% coverage

BAB1_1697 ATP/GTP-binding site motif A (P-loop):Amino acid-binding ACT:D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain:... from Brucella melitensis biovar Abortus 2308
43% identity, 93% coverage

A4XJ21 D-3-phosphoglycerate dehydrogenase from Caldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903 / Tp8T 6331)
42% identity, 93% coverage

ATU_RS17200 phosphoglycerate dehydrogenase from Agrobacterium fabrum str. C58
42% identity, 93% coverage

Dshi_3318 D-3-phosphoglycerate dehydrogenase from Dinoroseobacter shibae DFL 12
44% identity, 91% coverage

SH1200 D-3-phosphoglycerate dehydrogenase from Staphylococcus haemolyticus JCSC1435
42% identity, 95% coverage

DA471_RS00355 phosphoglycerate dehydrogenase from Staphylococcus aureus
SAR1801 D-3-phosphoglycerate dehydrogenase from Staphylococcus aureus subsp. aureus MRSA252
41% identity, 95% coverage

SA1545 D-3-phosphoglycerate dehydrogenase from Staphylococcus aureus subsp. aureus N315
41% identity, 95% coverage

SACOL1773 D-3-phosphoglycerate dehydrogenase from Staphylococcus aureus subsp. aureus COL
NWMN_1617 D-3-phosphoglycerate dehydrogenase from Staphylococcus aureus subsp. aureus str. Newman
41% identity, 95% coverage

CCNA_03322 D-3-phosphoglycerate dehydrogenase from Caulobacter crescentus NA1000
CC3215 D-3-phosphoglycerate dehydrogenase from Caulobacter crescentus CB15
42% identity, 94% coverage

MPNT_20138 phosphoglycerate dehydrogenase from Candidatus Methylacidithermus pantelleriae
41% identity, 95% coverage

NJ7G_0555 phosphoglycerate dehydrogenase from Natrinema sp. J7-2
41% identity, 94% coverage

Q9Z564 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Streptomyces coelicolor (see paper)
SCO5515 probable D-3-phosphoglycerate dehydrogenase from Streptomyces coelicolor A3(2)
40% identity, 94% coverage

mru_0678 phosphoglycerate dehydrogenase SerA from Methanobrevibacter ruminantium M1
39% identity, 95% coverage

ARB_07891 uncharacterized protein from Trichophyton benhamiae CBS 112371
41% identity, 91% coverage

HVO_2968 D-3-phosphoglycerate dehydrogenase from Haloferax volcanii DS2
39% identity, 95% coverage

A9PEK1 D-3-phosphoglycerate dehydrogenase from Populus trichocarpa
40% identity, 88% coverage

SERA2_ARATH / O04130 D-3-phosphoglycerate dehydrogenase 2, chloroplastic; PGDH; EC 1.1.1.95 from Arabidopsis thaliana (Mouse-ear cress) (see 3 papers)
O04130 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Arabidopsis thaliana (see 3 papers)
NP_564034 D-3-phosphoglycerate dehydrogenase from Arabidopsis thaliana
AT1G17745 PGDH (3-PHOSPHOGLYCERATE DEHYDROGENASE); phosphoglycerate dehydrogenase from Arabidopsis thaliana
39% identity, 84% coverage

F6I5Y5 D-3-phosphoglycerate dehydrogenase from Vitis vinifera
41% identity, 80% coverage

XP_002283022 D-3-phosphoglycerate dehydrogenase 3, chloroplastic from Vitis vinifera
40% identity, 86% coverage

A0A2R6X868 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Marchantia polymorpha (see paper)
40% identity, 83% coverage

XP_002518687 D-3-phosphoglycerate dehydrogenase 1, chloroplastic from Ricinus communis
B9RYA3 D-3-phosphoglycerate dehydrogenase from Ricinus communis
40% identity, 88% coverage

A2YVQ8 D-3-phosphoglycerate dehydrogenase from Oryza sativa subsp. indica
39% identity, 84% coverage

LOC101246616 D-3-phosphoglycerate dehydrogenase 3, chloroplastic-like from Solanum lycopersicum
40% identity, 88% coverage

Q6ZAA5 D-3-phosphoglycerate dehydrogenase from Oryza sativa subsp. japonica
38% identity, 84% coverage

A0A1Z4EAX3 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Beta vulgaris (see paper)
38% identity, 84% coverage

Q8TYK0 D-3-phosphoglycerate dehydrogenase from Methanopyrus kandleri (strain AV19 / DSM 6324 / JCM 9639 / NBRC 100938)
40% identity, 95% coverage

GRMZM2G009323 D-3-phosphoglycerate dehydrogenase from Zea mays
39% identity, 85% coverage

LOC100280688 uncharacterized protein LOC100280688 from Zea mays
38% identity, 84% coverage

LOC107808555 D-3-phosphoglycerate dehydrogenase 2, chloroplastic from Nicotiana tabacum
38% identity, 82% coverage

VDAG_01596 D-3-phosphoglycerate dehydrogenase from Verticillium dahliae VdLs.17
39% identity, 79% coverage

A0A1Z4EAX4 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Beta vulgaris (see paper)
39% identity, 84% coverage

A3BE72 D-3-phosphoglycerate dehydrogenase from Oryza sativa subsp. japonica
40% identity, 85% coverage

SERA1_ARATH / O49485 D-3-phosphoglycerate dehydrogenase 1, chloroplastic; Protein EMBRYO SAC DEVELOPMENT ARREST 9; EC 1.1.1.95 from Arabidopsis thaliana (Mouse-ear cress) (see 2 papers)
O49485 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Arabidopsis thaliana (see 4 papers)
NP_195146 D-3-phosphoglycerate dehydrogenase from Arabidopsis thaliana
AT4G34200 EDA9 (embryo sac development arrest 9); ATP binding from Arabidopsis thaliana
38% identity, 87% coverage

M1AIT2 D-3-phosphoglycerate dehydrogenase from Solanum tuberosum
38% identity, 83% coverage

CHLNCDRAFT_56437 hypothetical protein from Chlorella variabilis
38% identity, 85% coverage

B6SKK1 D-3-phosphoglycerate dehydrogenase from Zea mays
38% identity, 84% coverage

MAB_3304c D-3-phosphoglycerate dehydrogenase (SerA) from Mycobacterium abscessus ATCC 19977
39% identity, 93% coverage

A0QUY2 D-3-phosphoglycerate dehydrogenase from Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155)
MSMEG_2378 D-3-phosphoglycerate dehydrogenase from Mycobacterium smegmatis str. MC2 155
MSMEG_2378, MSMEI_2318 phosphoglycerate dehydrogenase from Mycolicibacterium smegmatis MC2 155
39% identity, 94% coverage

B4G011 D-3-phosphoglycerate dehydrogenase from Zea mays
40% identity, 85% coverage

SERA3_ARATH / Q9LT69 D-3-phosphoglycerate dehydrogenase 3, chloroplastic; EC 1.1.1.95 from Arabidopsis thaliana (Mouse-ear cress) (see 2 papers)
Q9LT69 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Arabidopsis thaliana (see 4 papers)
AT3G19480, NP_566637 D-3-phosphoglycerate dehydrogenase from Arabidopsis thaliana
NP_566637 D-3-phosphoglycerate dehydrogenase, putative / 3-PGDH, putative from Arabidopsis thaliana
38% identity, 89% coverage

XP_042922996 uncharacterized protein from Chlamydomonas reinhardtii
38% identity, 84% coverage

MMAR_1715 D-3-phosphoglycerate dehydrogenase SerA1 from Mycobacterium marinum M
40% identity, 93% coverage

P9WNX3 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Mycobacterium tuberculosis (see 5 papers)
Rv2996c PROBABLE D-3-PHOSPHOGLYCERATE DEHYDROGENASE SERA1 (PGDH) from Mycobacterium tuberculosis H37Rv
MT3074 D-3-phosphoglycerate dehydrogenase from Mycobacterium tuberculosis CDC1551
BCG_3017c putative D-3-phosphoglycerate dehydrogenase serA1 from Mycobacterium bovis BCG str. Pasteur 1173P2
YP_177916 D-3-phosphoglycerate dehydrogenase from Mycobacterium tuberculosis H37Rv
40% identity, 95% coverage

3dc2A / P9WNX3 Crystal structure of serine bound d-3-phosphoglycerate dehydrogenase from mycobacterium tuberculosis (see paper)
40% identity, 94% coverage

MUL_1952 D-3-phosphoglycerate dehydrogenase SerA1 from Mycobacterium ulcerans Agy99
40% identity, 93% coverage

C629_RS07285 phosphoglycerate dehydrogenase from Corynebacterium glutamicum SCgG2
cg1451 D-3-phosphoglycerate dehydrogenase from Corynebacterium glutamicum ATCC 13032
38% identity, 92% coverage

AFUA_2G04490 D-3-phosphoglycerate dehydrogenase from Aspergillus fumigatus Af293
37% identity, 83% coverage

MAVA5_16825 phosphoglycerate dehydrogenase from Mycobacterium avium subsp. hominissuis A5
39% identity, 94% coverage

ML1692 D-3-phosphoglycerate dehydrogenase from Mycobacterium leprae TN
40% identity, 92% coverage

MAP3033c SerA from Mycobacterium avium subsp. paratuberculosis str. k10
39% identity, 94% coverage

ATE51_01764 phosphoglycerate dehydrogenase from Campylobacter coli
36% identity, 93% coverage

PITG_00133 phosphoserine aminotransferase, putative from Phytophthora infestans T30-4
36% identity, 15% coverage

Cj0891c D-3-phosphoglycerate dehydrogenase from Campylobacter jejuni subsp. jejuni NCTC 11168
36% identity, 93% coverage

CJJ81176_0900 D-3-phosphoglycerate dehydrogenase from Campylobacter jejuni subsp. jejuni 81-176
36% identity, 93% coverage

DR_1291 D-3-phosphoglycerate dehydrogenase from Deinococcus radiodurans R1
36% identity, 97% coverage

SERA_MOUSE / Q61753 D-3-phosphoglycerate dehydrogenase; 3-PGDH; A10; EC 1.1.1.95 from Mus musculus (Mouse) (see paper)
Q61753 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Mus musculus (see 2 papers)
NP_058662 D-3-phosphoglycerate dehydrogenase from Mus musculus
43% identity, 74% coverage

Phgdh / O08651 3-phosphoglycerate dehydrogenase subunit (EC 1.1.1.95) from Rattus norvegicus (see paper)
SERA_RAT / O08651 D-3-phosphoglycerate dehydrogenase; 3-PGDH; EC 1.1.1.95 from Rattus norvegicus (Rat) (see paper)
O08651 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Rattus norvegicus (see paper)
NP_113808 D-3-phosphoglycerate dehydrogenase from Rattus norvegicus
43% identity, 74% coverage

PHGDH / O43175 D-3-phosphoglycerate dehydrogenase (EC 1.1.1.95) from Homo sapiens (see 3 papers)
SERA_HUMAN / O43175 D-3-phosphoglycerate dehydrogenase; 3-PGDH; 2-oxoglutarate reductase; Malate dehydrogenase; EC 1.1.1.95; EC 1.1.1.399; EC 1.1.1.37 from Homo sapiens (Human) (see 7 papers)
O43175 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Homo sapiens (see 8 papers)
44% identity, 72% coverage

XP_849835 D-3-phosphoglycerate dehydrogenase isoform X6 from Canis lupus familiaris
43% identity, 74% coverage

A5GFY8 D-3-phosphoglycerate dehydrogenase from Sus scrofa
42% identity, 74% coverage

Q5EAD2 D-3-phosphoglycerate dehydrogenase from Bos taurus
43% identity, 73% coverage

B7P3M8 D-3-phosphoglycerate dehydrogenase (Fragment) from Ixodes scapularis
38% identity, 81% coverage

HP0397 phosphoglycerate dehydrogenase (serA) from Helicobacter pylori 26695
O25158 D-3-phosphoglycerate dehydrogenase from Helicobacter pylori (strain ATCC 700392 / 26695)
35% identity, 88% coverage

B3KSC3 D-3-phosphoglycerate dehydrogenase from Homo sapiens
44% identity, 66% coverage

LOC105252893 D-3-phosphoglycerate dehydrogenase from Camponotus floridanus
40% identity, 73% coverage

XP_017888762 D-3-phosphoglycerate dehydrogenase isoform X2 from Ceratina calcarata
39% identity, 79% coverage

7dkmA / O43175 Phgdh covalently linked to oridonin (see paper)
47% identity, 55% coverage

PF1394 phosphoglycerate dehydrogenase from Pyrococcus furiosus DSM 3638
47% identity, 54% coverage

TK1966 D-3-phosphoglycerate dehydrogenase from Thermococcus kodakaraensis KOD1
47% identity, 54% coverage

CG6287 uncharacterized protein from Drosophila melanogaster
46% identity, 54% coverage

TEL01S_RS09140 D-2-hydroxyacid dehydrogenase from Pseudothermotoga elfii DSM 9442 = NBRC 107921
44% identity, 54% coverage

HPR_THEMA / Q9X1C1 Hydroxypyruvate reductase; HPR; EC 1.1.1.81 from Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8) (see paper)
TM1401 D-3-phosphoglycerate dehydrogenase from Thermotoga maritima MSB8
46% identity, 53% coverage

H9JRZ9 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Bombyx mori (see paper)
43% identity, 49% coverage

TRQ7_RS07315 hydroxypyruvate reductase from Thermotoga sp. RQ7
44% identity, 53% coverage

1wwkA / O50095 Crystal structure of phosphoglycerate dehydrogenase from pyrococcus horikoshii ot3
44% identity, 54% coverage

XP_011539528 D-3-phosphoglycerate dehydrogenase isoform X1 from Homo sapiens
42% identity, 50% coverage

7cvpA / O43175 The crystal structure of human phgdh from biortus.
47% identity, 40% coverage

CAC0015 D-3-phosphoglycerate dehydrogenase from Clostridium acetobutylicum ATCC 824
38% identity, 54% coverage

Q9UMY2 D-3-phosphoglycerate dehydrogenase (Fragment) from Homo sapiens
42% identity, 52% coverage

A5DEL5 phosphoglycerate dehydrogenase from Meyerozyma guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC 10279 / NRRL Y-324)
PGUG_01716 uncharacterized protein from Meyerozyma guilliermondii ATCC 6260
38% identity, 56% coverage

BMF77_03393 hydroxyacid dehydrogenase from Dolichospermum sp. UHCC 0315A
40% identity, 54% coverage

NCU01439 D-3-phosphoglycerate dehydrogenase 1 from Neurospora crassa OR74A
38% identity, 57% coverage

8wprA / Q7WRR2 Anabaena mcyi r166a with prebound NAD and malate
40% identity, 54% coverage

Q8U3Y2 glyoxylate reductase (NADP+) (EC 1.1.1.79); hydroxypyruvate reductase (EC 1.1.1.81) from Pyrococcus furiosus (see paper)
PF0319 putative phosphoglycerate dehydrogenase from Pyrococcus furiosus DSM 3638
36% identity, 53% coverage

5aovA / Q8U3Y2 Ternary crystal structure of pyrococcus furiosus glyoxylate hydroxypyruvate reductase in presence of glyoxylate (see paper)
36% identity, 53% coverage

YP_009328950 D-3-phosphoglycerate dehydrogenase, type2 from Cedratvirus A11
38% identity, 56% coverage

SPCC364.07 D-3 phosphoglycerate dehydrogenase (predicted) from Schizosaccharomyces pombe
P87228 Putative D-3-phosphoglycerate dehydrogenase from Schizosaccharomyces pombe (strain 972 / ATCC 24843)
35% identity, 59% coverage

CDIF630erm_01131 D-2-hydroxyacid dehydrogenase from Clostridioides difficile
CD0995 putative D-3-phosphoglycerate dehydrogenase from Clostridium difficile 630
37% identity, 55% coverage

A4XGF7 D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding protein from Caldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903 / Tp8T 6331)
37% identity, 56% coverage

SERA_YEAST / P40054 D-3-phosphoglycerate dehydrogenase 1; 3-PGDH 1; 2-oxoglutarate reductase; EC 1.1.1.95; EC 1.1.1.399 from Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (see 2 papers)
NP_011004 phosphoglycerate dehydrogenase SER3 from Saccharomyces cerevisiae S288C
YER081W 3-phosphoglycerate dehydrogenase, catalyzes the first step in serine and glycine biosynthesis; isozyme of Ser33p from Saccharomyces cerevisiae
36% identity, 58% coverage

CLJU_c03860 phosphoglycerate dehydrogenase from Clostridium ljungdahlii DSM 13528
40% identity, 49% coverage

ZP_02639608 D-3-phosphoglycerate dehydrogenase from Clostridium perfringens CPE str. F4969
38% identity, 54% coverage

SJAG_02313 D-3 phosphoglycerate dehydrogenase from Schizosaccharomyces japonicus yFS275
35% identity, 59% coverage

TM0327 phosphoglycerate dehydrogenase, putative from Thermotoga maritima MSB8
43% identity, 50% coverage

SER33_YEAST / P40510 D-3-phosphoglycerate dehydrogenase 2; 3-PGDH 2; 2-oxoglutarate reductase; EC 1.1.1.95; EC 1.1.1.399 from Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (see 2 papers)
YIL074C 3-phosphoglycerate dehydrogenase, catalyzes the first step in serine and glycine biosynthesis; isozyme of Ser3p from Saccharomyces cerevisiae
36% identity, 56% coverage

8q2iB / P40510 Crystal structure of ser33 in complex 2hg (2-hydroxyglutarate) and serine
36% identity, 56% coverage

TERTU_0393 D-3-phosphoglycerate dehydrogenase from Teredinibacter turnerae T7901
36% identity, 56% coverage

VP2593 D-3-phosphoglycerate dehydrogenase from Vibrio parahaemolyticus RIMD 2210633
VP_RS12615 phosphoglycerate dehydrogenase from Vibrio parahaemolyticus RIMD 2210633
36% identity, 56% coverage

GYAR_THELN / Q9C4M5 Glyoxylate reductase; EC 1.1.1.26 from Thermococcus litoralis (strain ATCC 51850 / DSM 5473 / JCM 8560 / NS-C) (see paper)
Q9C4M5 glyoxylate reductase (EC 1.1.1.26) from Thermococcus litoralis (see paper)
OCC_02245 glyoxylate reductase from Thermococcus litoralis DSM 5473
36% identity, 53% coverage

VC2481 D-3-phosphoglycerate dehydrogenase from Vibrio cholerae O1 biovar eltor str. N16961
36% identity, 56% coverage

PN96_00930 phosphoglycerate dehydrogenase from Vibrio natriegens NBRC 15636 = ATCC 14048 = DSM 759
34% identity, 56% coverage

XHV734_2726 phosphoglycerate dehydrogenase from Xanthomonas hortorum pv. vitians
36% identity, 58% coverage

ABUW_0331 phosphoglycerate dehydrogenase from Acinetobacter baumannii
35% identity, 56% coverage

VAS14_20256 phosphoglycerate dehydrogenase from Photobacterium angustum S14
35% identity, 56% coverage

APL_1452 D-3-phosphoglycerate dehydrogenase from Actinobacillus pleuropneumoniae L20
34% identity, 57% coverage

2eklA / Q972A9 Structure of st1218 protein from sulfolobus tokodaii
38% identity, 52% coverage

Q972A9 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Sulfurisphaera tokodaii (see paper)
ST1218 313aa long hypothetical D-3-phosphoglycerate dehydrogenase from Sulfolobus tokodaii str. 7
38% identity, 52% coverage

TK0683 glyoxylate reductase from Thermococcus kodakaraensis KOD1
35% identity, 53% coverage

ETAE_2949 D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding protein from Edwardsiella tarda EIB202
34% identity, 57% coverage

C0PKN2 phosphoglycerate dehydrogenase from Zea mays
36% identity, 61% coverage

F8AEA4 glyoxylate reductase (NADP+) (EC 1.1.1.79); hydroxypyruvate reductase (EC 1.1.1.81) from Pyrococcus yayanosii (see paper)
PYCH_09300 glyoxylate reductase from Pyrococcus yayanosii CH1
36% identity, 53% coverage

6biiA / F8AEA4 Crystal structure of pyrococcus yayanosii glyoxylate hydroxypyruvate reductase in complex with NADP and malonate (re-refinement of 5aow) (see paper)
36% identity, 53% coverage

NE1688 D-isomer specific 2-hydroxyacid dehydrogenase from Nitrosomonas europaea ATCC 19718
37% identity, 55% coverage

A0KLT8 D-3-phosphoglycerate dehydrogenase from Aeromonas hydrophila subsp. hydrophila (strain ATCC 7966 / DSM 30187 / BCRC 13018 / CCUG 14551 / JCM 1027 / KCTC 2358 / NCIMB 9240 / NCTC 8049)
36% identity, 56% coverage

HI0465 D-3-phosphoglycerate dehydrogenase (serA) from Haemophilus influenzae Rd KW20
33% identity, 61% coverage

YPTB3189 D-3-phosphoglycerate dehydrogenase (PGDH) from Yersinia pseudotuberculosis IP 32953
YPO0914 D-3-phosphoglycerate dehydrogenase from Yersinia pestis CO92
34% identity, 60% coverage

PP_5155 phosphoglycerate dehydrogenase from Pseudomonas putida KT2440
PP5155 D-3-phosphoglycerate dehydrogenase from Pseudomonas putida KT2440
35% identity, 56% coverage

U3RH61 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Acanthamoeba castellanii (see paper)
33% identity, 59% coverage

lmo0078 similar to phosphoglycerate dehydrogenase from Listeria monocytogenes EGD-e
36% identity, 52% coverage

NJ56_RS09235 phosphoglycerate dehydrogenase from Yersinia ruckeri
34% identity, 60% coverage

RAYM_04224 D-2-hydroxyacid dehydrogenase from Riemerella anatipestifer RA-YM
37% identity, 53% coverage

YE3400 D-3-phosphoglycerate dehydrogenase from Yersinia enterocolitica subsp. enterocolitica 8081
33% identity, 60% coverage

PSPTO_5294 D-3-phosphoglycerate dehydrogenase from Pseudomonas syringae pv. tomato str. DC3000
34% identity, 56% coverage

Q6C284 phosphoglycerate dehydrogenase from Yarrowia lipolytica (strain CLIB 122 / E 150)
33% identity, 57% coverage

E4T4B2 D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding protein from Paludibacter propionicigenes (strain DSM 17365 / JCM 13257 / WB4)
36% identity, 53% coverage

PSPPH_4885 D-3-phosphoglycerate dehydrogenase from Pseudomonas syringae pv. phaseolicola 1448A
34% identity, 56% coverage

PITG_13165 D-3-phosphoglycerate dehydrogenase, putative from Phytophthora infestans T30-4
27% identity, 86% coverage

OEOE_0025 Lactate dehydrogenase related enzyme from Oenococcus oeni PSU-1
33% identity, 56% coverage

BC3248 D-3-phosphoglycerate dehydrogenase from Bacillus cereus ATCC 14579
36% identity, 59% coverage

Psyr_4852 Amino acid-binding ACT:D-isomer specific 2-hydroxyacid dehydrogenase, catalytic region:D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding from Pseudomonas syringae pv. syringae B728a
33% identity, 56% coverage

PMI2031 D-3-phosphoglycerate dehydrogenase from Proteus mirabilis HI4320
34% identity, 56% coverage

DDB_G0281071 3-phosphoglycerate dehydrogenase from Dictyostelium discoideum AX4
34% identity, 56% coverage

A4VGK3 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Pseudomonas stutzeri (see paper)
35% identity, 56% coverage

Echvi_2777 fused D-3-phosphoglycerate dehydrogenase / phosphoserine phosphatase (EC 1.1.1.95; EC 3.1.3.3) from Echinicola vietnamensis KMM 6221, DSM 17526
33% identity, 52% coverage

FTT_1230 D-3-phosphoglycerate dehydrogenase from Francisella tularensis subsp. tularensis SCHU S4
33% identity, 57% coverage

2p9eA / P0A9T0 Crystal structure of g336v mutant of e.Coli phosphoglycerate dehydrogenase (see paper)
33% identity, 56% coverage

Pcar_0629 3-phosphoglycerate dehydrogenase from Pelobacter carbinolicus str. DSM 2380
35% identity, 54% coverage

Q81FZ7 Glyoxylate reductase (NADP+) from Bacillus cereus (strain ATCC 14579 / DSM 31 / CCUG 7414 / JCM 2152 / NBRC 15305 / NCIMB 9373 / NCTC 2599 / NRRL B-3711)
39% identity, 56% coverage

OEOE_1672 Lactate dehydrogenase related enzyme from Oenococcus oeni PSU-1
39% identity, 45% coverage

Clo1313_0586 phosphoglycerate dehydrogenase from Acetivibrio thermocellus DSM 1313
38% identity, 49% coverage

SerA / b2913 phosphoglycerate dehydrogenase (EC 1.1.1.95; EC 1.1.1.399) from Escherichia coli K-12 substr. MG1655 (see 36 papers)
serA / P0A9T0 phosphoglycerate dehydrogenase (EC 1.1.1.95; EC 1.1.1.399) from Escherichia coli (strain K12) (see 36 papers)
SERA_ECOLI / P0A9T0 D-3-phosphoglycerate dehydrogenase; PGDH; 2-oxoglutarate reductase; EC 1.1.1.95; EC 1.1.1.399 from Escherichia coli (strain K12) (see paper)
C3SVM7 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Escherichia coli (see paper)
P0A9T0 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Escherichia coli (see 7 papers)
serA / RF|NP_417388 D-3-phosphoglycerate dehydrogenase; EC 1.1.1.95 from Escherichia coli K12 (see 5 papers)
NP_417388 phosphoglycerate dehydrogenase from Escherichia coli str. K-12 substr. MG1655
b2913 D-3-phosphoglycerate dehydrogenase from Escherichia coli str. K-12 substr. MG1655
c3494 D-3-phosphoglycerate dehydrogenase from Escherichia coli CFT073
ECs3784 D-3-phosphoglycerate dehydrogenase from Escherichia coli O157:H7 str. Sakai
B21_RS14345, ETEC_3105 phosphoglycerate dehydrogenase from Escherichia coli ETEC H10407
33% identity, 56% coverage

Q5HLU4 Putative 2-hydroxyacid dehydrogenase SERP1888 from Staphylococcus epidermidis (strain ATCC 35984 / DSM 28319 / BCRC 17069 / CCUG 31568 / BM 3577 / RP62A)
39% identity, 52% coverage

BMEII0813 D-3-PHOSPHOGLYCERATE DEHYDROGENASE from Brucella melitensis 16M
35% identity, 56% coverage

DSY4020 hypothetical protein from Desulfitobacterium hafniense Y51
Dhaf_1344 D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding from Desulfitobacterium hafniense DCB-2
34% identity, 56% coverage

Q8CNB8 Putative 2-hydroxyacid dehydrogenase SE_1879 from Staphylococcus epidermidis (strain ATCC 12228 / FDA PCI 1200)
39% identity, 52% coverage

slcC / Q1QWN6 S-sulfolactate dehydrogenase subunit (EC 1.1.1.310) from Chromohalobacter salexigens (strain ATCC BAA-138 / DSM 3043 / CIP 106854 / NCIMB 13768 / 1H11) (see paper)
SLCC_CHRSD / Q1QWN6 (S)-sulfolactate dehydrogenase; (2S)-3-sulfolactate dehydrogenase; (S)-sulfolactate oxidoreductase; EC 1.1.1.310 from Chromohalobacter salexigens (strain ATCC BAA-138 / DSM 3043 / CIP 106854 / NCIMB 13768 / 1H11) (see paper)
33% identity, 54% coverage

PA14_04110 D-3-phosphoglycerate dehydrogenase from Pseudomonas aeruginosa UCBPP-PA14
34% identity, 56% coverage

Q9I6H5 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Pseudomonas aeruginosa (see paper)
PA0316 D-3-phosphoglycerate dehydrogenase from Pseudomonas aeruginosa PAO1
NP_249007 D-3-phosphoglycerate dehydrogenase from Pseudomonas aeruginosa PAO1
34% identity, 56% coverage

E9AG23 D-3-phosphoglycerate dehydrogenase-like protein from Leishmania infantum
33% identity, 61% coverage

BAB2_0783 Amino acid-binding ACT:D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain:D-isomer specific 2-hydroxyacid dehydr... from Brucella melitensis biovar Abortus 2308
34% identity, 56% coverage

STM14_3699 phosphoglycerate dehydrogenase from Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S
STM3062 D-3-phosphoglycerate dehydrogenase from Salmonella typhimurium LT2
33% identity, 56% coverage

SAR2389 putative D-isomer specific 2-hydroxyacid dehydrogenase from Staphylococcus aureus subsp. aureus MRSA252
Q6GEC9 Putative 2-hydroxyacid dehydrogenase SAR2389 from Staphylococcus aureus (strain MRSA252)
38% identity, 52% coverage

DSVG11_0961 phosphoglycerate dehydrogenase from Desulfovibrio sp. G11
35% identity, 54% coverage

SAUSA300_2254 glycerate dehydrogenase-like protein from Staphylococcus aureus subsp. aureus USA300_FPR3757
MW2224 ORFID:MW2224~hypothetical protein, similar to glycerate dehydrogenase from Staphylococcus aureus subsp. aureus MW2
38% identity, 52% coverage

SAOV_2344 2-hydroxyacid dehydrogenase family protein from Staphylococcus aureus subsp. aureus ED133
38% identity, 52% coverage

BIF_01365 phosphoglycerate dehydrogenase from Bifidobacterium animalis subsp. lactis BB-12
33% identity, 57% coverage

Q608T2 hydroxypyruvate reductase (EC 1.1.1.81) from Methylococcus capsulatus (see paper)
34% identity, 51% coverage

DSY1673 hypothetical protein from Desulfitobacterium hafniense Y51
31% identity, 57% coverage

Dhaf_2820 Glyoxylate reductase from Desulfitobacterium hafniense DCB-2
31% identity, 57% coverage

SAV2305 glycerate dehydrogenase from Staphylococcus aureus subsp. aureus Mu50
SA2098 glycerate dehydrogenase from Staphylococcus aureus subsp. aureus N315
38% identity, 52% coverage

SSU0623 D-isomer specific 2-hydroxyacid dehydrogenase from Streptococcus suis P1/7
ZY05719_03245 phosphoglycerate dehydrogenase from Streptococcus suis
33% identity, 64% coverage

lpg0242 D-3-phosphoglycerate dehydrogenase from Legionella pneumophila subsp. pneumophila str. Philadelphia 1
39% identity, 46% coverage

BMEI1952 gluconate 2-dehydrogenase from Brucella melitensis 16M
36% identity, 49% coverage

BPHYT_RS11290 2-ketogluconate 6-phosphate reductase (EC 1.1.1.43) from Burkholderia phytofirmans PsJN
34% identity, 56% coverage

SMa2137 probable glycerate from Sinorhizobium meliloti 1021
36% identity, 49% coverage

BPSL1577 2-ketogluconate reductase from Burkholderia pseudomallei K96243
37% identity, 49% coverage

PITG_10264 D-3-phosphoglycerate dehydrogenase from Phytophthora infestans T30-4
39% identity, 45% coverage

PH0597 glyoxylate reductase from Pyrococcus horikoshii OT3
O58320 Glyoxylate reductase from Pyrococcus horikoshii (strain ATCC 700860 / DSM 12428 / JCM 9974 / NBRC 100139 / OT-3)
36% identity, 53% coverage

2dbqA / O58320 Crystal structure of glyoxylate reductase (ph0597) from pyrococcus horikoshii ot3, complexed with NADP (i41) (see paper)
36% identity, 53% coverage

Mchl_2132 D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding from Methylobacterium chloromethanicum CM4
40% identity, 44% coverage

YPO1288 putative D-isomer specific 2-hydroxyacid dehydrogenase family protein from Yersinia pestis CO92
35% identity, 45% coverage

BP1026B_I0581 2-hydroxyacid dehydrogenase from Burkholderia pseudomallei 1026b
36% identity, 46% coverage

C645_RS00700, WP_005672300 2-hydroxyacid dehydrogenase from Haemophilus influenzae
38% identity, 44% coverage

BLJ_1306 phosphoglycerate dehydrogenase from Bifidobacterium longum subsp. longum JDM301
32% identity, 57% coverage

STM3646 2-keto-D-gluconate reductase from Salmonella typhimurium LT2
35% identity, 48% coverage

Blon_0845 D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding from Bifidobacterium longum subsp. infantis ATCC 15697
33% identity, 52% coverage

Ssal_00539 3-phosphoglycerate dehydrogenase family protein from Streptococcus salivarius 57.I
34% identity, 56% coverage

slr2123 D-isomer specific 2-hydroxyacid dehydrogenase family from Synechocystis sp. PCC 6803
36% identity, 47% coverage

hprA / Q59516 hydroxypyruvate reductase subunit (EC 1.1.1.81) from Methylorubrum extorquens (strain ATCC 14718 / DSM 1338 / JCM 2805 / NCIMB 9133 / AM1) (see 2 papers)
39% identity, 44% coverage

D3F4F1 D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding protein from Conexibacter woesei (strain DSM 14684 / CCUG 47730 / CIP 108061 / JCM 11494 / NBRC 100937 / ID131577)
37% identity, 57% coverage

L0084 D-3-phosphoglycerate dehydrogenase (EC 1.1.1.95) from Lactococcus lactis subsp. lactis Il1403
33% identity, 58% coverage

WP_124144079 phosphonate dehydrogenase from Okeania hirsuta
31% identity, 56% coverage

Dhaf_2079 D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding from Desulfitobacterium hafniense DCB-2
34% identity, 47% coverage

Q0W9V5 Glyoxylate/hydroxypyruvate reductase B from Yersinia pestis
34% identity, 48% coverage

SEET0819_01960 glyoxylate/hydroxypyruvate reductase GhrB from Salmonella enterica subsp. enterica serovar Tennessee str.
34% identity, 48% coverage

DVU0339 D-isomer specific 2-hydroxyacid dehydrogenase family protein from Desulfovibrio vulgaris Hildenborough
35% identity, 49% coverage

DvMF_1902 D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding from Desulfovibrio vulgaris str. Miyazaki F
35% identity, 46% coverage

lp_0203 phosphoglycerate dehydrogenase from Lactobacillus plantarum WCFS1
34% identity, 54% coverage

YPO2329 D-lactate dehydrogenase from Yersinia pestis CO92
YPTB2248 D-lactate dehydrogenase from Yersinia pseudotuberculosis IP 32953
35% identity, 45% coverage

NMB1685 D-lactate dehydrogenase from Neisseria meningitidis MC58
37% identity, 45% coverage

HPR_BLAHS / C0CMQ8 Hydroxypyruvate reductase; EC 1.1.1.81 from Blautia hydrogenotrophica (strain DSM 10507 / JCM 14656 / S5a33) (Ruminococcus hydrogenotrophicus) (see paper)
31% identity, 53% coverage

NSP_RS15555 phosphonate dehydrogenase from Nodularia spumigena CCY9414
31% identity, 56% coverage

F8A9V0 D-lactate dehydrogenase (EC 1.1.1.28) from Thermodesulfatator indicus (see paper)
39% identity, 41% coverage

gbs1619 Unknown from Streptococcus agalactiae NEM316
32% identity, 64% coverage

PA3896 probable 2-hydroxyacid dehydrogenase from Pseudomonas aeruginosa PAO1
35% identity, 45% coverage

TkrA / b3553 glyoxylate reductase (EC 1.1.1.79; EC 1.1.1.215) from Escherichia coli K-12 substr. MG1655 (see 3 papers)
ghrB / P37666 glyoxylate reductase (EC 1.1.1.79; EC 1.1.1.215) from Escherichia coli (strain K12) (see 4 papers)
GHRB_ECOLI / P37666 Glyoxylate/hydroxypyruvate reductase B; 2-ketoaldonate reductase; 2-ketogluconate reductase; 2KR; EC 1.1.1.79; EC 1.1.1.81; EC 1.1.1.215 from Escherichia coli (strain K12) (see 2 papers)
ghrB / GB|AAC76577.2 glyoxylate/hydroxypyruvate reductase B; EC 1.1.1.215; EC 1.1.1.79; EC 1.1.1.81 from Escherichia coli K12 (see 6 papers)
NP_418009 glyoxylate reductase from Escherichia coli str. K-12 substr. MG1655
34% identity, 48% coverage

PG_2171 2-hydroxyacid dehydrogenase family protein from Porphyromonas gingivalis W83
34% identity, 56% coverage

ECs4438 2-hydroxyacid dehydrogenase from Escherichia coli O157:H7 str. Sakai
34% identity, 48% coverage

TON_0569 D-isomer specific 2-hydroxyacid dehydrogenase from Thermococcus onnurineus NA1
32% identity, 58% coverage

NGO1336 putative dehydrogenase, lactate from Neisseria gonorrhoeae FA 1090
36% identity, 45% coverage

sS8_1475 2-hydroxyacid dehydrogenase from Methylocaldum marinum
33% identity, 51% coverage

WP_265276617 phosphonate dehydrogenase from Nostoc sp. KVJ3
31% identity, 52% coverage

EIP97_RS14860 2-hydroxyacid dehydrogenase from Pseudomonas aeruginosa UCBPP-PA14
36% identity, 47% coverage

PG1279 D-isomer specific 2-hydroxyacid dehydrogenase family protein from Porphyromonas gingivalis W83
32% identity, 56% coverage

BL02138 2-hydroxyacid dehydrogenase from Bacillus licheniformis DSM 13 = ATCC 14580
34% identity, 47% coverage

EC958_3959 glyoxylate/hydroxypyruvate reductase GhrB from Escherichia coli O25b:H4-ST131
34% identity, 48% coverage

4zgsA / B0LUZ5 Identification of the pyruvate reductase of chlamydomonas reinhardtii (see paper)
34% identity, 48% coverage

PP3376 2-ketogluconate 6-phosphate reductase from Pseudomonas putida KT2440
PP_3376 2-hydroxyacid dehydrogenase from Pseudomonas putida KT2440
35% identity, 49% coverage

TK0551 D-isomer specific 2-hydroxyacid dehydrogenase from Thermococcus kodakaraensis KOD1
30% identity, 56% coverage

DSY0996 hypothetical protein from Desulfitobacterium hafniense Y51
33% identity, 47% coverage

PA2263 probable 2-hydroxyacid dehydrogenase from Pseudomonas aeruginosa PAO1
35% identity, 47% coverage

BCAL3179 probable D-lactate dehydrogenase from Burkholderia cenocepacia J2315
37% identity, 43% coverage

AO353_26885 phosphogluconate 2-dehydrogenase KguD (EC 1.1.1.43) from Pseudomonas fluorescens FW300-N2E3
33% identity, 54% coverage

PF0370 phosphoglycerate dehydrogenase from Pyrococcus furiosus DSM 3638
30% identity, 56% coverage

SMc01622 PUTATIVE OXIDOREDUCTASE PROTEIN from Sinorhizobium meliloti 1021
34% identity, 49% coverage

kguD / Q5TM04 2-dehydro-6-phosphogluconate reductase (EC 1.1.1.43) from Pseudomonas fluorescens (strain ATCC BAA-477 / NRRL B-23932 / Pf-5) (see 2 papers)
32% identity, 54% coverage

PHYSODRAFT_503916 hypothetical protein from Phytophthora sojae
37% identity, 44% coverage

ETAE_1771 D-lactate dehydrogenase from Edwardsiella tarda EIB202
34% identity, 45% coverage

lmo2824 similar to D-3-phosphoglycerate dehydrogenase from Listeria monocytogenes EGD-e
LMON_2843 phosphoglycerate dehydrogenase from Listeria monocytogenes EGD
33% identity, 55% coverage

Tery_0368 D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding from Trichodesmium erythraeum IMS101
30% identity, 56% coverage

Thicy_1457 2-hydroxyacid dehydrogenase from Thiomicrospira cyclica ALM1
36% identity, 41% coverage

PGDH1 / A2DLU8 phosphoglycerate dehydrogenase (EC 1.1.1.95) from Trichomonas vaginalis (strain ATCC PRA-98 / G3) (see paper)
33% identity, 56% coverage

ZMO_RS05565 2-hydroxyacid dehydrogenase from Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821
ZMO1237 D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding protein from Zymomonas mobilis subsp. mobilis ZM4
34% identity, 44% coverage

O66939 D-lactate dehydrogenase (EC 1.1.1.28) from Aquifex aeolicus (see paper)
3kb6B / O66939 Crystal structure of d-lactate dehydrogenase from aquifex aeolicus complexed with NAD and lactic acid (see paper)
aq_727 D-lactate dehydrogenase from Aquifex aeolicus VF5
33% identity, 46% coverage

1sc6D / P0A9T0 Crystal structure of w139g d-3-phosphoglycerate dehydrogenase complexed with NAD+ (see paper)
32% identity, 56% coverage

alr0058 D-lactate dehydrogenase from Nostoc sp. PCC 7120
34% identity, 45% coverage

PS417_12555 phosphogluconate 2-dehydrogenase KguD (EC 1.1.1.43) from Pseudomonas simiae WCS417
33% identity, 51% coverage

DVU1412 D-isomer specific 2-hydroxyacid dehydrogenase family protein from Desulfovibrio vulgaris Hildenborough
33% identity, 47% coverage

BRADO4794 putative D-3-phosphoglycerate dehydrogenase from Bradyrhizobium sp. ORS278
34% identity, 49% coverage

WP_049588155 2-hydroxyacid dehydrogenase from Pseudomonas alloputida
35% identity, 46% coverage

PP1649, PP_1649 D-lactate dehydrogenase from Pseudomonas putida KT2440
35% identity, 46% coverage

PG1190 glycerate dehydrogenase from Porphyromonas gingivalis W83
HMPREF1322_RS05995 D-2-hydroxyacid dehydrogenase from Porphyromonas gingivalis W50
34% identity, 47% coverage

HtpH / b1380 D-lactate dehydrogenase (EC 1.1.1.28) from Escherichia coli K-12 substr. MG1655 (see 4 papers)
ldhA / P52643 D-lactate dehydrogenase (EC 1.1.1.28) from Escherichia coli (strain K12) (see 22 papers)
ldhA D-lactate dehydrogenase; EC 1.1.1.28 from Escherichia coli K12 (see 7 papers)
NP_415898 D-lactate dehydrogenase from Escherichia coli str. K-12 substr. MG1655
P52643 D-lactate dehydrogenase from Escherichia coli (strain K12)
b1380 D-lactate dehydrogenase from Escherichia coli str. K-12 substr. MG1655
36% identity, 43% coverage

A9762_00050 NAD(P)-dependent oxidoreductase from Pandoraea sp. ISTKB
35% identity, 47% coverage

New Search

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 793,807 different protein sequences to 1,259,118 scientific articles. Searches against EuropePMC were last performed on March 13 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.

PaperBLAST also incorporates manually-curated protein functions:

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

Changes to PaperBLAST since the paper was written:

Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's web site and is available in PubmedCentral or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keeseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory