PaperBLAST – Find papers about a protein or its homologs

 

PaperBLAST Hits for Q93ZS6 AT3g05090/T12H1_5 (Arabidopsis thaliana) (753 a.a., MHRVGSAGSN...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmembrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Found 698 similar proteins in the literature:

AT3G05090 transducin family protein / WD-40 repeat family protein from Arabidopsis thaliana
Q93ZS6 AT3g05090/T12H1_5 from Arabidopsis thaliana
100% identity, 100% coverage

F6HTW0 Uncharacterized protein from Vitis vinifera
74% identity, 93% coverage

NP_001026135 WD repeat-containing protein 48 from Gallus gallus
36% identity, 99% coverage

NP_080512 WD repeat-containing protein 48 isoform 1 from Mus musculus
35% identity, 99% coverage

WDR48_HUMAN / Q8TAF3 WD repeat-containing protein 48; USP1-associated factor 1; WD repeat endosomal protein; p80 from Homo sapiens (Human) (see 18 papers)
NP_065890 WD repeat-containing protein 48 isoform 1 from Homo sapiens
35% identity, 99% coverage

D3Z8C7 WD repeat-containing protein 48 from Rattus norvegicus
34% identity, 97% coverage

5l8eA / Q8TAF3 Structure of uaf1 (see paper)
40% identity, 64% coverage

WDR48_CAEEL / Q20059 WD repeat-containing protein 48 homolog from Caenorhabditis elegans (see paper)
29% identity, 97% coverage

NP_497931 WD repeat-containing protein 48 homolog from Caenorhabditis elegans
29% identity, 97% coverage

Q7PXD9 WD repeat-containing protein 48 homolog from Anopheles gambiae
30% identity, 98% coverage

WDR48_DROME / Q1LZ08 WD repeat-containing protein 48 homolog; USP1-associated factor 1 from Drosophila melanogaster (Fruit fly) (see paper)
29% identity, 98% coverage

BU107_SCHPO / Q09731 UBP9-binding protein bun107; Binding ubp9 protein of 107 kDa from Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast) (see paper)
SPAC31A2.14 WD repeat protein, human WDR48 family from Schizosaccharomyces pombe
29% identity, 54% coverage

Afu7g08290 vegetative incompatibility WD repeat protein, putative from Aspergillus fumigatus Af293
28% identity, 45% coverage

Q00808 Vegetative incompatibility protein HET-E-1 from Podospora anserina
32% identity, 21% coverage

alr7129 WD-repeat protein from Nostoc sp. PCC 7120
29% identity, 24% coverage

AFUA_7G07030, Afu7g07030 vegetative incompatibility WD repeat protein, putative from Aspergillus fumigatus Af293
30% identity, 37% coverage

Ava_2183 Peptidase C14, caspase catalytic subunit p20 from Anabaena variabilis ATCC 29413
27% identity, 18% coverage

6y7pA The complex between the eight-bladed symmetrical designer protein tako8 and 1:2 zirconium(iv) wells-dawson (zrwd)
30% identity, 39% coverage

6y7oC The complex between the eight-bladed symmetrical designer protein tako8 and the silicotungstic acid keggin (sta)
30% identity, 42% coverage

Ava_2629 Possible Transcriptional Regulator, Fis family from Anabaena variabilis ATCC 29413
27% identity, 24% coverage

NLE1_HUMAN / Q9NVX2 Notchless protein homolog 1 from Homo sapiens (Human) (see 2 papers)
NP_060566 notchless protein homolog 1 isoform a from Homo sapiens
28% identity, 36% coverage

alr4877 WD-repeat protein from Nostoc sp. PCC 7120
29% identity, 39% coverage

Q58D20 Notchless protein homolog 1 from Bos taurus
27% identity, 39% coverage

NLE1_MOUSE / Q8VEJ4 Notchless protein homolog 1 from Mus musculus (Mouse) (see 2 papers)
NP_663406 notchless protein homolog 1 from Mus musculus
27% identity, 36% coverage

Q3TC83 Notchless protein homolog 1 from Mus musculus
27% identity, 36% coverage

P49695 Probable serine/threonine-protein kinase PkwA from Thermomonospora curvata
28% identity, 41% coverage

8q1nA / P61964 Cyclic peptide binder of the wbm-site of wdr5 (see paper)
25% identity, 43% coverage

8inkW / Q9NVX2 8inkW (see paper)
28% identity, 36% coverage

Glo7428_1095 caspase family protein from Gloeocapsa sp. PCC 7428
27% identity, 36% coverage

NP_001086974 WD repeat domain 5 L homeolog from Xenopus laevis
26% identity, 39% coverage

LOC105664430 protein will die slowly from Ceratitis capitata
26% identity, 39% coverage

NP_001011411 WD repeat-containing protein 5 from Xenopus tropicalis
26% identity, 39% coverage

NLE1_XENLA / Q7ZXK9 Notchless protein homolog 1 from Xenopus laevis (African clawed frog) (see paper)
28% identity, 36% coverage

LOC105559784 coatomer subunit alpha from Vollenhovia emeryi
25% identity, 24% coverage

NP_001006198 WD repeat-containing protein 5 from Gallus gallus
26% identity, 39% coverage

LOTGIDRAFT_218145 hypothetical protein from Lottia gigantea
25% identity, 26% coverage

WDR5_HUMAN / P61964 WD repeat-containing protein 5; BMP2-induced 3-kb gene protein from Homo sapiens (Human) (see 26 papers)
WDR5_MOUSE / P61965 WD repeat-containing protein 5; BMP2-induced 3-kb gene protein; WD repeat-containing protein BIG-3 from Mus musculus (Mouse) (see 5 papers)
WDR5_RAT / Q498M4 WD repeat-containing protein 5 from Rattus norvegicus (Rat) (see paper)
NP_543124 WD repeat-containing protein 5 from Mus musculus
26% identity, 39% coverage

Q2KIG2 WD repeat-containing protein 5 from Bos taurus
26% identity, 39% coverage

XP_003353752 WD repeat-containing protein 5 from Sus scrofa
26% identity, 39% coverage

WDR51_CAEEL / Q17963 WD repeat-containing protein wdr-5.1 from Caenorhabditis elegans (see 11 papers)
NP_497749 WD repeat-containing protein wdr-5.1 from Caenorhabditis elegans
26% identity, 40% coverage

WDR53_CAEEL / Q23256 WD repeat-containing protein wdr-5.3 from Caenorhabditis elegans (see 2 papers)
28% identity, 35% coverage

WDR52_CAEEL / Q93847 WD repeat-containing protein wdr-5.2 from Caenorhabditis elegans (see paper)
26% identity, 39% coverage

XP_002127700 WD repeat-containing protein 5 from Ciona intestinalis
25% identity, 42% coverage

NP_491069 Coatomer subunit alpha from Caenorhabditis elegans
25% identity, 28% coverage

LOC5577214, XP_001663309 coatomer subunit alpha from Aedes aegypti
24% identity, 26% coverage

NP_001083934 lissencephaly-1 homolog from Xenopus laevis
26% identity, 36% coverage

LACBIDRAFT_395470 uncharacterized protein from Laccaria bicolor S238N-H82
26% identity, 26% coverage

V5HP83 Putative copi vesicle coat from Ixodes ricinus
24% identity, 44% coverage

WDR5B_MOUSE / Q9D7H2 WD repeat-containing protein 5B from Mus musculus (Mouse) (see paper)
26% identity, 37% coverage

DAW1_CHLRE / Q3Y8L7 Dynein assembly factor with WD repeat domains 1; Outer row dynein assembly protein 16 from Chlamydomonas reinhardtii (Chlamydomonas smithii) (see paper)
28% identity, 37% coverage

Q86VZ2 WD repeat-containing protein 5B from Homo sapiens
NP_061942 WD repeat-containing protein 5B from Homo sapiens
25% identity, 40% coverage

Ava_4855 Serine/Threonine protein kinase with WD40 repeats from Anabaena variabilis ATCC 29413
28% identity, 37% coverage

Q9W0B8 Coatomer subunit alpha from Drosophila melanogaster
25% identity, 24% coverage

Tery_0184 serine/threonine protein kinase with WD40 repeats from Trichodesmium erythraeum IMS101
27% identity, 42% coverage

CG10931 uncharacterized protein from Drosophila melanogaster
27% identity, 42% coverage

LIS1_DICDI / Q8I0F4 Lissencephaly-1 homolog; DdLIS1 from Dictyostelium discoideum (Social amoeba) (see paper)
lis1 / CAD55133.1 LIS1 protein from Dictyostelium discoideum (see paper)
30% identity, 37% coverage

An16g02460 uncharacterized protein from Aspergillus niger
23% identity, 33% coverage

alr4559 WD-40 repeat-protein from Nostoc sp. PCC 7120
26% identity, 33% coverage

Q94A40 Coatomer subunit alpha-1 from Arabidopsis thaliana
AT1G62020 coatomer protein complex, subunit alpha, putative from Arabidopsis thaliana
25% identity, 22% coverage

H9L3L2 Coatomer subunit alpha from Gallus gallus
23% identity, 26% coverage

NP_001100950 striatin-4 isoform 1 from Rattus norvegicus
F1M6V8 Striatin 4 from Rattus norvegicus
25% identity, 36% coverage

copA coatomer alpha subunit from Emericella nidulans (see 2 papers)
23% identity, 33% coverage

all3169 WD repeat protein with Ser/Thr protein kinase motif from Nostoc sp. PCC 7120
24% identity, 38% coverage

XP_001928732 coatomer subunit alpha isoform X1 from Sus scrofa
24% identity, 23% coverage

L8I566 Coatomer subunit alpha (Fragment) from Bos mutus
24% identity, 24% coverage

XP_005203500 coatomer subunit alpha isoform X1 from Bos taurus
24% identity, 23% coverage

Q96WV5 Putative coatomer subunit alpha from Schizosaccharomyces pombe (strain 972 / ATCC 24843)
25% identity, 22% coverage

NCU05939 cell division control protein 4 from Neurospora crassa OR74A
29% identity, 25% coverage

Q27954 Coatomer subunit alpha from Bos taurus
24% identity, 24% coverage

F8WHL2 Coatomer subunit alpha from Mus musculus
24% identity, 23% coverage

Ava_2184 Peptidase C14, caspase catalytic subunit p20 from Anabaena variabilis ATCC 29413
27% identity, 24% coverage

B5DFK1 Coatomer subunit alpha from Rattus norvegicus
24% identity, 24% coverage

NP_001099115 coatomer subunit alpha from Bos taurus
24% identity, 24% coverage

DAW1_HUMAN / Q8N136 Dynein assembly factor with WD repeat domains 1; Outer row dynein assembly protein 16 homolog; WD repeat-containing protein 69 from Homo sapiens (Human) (see 3 papers)
27% identity, 35% coverage

COPA_MOUSE / Q8CIE6 Coatomer subunit alpha; Alpha-coat protein; Alpha-COP from Mus musculus (Mouse) (see paper)
NP_034068 coatomer subunit alpha from Mus musculus
24% identity, 24% coverage

COPA_HUMAN / P53621 Coatomer subunit alpha; Alpha-coat protein; Alpha-COP; HEP-COP; HEPCOP from Homo sapiens (Human) (see 4 papers)
NP_004362 coatomer subunit alpha isoform 2 from Homo sapiens
23% identity, 26% coverage

alr3119 WD repeat protein with Ser/Thr protein kinase motif from Nostoc sp. PCC 7120
27% identity, 37% coverage

CNC05910 hypothetical protein from Cryptococcus neoformans var. neoformans JEC21
25% identity, 31% coverage

NLE1_YEAST / P25382 Ribosome assembly protein 4; Notchless protein homolog 1; Ribosome biogenesis factor RSA4 from Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (see 6 papers)
NP_009997, YCR072C Rsa4p from Saccharomyces cerevisiae
NP_009997 Rsa4p from Saccharomyces cerevisiae S288C
26% identity, 36% coverage

K4BVH7 Coatomer subunit alpha from Solanum lycopersicum
25% identity, 22% coverage

7uoox / P25382 7uoox (see paper)
26% identity, 36% coverage

8xi2T / A0A2K3DAW8 Cryo-em structure of the chlamydomonas c Complex (see paper)
26% identity, 44% coverage

SPSK_02314 glucose repression regulatory protein TUP1 from Sporothrix schenckii 1099-18
29% identity, 35% coverage

UABAM_01722 DUF4062 domain-containing protein from Candidatus Uabimicrobium amorphum
23% identity, 23% coverage

8ro0E / Q19211 8ro0E (see paper)
25% identity, 42% coverage

SPBR_00318 glucose repression regulatory protein TUP1 from Sporothrix brasiliensis 5110
29% identity, 35% coverage

Tsp_00685 lissencephaly-1 from Trichinella spiralis
27% identity, 19% coverage

Tery_4467 serine/threonine protein kinase with WD40 repeats from Trichodesmium erythraeum IMS101
24% identity, 34% coverage

slr8038 WD-repeat protein from Synechocystis sp. PCC 6803
26% identity, 21% coverage

AT2G21390 coatomer protein complex, subunit alpha, putative from Arabidopsis thaliana
25% identity, 22% coverage

B0XYA9 Coatomer subunit alpha from Aspergillus fumigatus (strain CBS 144.89 / FGSC A1163 / CEA10)
23% identity, 33% coverage

Q70I39 Coatomer subunit alpha from Lotus japonicus
24% identity, 22% coverage

LIS1B_DANRE / Q803D2 Lissencephaly-1 homolog B; Platelet-activating factor acetylhydrolase IB subunit alpha b from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
24% identity, 41% coverage

Ava_3867 Serine/Threonine protein kinase with WD40 repeats from Anabaena variabilis ATCC 29413
25% identity, 38% coverage

UABAM_04996 caspase family protein from Candidatus Uabimicrobium amorphum
25% identity, 16% coverage

L7IXA5 Coatomer subunit alpha from Pyricularia oryzae (strain P131)
24% identity, 26% coverage

PRL1_ARATH / Q42384 Protein pleiotropic regulatory locus 1; Protein PRL1; MOS4-associated complex protein 2; MAC protein 2 from Arabidopsis thaliana (Mouse-ear cress) (see 10 papers)
NP_193325 pleiotropic regulatory locus 1 from Arabidopsis thaliana
AT4G15900 PRL1 (PLEIOTROPIC REGULATORY LOCUS 1); basal transcription repressor/ nucleotide binding / protein binding from Arabidopsis thaliana
25% identity, 45% coverage

all0664 WD-40 repeat protein from Nostoc sp. PCC 7120
23% identity, 34% coverage

Ava_2851 Serine/Threonine protein kinase with WD40 repeats from Anabaena variabilis ATCC 29413
28% identity, 31% coverage

all0438 serine/threonine kinase with WD-40 repeat from Nostoc sp. PCC 7120
28% identity, 31% coverage

CNAG_03554 coatomer protein complex, subunit alpha (xenin) from Cryptococcus neoformans var. grubii H99
24% identity, 23% coverage

FBXW7_RAT / D3Z902 F-box/WD repeat-containing protein 7 from Rattus norvegicus (Rat) (see paper)
NP_001406457 F-box/WD repeat-containing protein 7 from Rattus norvegicus
24% identity, 47% coverage

FBXW7_MOUSE / Q8VBV4 F-box/WD repeat-containing protein 7; F-box and WD-40 domain-containing protein 7; F-box protein FBW7; F-box protein Fbxw6; F-box-WD40 repeat protein 6; SEL-10 from Mus musculus (Mouse) (see 7 papers)
XP_006501719 F-box/WD repeat-containing protein 7 isoform X1 from Mus musculus
24% identity, 47% coverage

PAAG_00103 WD repeat-containing protein from Paracoccidioides lutzii Pb01
25% identity, 37% coverage

LIS1_USTMA / Q4P9P9 Nuclear distribution protein PAC1; Lissencephaly-1 homolog; LIS-1; nudF homolog from Ustilago maydis (strain 521 / FGSC 9021) (Corn smut fungus) (see paper)
28% identity, 31% coverage

CNG00480 coatomer alpha subunit from Cryptococcus neoformans var. neoformans JEC21
24% identity, 23% coverage

NP_001359172 dynein assembly factor with WD repeat domains 1 isoform b from Mus musculus
26% identity, 35% coverage

SEL10_CAEEL / Q93794 F-box/WD repeat-containing protein sel-10; Egg laying defective protein 41; Suppressor/enhancer of lin-12 protein 10 from Caenorhabditis elegans (see 9 papers)
NP_506421 F-box/WD repeat-containing protein sel-10 from Caenorhabditis elegans
26% identity, 33% coverage

NP_001164280 archipelago from Tribolium castaneum
28% identity, 37% coverage

Q22D06 WD domain, G-beta repeat protein from Tetrahymena thermophila (strain SB210)
20% identity, 12% coverage

FBXW7_BOVIN / F1MNN4 F-box/WD repeat-containing protein 7; F-box and WD-40 domain-containing protein 7 from Bos taurus (Bovine) (see paper)
24% identity, 47% coverage

L8B5P6 F-box and WD-40 domain-containing protein 7 alpha from Xenopus laevis
24% identity, 47% coverage

XP_044911707 F-box/WD repeat-containing protein 7 isoform X1 from Felis catus
24% identity, 47% coverage

DAW1_MOUSE / D3Z7A5 Dynein assembly factor with WD repeat domains 1 from Mus musculus (Mouse) (see paper)
26% identity, 35% coverage

CNAG_05294 F-box and WD-40 domain-containing protein CDC4 from Cryptococcus neoformans var. grubii H99
28% identity, 25% coverage

STRN4_HUMAN / Q9NRL3 Striatin-4; Zinedin from Homo sapiens (Human) (see 2 papers)
NP_037535 striatin-4 isoform 1 from Homo sapiens
25% identity, 37% coverage

FBXW7_HUMAN / Q969H0 F-box/WD repeat-containing protein 7; Archipelago homolog; hAgo; F-box and WD-40 domain-containing protein 7; F-box protein FBX30; SEL-10; hCdc4 from Homo sapiens (Human) (see 28 papers)
24% identity, 47% coverage

KTNB1_DANRE / Q7ZUV2 Katanin p80 WD40 repeat-containing subunit B1; Katanin p80 subunit B1; p80 katanin from Danio rerio (Zebrafish) (Brachydanio rerio) (see 2 papers)
27% identity, 30% coverage

Ava_0039 Peptidase C14, caspase catalytic subunit p20 from Anabaena variabilis ATCC 29413
24% identity, 22% coverage

XP_005171005 F-box/WD repeat-containing protein 7 isoform X3 from Danio rerio
24% identity, 46% coverage

NP_079921 U5 small nuclear ribonucleoprotein 40 kDa protein from Mus musculus
Q6PE01 U5 small nuclear ribonucleoprotein 40 kDa protein from Mus musculus
25% identity, 36% coverage

Tery_3681 serine/threonine protein kinase with WD40 repeats from Trichodesmium erythraeum IMS101
26% identity, 36% coverage

DDB_G0287961 DENN domain-containing protein from Dictyostelium discoideum AX4
25% identity, 16% coverage

FBXW7_DROME / Q9VZF4 F-box/WD repeat-containing protein 7; F-box and WD-40 domain-containing protein 7; Protein archipelago from Drosophila melanogaster (Fruit fly) (see 5 papers)
NP_523922 archipelago, isoform C from Drosophila melanogaster
27% identity, 21% coverage

XP_002517787 notchless protein homolog from Ricinus communis
24% identity, 36% coverage

WDS_DROME / Q9V3J8 Protein will die slowly from Drosophila melanogaster (Fruit fly) (see 7 papers)
NP_001245503 will die slowly, isoform B from Drosophila melanogaster
27% identity, 35% coverage

SNR40_HUMAN / Q96DI7 U5 small nuclear ribonucleoprotein 40 kDa protein; U5 snRNP 40 kDa protein; U5-40K; 38 kDa-splicing factor; Prp8-binding protein; hPRP8BP; U5 snRNP-specific 40 kDa protein; WD repeat-containing protein 57 from Homo sapiens (Human) (see 13 papers)
NP_004805 U5 small nuclear ribonucleoprotein 40 kDa protein from Homo sapiens
25% identity, 36% coverage

AT1G11160 nucleotide binding from Arabidopsis thaliana
27% identity, 20% coverage

POP2_SCHPO / O14170 WD repeat-containing protein pop2; Proteolysis factor sud1 from Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast) (see 4 papers)
pop2 / RF|NP_594956.1 F-box/WD repeat protein Pop2 from Schizosaccharomyces pombe (see 3 papers)
NP_594956 F-box/WD repeat protein Pop2 from Schizosaccharomyces pombe
SPAC4D7.03 F-box/WD repeat protein Pop2 from Schizosaccharomyces pombe
23% identity, 36% coverage

KTN81_ARATH / A0A1P8AW69 Katanin p80 WD40 repeat-containing subunit B1 homolog KTN80.1 from Arabidopsis thaliana (Mouse-ear cress) (see paper)
27% identity, 20% coverage

SPCC16A11.02 U3 snoRNP-associated protein Utp13 (predicted) from Schizosaccharomyces pombe
25% identity, 37% coverage

NCU03244 WD repeat protein from Neurospora crassa OR74A
32% identity, 29% coverage

HPODL_01497 Guanine nucleotide-binding protein subunit beta-like protein from Ogataea parapolymorpha DL-1
28% identity, 27% coverage

Tery_4060 serine/threonine protein kinase with WD40 repeats from Trichodesmium erythraeum IMS101
26% identity, 24% coverage

CND05450 hypothetical protein from Cryptococcus neoformans var. neoformans JEC21
26% identity, 36% coverage

K7EN33 Notchless homolog 1 (Fragment) from Homo sapiens
25% identity, 35% coverage

Cri9333_3253 caspase family protein from Crinalium epipsammum PCC 9333
25% identity, 21% coverage

C1GAF5 Coatomer subunit alpha from Paracoccidioides brasiliensis (strain Pb18)
PADG_04241 coatomer subunit alpha from Paracoccidioides brasiliensis Pb18
24% identity, 23% coverage

B7Z2C8 cDNA FLJ55681, highly similar to F-box/WD repeat protein 7 from Homo sapiens
24% identity, 47% coverage

K1PPW8 Coatomer subunit beta from Magallana gigas
22% identity, 37% coverage

KTNB1_STRPU / O61585 Katanin p80 WD40 repeat-containing subunit B1; Katanin p80 subunit B1; p80 katanin from Strongylocentrotus purpuratus (Purple sea urchin) (see 3 papers)
27% identity, 28% coverage

MHCK A / AAA66070.1 myosin heavy chain kinase A from Dictyostelium discoideum (see paper)
25% identity, 27% coverage

MHCKA_DICDI / P42527 Myosin heavy chain kinase A; MHCK-A; EC 2.7.11.7 from Dictyostelium discoideum (Social amoeba) (see paper)
P42527 myosin-heavy-chain kinase (EC 2.7.11.7) from Dictyostelium discoideum (see 11 papers)
XP_635119 Alpha kinase family protein from Dictyostelium discoideum AX4
25% identity, 27% coverage

8r09E / Q96DI7 8r09E (see paper)
25% identity, 36% coverage

NCU02966 pre-mRNA splicing factor prp46 from Neurospora crassa OR74A
26% identity, 27% coverage

LIS1_CHICK / Q9PTR5 Lissencephaly-1 homolog from Gallus gallus (Chicken) (see paper)
XP_015151147 lissencephaly-1 homolog isoform X1 from Gallus gallus
23% identity, 41% coverage

NP_001300561 F-box domain-containing protein from Caenorhabditis elegans
25% identity, 41% coverage

LIN23_CAEEL / Q09990 F-box/WD repeat-containing protein lin-23; Abnormal cell lineage protein 23 from Caenorhabditis elegans (see paper)
25% identity, 41% coverage

alr0671 WD-repeat protein from Nostoc sp. PCC 7120
27% identity, 33% coverage

NP_001278731 uncharacterized protein LOC100282591 from Zea mays
25% identity, 36% coverage

2ovqB / Q969H0 Structure of the skp1-fbw7-cyclinedegc complex (see paper)
24% identity, 47% coverage

NP_566557 Transducin/WD40 repeat-like superfamily protein from Arabidopsis thaliana
AT3G16650 PP1/PP2A phosphatases pleiotropic regulator 2 (PRL2) from Arabidopsis thaliana
24% identity, 21% coverage

AT3G21540 transducin family protein / WD-40 repeat family protein from Arabidopsis thaliana
36% identity, 12% coverage

EX895_005820 hypothetical protein from Sporisorium graminicola
27% identity, 31% coverage

LIS1_RAT / P63004 Platelet-activating factor acetylhydrolase IB subunit alpha; Lissencephaly-1 protein; LIS-1; PAF acetylhydrolase 45 kDa subunit; PAF-AH 45 kDa subunit; PAF-AH alpha; PAFAH alpha from Rattus norvegicus (Rat) (see 8 papers)
LIS1_MOUSE / P63005 Platelet-activating factor acetylhydrolase IB subunit beta; Lissencephaly-1 protein; LIS-1; PAF acetylhydrolase 45 kDa subunit; PAF-AH 45 kDa subunit; PAF-AH alpha; PAFAH alpha from Mus musculus (Mouse) (see 30 papers)
NP_038653 platelet-activating factor acetylhydrolase IB subunit beta from Mus musculus
23% identity, 41% coverage

LIS1_CAEEL / Q9NDC9 Lissencephaly-1 homolog; Pronuclear migration abnormal protein 1 from Caenorhabditis elegans (see 6 papers)
NP_499755 Lissencephaly-1 homolog from Caenorhabditis elegans
26% identity, 36% coverage

LIS1_BOVIN / P43033 Platelet-activating factor acetylhydrolase IB subunit beta; Lissencephaly-1 protein; LIS-1; PAF acetylhydrolase 45 kDa subunit; PAF-AH 45 kDa subunit; PAF-AH alpha; PAFAH alpha from Bos taurus (Bovine) (see 4 papers)
NP_777088 platelet-activating factor acetylhydrolase IB subunit beta from Bos taurus
23% identity, 41% coverage

LIS1_HUMAN / P43034 Platelet-activating factor acetylhydrolase IB subunit beta; Lissencephaly-1 protein; LIS-1; PAF acetylhydrolase 45 kDa subunit; PAF-AH 45 kDa subunit; PAF-AH alpha; PAFAH alpha from Homo sapiens (Human) (see 16 papers)
XP_016880190 platelet-activating factor acetylhydrolase IB subunit beta isoform X2 from Homo sapiens
23% identity, 41% coverage

CCM_02300 WD repeat-containing protein from Cordyceps militaris CM01
29% identity, 37% coverage

PF3D7_0302000 pre-mRNA-splicing factor PRP46, putative from Plasmodium falciparum 3D7
31% identity, 27% coverage

MGG_08829 transcriptional repressor rco-1 from Pyricularia oryzae 70-15
28% identity, 31% coverage

Tery_1627 serine/threonine protein kinase with WD40 repeats from Trichodesmium erythraeum IMS101
25% identity, 42% coverage

Q7PMU5 Coatomer subunit beta' from Anopheles gambiae
24% identity, 27% coverage

Q4DKP1 Uncharacterized protein from Trypanosoma cruzi (strain CL Brener)
24% identity, 38% coverage

POP1_SCHPO / P87060 WD repeat-containing protein pop1; WD repeat-containing protein ste16 from Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast) (see 6 papers)
pop1 / RF|XP_001713146.1 cullin 1 adaptor protein Pop1 from Schizosaccharomyces pombe (see 6 papers)
27% identity, 36% coverage

SCONB_EMENI / Q00659 Probable E3 ubiquitin ligase complex SCF subunit sconB; Sulfur controller B; Sulfur metabolite repression control protein B from Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) (Aspergillus nidulans) (see paper)
28% identity, 31% coverage

WDR5B_ARATH / Q9SY00 COMPASS-like H3K4 histone methylase component WDR5B; AtWDR5B from Arabidopsis thaliana (Mouse-ear cress) (see 2 papers)
AT4G02730 transducin family protein / WD-40 repeat family protein from Arabidopsis thaliana
22% identity, 41% coverage

E9HMX3 F-box domain-containing protein from Daphnia pulex
26% identity, 35% coverage

RCO1_NEUCR / P78706 Transcriptional repressor rco-1 from Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) (see paper)
NCU06205, XP_962921 transcriptional repressor rco-1 from Neurospora crassa OR74A
28% identity, 31% coverage

L7M0T0 F-box domain-containing protein from Rhipicephalus pulchellus
25% identity, 35% coverage

MHCKB_DICDI / P90648 Myosin heavy chain kinase B; MHCK-B; EC 2.7.11.7 from Dictyostelium discoideum (Social amoeba) (see paper)
P90648 myosin-heavy-chain kinase (EC 2.7.11.7) from Dictyostelium discoideum (see 5 papers)
DDB_G0289115 Alpha kinase family protein from Dictyostelium discoideum AX4
26% identity, 38% coverage

TRCB_XENLA / Q91854 Beta-TrCP; Beta-transducin repeat-containing protein from Xenopus laevis (African clawed frog) (see paper)
NP_001081064 beta-TrCP from Xenopus laevis
25% identity, 36% coverage

B3KPF6 cDNA FLJ31732 fis, clone NT2RI2006856, highly similar to Striatin-4 from Homo sapiens
24% identity, 37% coverage

F7D3K4 Platelet-activating factor acetylhydrolase IB subunit alpha from Equus caballus
26% identity, 31% coverage

NP_524430 supernumerary limbs, isoform A from Drosophila melanogaster
Q9VDE3 LD08669p from Drosophila melanogaster
26% identity, 35% coverage

NCU06483 F-box and WD repeat-containing protein from Neurospora crassa OR74A
24% identity, 40% coverage

An15g00140 uncharacterized protein from Aspergillus niger
29% identity, 31% coverage

6rxtUL / G0RZL9 6rxtUL (see paper)
25% identity, 39% coverage

NLE1_SCHPO / O74855 Ribosome assembly protein 4; Notchless protein homolog 1; Ribosome biogenesis factor rsa4 from Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast) (see paper)
24% identity, 36% coverage

4j8gB / Q96WV5 Crystal structure of alpha-cop/e19 complex (see paper)
24% identity, 35% coverage

P46800 Small ribosomal subunit protein RACK1 from Dictyostelium discoideum
DDB_G0275045 hypothetical protein from Dictyostelium discoideum AX4
25% identity, 32% coverage

CPAR2_503400 uncharacterized protein from Candida parapsilosis
29% identity, 22% coverage

TOZ_ARATH / Q9LFE2 Protein TORMOZ EMBRYO DEFECTIVE from Arabidopsis thaliana (Mouse-ear cress) (see paper)
AT5G16750 TOZ (TORMOZ EMBRYO DEFECTIVE); nucleotide binding from Arabidopsis thaliana
NP_568338 Transducin family protein / WD-40 repeat family protein from Arabidopsis thaliana
23% identity, 26% coverage

3v7dB / P07834 Crystal structure of scskp1-sccdc4-psic1 peptide complex (see paper)
25% identity, 34% coverage

NLE1_ARATH / Q9FLX9 Notchless protein homolog from Arabidopsis thaliana (Mouse-ear cress) (see paper)
NP_200094 WD-40 repeat family protein / notchless protein from Arabidopsis thaliana
AT5G52820 WD-40 repeat family protein / notchless protein, putative from Arabidopsis thaliana
25% identity, 36% coverage

sll0163 beta transducin-like protein from Synechocystis sp. PCC 6803
23% identity, 20% coverage

rcoA transcriptional corepressor (Eurofung) from Emericella nidulans (see 3 papers)
28% identity, 31% coverage

FGSG_05038 nuclear distribution protein nudF from Fusarium graminearum PH-1
28% identity, 27% coverage

NP_572778 transport and golgi organization 4 from Drosophila melanogaster
23% identity, 47% coverage

NP_608501 uncharacterized protein, isoform A from Drosophila melanogaster
Q9VPL0 GM13767p from Drosophila melanogaster
26% identity, 38% coverage

F7FZQ6 Platelet-activating factor acetylhydrolase IB subunit alpha from Monodelphis domestica
23% identity, 41% coverage

Ava_3079 Peptidase C14, caspase catalytic subunit p20 from Anabaena variabilis ATCC 29413
25% identity, 15% coverage

FBW1A_HUMAN / Q9Y297 F-box/WD repeat-containing protein 1A; E3RSIkappaB; Epididymis tissue protein Li 2a; F-box and WD repeats protein beta-TrCP; pIkappaBalpha-E3 receptor subunit from Homo sapiens (Human) (see 44 papers)
NP_378663 F-box/WD repeat-containing protein 1A isoform 1 from Homo sapiens
24% identity, 36% coverage

AT2G43770 transducin family protein / WD-40 repeat family protein from Arabidopsis thaliana
NP_181905 Transducin/WD40 repeat-like superfamily protein from Arabidopsis thaliana
25% identity, 32% coverage

F2Z521 Platelet-activating factor acetylhydrolase IB subunit alpha from Sus scrofa
31% identity, 16% coverage

KTN84_ARATH / Q8H0T9 Katanin p80 WD40 repeat-containing subunit B1 homolog KTN80.4 from Arabidopsis thaliana (Mouse-ear cress) (see paper)
AT5G23430 transducin family protein / WD-40 repeat family protein from Arabidopsis thaliana
26% identity, 32% coverage

AFUA_2G14110 sulfur metabolite repression control protein SconB, putative from Aspergillus fumigatus Af293
30% identity, 31% coverage

CG3909, NP_649969 uncharacterized protein from Drosophila melanogaster
24% identity, 36% coverage

XP_006231593 F-box/WD repeat-containing protein 1A isoform X2 from Rattus norvegicus
25% identity, 39% coverage

C6KSR5 Coatomer alpha subunit, putative from Plasmodium falciparum (isolate 3D7)
25% identity, 18% coverage

orf19.3778 putative uncharacterized protein from Candida albicans (see paper)
25% identity, 33% coverage

D6WA15 Supernumerary limbs from Tribolium castaneum
24% identity, 39% coverage

Afu6g13030 cell division control protein Cdc4, putative from Aspergillus fumigatus Af293
28% identity, 23% coverage

LIS1_DROME / Q7KNS3 Lissencephaly-1 homolog; DLis-1; Dlis1; Lissencephaly1 from Drosophila melanogaster (Fruit fly) (see 6 papers)
NP_477160 Lissencephaly-1, isoform A from Drosophila melanogaster
32% identity, 16% coverage

all0284 WD-40 repeat protein from Nostoc sp. PCC 7120
24% identity, 16% coverage

NP_001263303 WD repeat-containing protein 38 isoform 1 from Homo sapiens
24% identity, 37% coverage

FBW1A_MOUSE / Q3ULA2 F-box/WD repeat-containing protein 1A; Beta-TrCP protein E3RS-IkappaB; Beta-transducin repeat-containing protein; Beta-TrCP; E3RSIkappaB; mE3RS-IkappaB; F-box and WD repeats protein beta-TrCP; HOS; Ubiquitin ligase FWD1; pIkappaB-E3 receptor subunit from Mus musculus (Mouse) (see 13 papers)
NP_001032847 F-box/WD repeat-containing protein 1A isoform a from Mus musculus
24% identity, 35% coverage

G3I574 F-box/WD repeat-containing protein 1A from Cricetulus griseus
24% identity, 36% coverage

COPA_YEAST / P53622 Coatomer subunit alpha; Alpha-coat protein; Alpha-COP; Retrieval from endoplasmic reticulum protein 1; Secretory protein 22; Suppressor of osmo-sensitivity 1 from Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (see 3 papers)
NP_010136 coatomer subunit alpha from Saccharomyces cerevisiae S288C
YDL145C Alpha subunit of COPI vesicle coatomer complex, which surrounds transport vesicles in the early secretory pathway from Saccharomyces cerevisiae
22% identity, 23% coverage

MHCKD_DICDI / Q54SF9 Myosin heavy chain kinase D; MHCK-D; EC 2.7.11.7 from Dictyostelium discoideum (Social amoeba) (see paper)
26% identity, 25% coverage

Tery_0059 serine/threonine protein kinase with WD40 repeats from Trichodesmium erythraeum IMS101
27% identity, 33% coverage

G6FXC6 WD40 repeat-containing protein from Fischerella thermalis JSC-11
25% identity, 22% coverage

O76734 General transcriptional corepressor tupA from Dictyostelium discoideum
29% identity, 31% coverage

Tery_2471 peptidase C14, caspase catalytic subunit p20 from Trichodesmium erythraeum IMS101
24% identity, 22% coverage

KTN83_ARATH / F4KB17 Katanin p80 WD40 repeat-containing subunit B1 homolog KTN80.3 from Arabidopsis thaliana (Mouse-ear cress) (see paper)
AT5G08390 hypothetical protein from Arabidopsis thaliana
25% identity, 25% coverage

Q177S9 Beta'-coat protein (Fragment) from Aedes aegypti
24% identity, 27% coverage

8ro1T / G5EEL2 8ro1T (see paper)
27% identity, 21% coverage

CNAG_02153 glucose repression regulatory protein TUP1 from Cryptococcus neoformans var. grubii H99
24% identity, 35% coverage

NP_001185277 Transducin/WD40 repeat-like superfamily protein from Arabidopsis thaliana
24% identity, 21% coverage

KTN82_ARATH / F4HTH8 Katanin p80 WD40 repeat-containing subunit B1 homolog KTN80.2; DDB1 binding WD40 hypersensitive to ABA 3; DWD hypersensitive to ABA 3 from Arabidopsis thaliana (Mouse-ear cress) (see 2 papers)
24% identity, 20% coverage

AT1G61210 WD-40 repeat family protein / katanin p80 subunit, putative from Arabidopsis thaliana
24% identity, 21% coverage

XP_001657383 coatomer subunit beta' isoform X2 from Aedes aegypti
23% identity, 27% coverage

KTNB1_MOUSE / Q8BG40 Katanin p80 WD40 repeat-containing subunit B1; Katanin p80 subunit B1; p80 katanin from Mus musculus (Mouse) (see 3 papers)
Q8BG40 microtubule-severing ATPase (subunit 1/2) (EC 5.6.1.1) from Mus musculus (see paper)
XP_006531475 katanin p80 WD40 repeat-containing subunit B1 isoform X1 from Mus musculus
26% identity, 29% coverage

XP_570974 general transcriptional repressor, putative from Cryptococcus neoformans var. neoformans JEC21
XP_570974 general transcriptional repressor from Cryptococcus neoformans var. neoformans JEC21
24% identity, 35% coverage

KTNB1_HUMAN / Q9BVA0 Katanin p80 WD40 repeat-containing subunit B1; Katanin p80 subunit B1; p80 katanin from Homo sapiens (Human) (see 8 papers)
NP_005877 katanin p80 WD40 repeat-containing subunit B1 from Homo sapiens
26% identity, 29% coverage

Q4VFZ4 Katanin p80 WD40 repeat-containing subunit B1 from Rattus norvegicus
26% identity, 29% coverage

XP_017456690 katanin p80 WD40 repeat-containing subunit B1 isoform X2 from Rattus norvegicus
26% identity, 29% coverage

KTNB1_XENLA / Q4V7Y7 Katanin p80 WD40 repeat-containing subunit B1; Katanin p80 subunit B1; p80 katanin from Xenopus laevis (African clawed frog) (see paper)
30% identity, 21% coverage

CDC4_YEAST / P07834 Cell division control protein 4; E3 ubiquitin ligase complex SCF subunit CDC4; F-box protein CDC4 from Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) (see 15 papers)
NP_116585 SCF ubiquitin ligase complex subunit CDC4 from Saccharomyces cerevisiae S288C
YFL009W Cdc4p from Saccharomyces cerevisiae
25% identity, 30% coverage

C0P165 Transcriptional repressor Tup1 N-terminal domain-containing protein from Ajellomyces capsulatus (strain G186AR / H82 / ATCC MYA-2454 / RMSCC 2432)
28% identity, 31% coverage

AFUA_5G13140 WD repeat-containing protein from Aspergillus fumigatus Af293
26% identity, 35% coverage

WDR5A_ARATH / Q9M2Z2 COMPASS-like H3K4 histone methylase component WDR5A; AtWDR5A from Arabidopsis thaliana (Mouse-ear cress) (see 3 papers)
NP_190535 Transducin/WD40 repeat-like superfamily protein from Arabidopsis thaliana
AT3G49660 transducin family protein / WD-40 repeat family protein from Arabidopsis thaliana
25% identity, 32% coverage

LIS1_EMENI / Q00664 Nuclear distribution protein nudF; Lissencephaly-1 homolog; LIS-1; Nuclear migration protein nudF from Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) (Aspergillus nidulans) (see 6 papers)
nudF / GB|AAA91301.1 nuclear distribution protein nudF from Emericella nidulans (see 5 papers)
XP_663801 dynein regulator nudF from Aspergillus nidulans FGSC A4
26% identity, 31% coverage

TUP11_SCHPO / Q09715 Transcriptional repressor tup11 from Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast) (see 3 papers)
tup11 transcriptional corepressor Tup11 from Schizosaccharomyces pombe (see 5 papers)
NP_592873 transcriptional corepressor Tup11 from Schizosaccharomyces pombe
NP_592873, SPAC18B11.10 transcriptional corepressor Tup11 from Schizosaccharomyces pombe
25% identity, 37% coverage

Npun_R6612 WD-40 repeat-containing protein from Nostoc punctiforme
26% identity, 23% coverage

CNA06710 ubiquitin-protein ligase from Cryptococcus neoformans var. neoformans JEC21
23% identity, 30% coverage

PRP46_SCHPO / O13615 Pre-mRNA-splicing factor prp5; Complexed with cdc5 protein 1; Pre-mRNA-processing protein 5 from Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast) (see 3 papers)
prp5 / GI|13810226 WD repeat protein Prp5 from Schizosaccharomyces pombe (see 3 papers)
SPBP22H7.07 WD repeat protein Prp5 from Schizosaccharomyces pombe
29% identity, 31% coverage

B2RQS1 Striatin, calmodulin binding protein 3 from Mus musculus
NP_001165569 striatin-3 isoform 2 from Mus musculus
25% identity, 37% coverage

TTHERM_00497660 katanin con80 domain protein from Tetrahymena thermophila SB210
28% identity, 25% coverage

P58405 Striatin-3 from Rattus norvegicus
NP_001025068 striatin-3 from Rattus norvegicus
25% identity, 35% coverage

7ajtUA / P25635 structure of the 90S-exosome super-complex (state Pre-A1-exosome) (see paper)
24% identity, 39% coverage

6lqsB1 / P25635 structure of 90S small subunit preribosomes in transition states (State D) (see paper)
24% identity, 40% coverage

XP_001684560 activated protein kinase c receptor (LACK) from Leishmania major strain Friedlin
Q4Q7Y7 Activated protein kinase c receptor from Leishmania major
XP_001684561 activated protein kinase c receptor (LACK) from Leishmania major strain Friedlin
31% identity, 19% coverage

Q76LS6 LACK from Leishmania donovani
P62884 Small ribosomal subunit protein RACK1 from Leishmania infantum
LDBPK_282970 activated protein kinase c receptor (LACK) from Leishmania donovani
31% identity, 19% coverage

STRN3_MOUSE / Q9ERG2 Striatin-3; Cell cycle autoantigen SG2NA; S/G2 antigen from Mus musculus (Mouse) (see paper)
25% identity, 35% coverage

XP_416167 apoptotic protease-activating factor 1 isoform X1 from Gallus gallus
27% identity, 19% coverage

TTHERM_01345820 coatomer alpha subunit, putative from Tetrahymena thermophila SB210
22% identity, 22% coverage

Q9BIJ5 LACK protective antigen from Leishmania donovani
31% identity, 19% coverage

5t2a7 / Q9BIJ5 5t2a7 (see paper)
31% identity, 19% coverage

D2XMQ7 Beta-TCRP E3 ligase from Saccoglossus kowalevskii
23% identity, 33% coverage

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 789,361 different protein sequences to 1,256,019 scientific articles. Searches against EuropePMC were last performed on January 10, 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.
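The E-value cutoff described above can be illustrated with a minimal sketch. It assumes the BLAST results are available in the standard tabular format (`-outfmt 6`, where the E-value is the 11th column); the function name `filter_hits` and the example rows are hypothetical and are not taken from PaperBLAST's code.

```python
import csv
import io

E_VALUE_CUTOFF = 1e-3  # the "E < 0.001" threshold stated in the text


def filter_hits(tabular_text, cutoff=E_VALUE_CUTOFF):
    """Keep hits whose E-value is strictly below the cutoff.

    Assumes BLAST tabular output with the default columns:
    qseqid sseqid pident length mismatch gapopen
    qstart qend sstart send evalue bitscore
    """
    hits = []
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        evalue = float(row[10])
        if evalue < cutoff:
            hits.append({
                "subject": row[1],
                "percent_identity": float(row[2]),
                "evalue": evalue,
            })
    return hits


# Made-up rows for illustration only (not real PaperBLAST output).
example = (
    "query1\thitA\t35.0\t700\t400\t20\t1\t745\t1\t680\t1e-50\t250\n"
    "query1\thitB\t24.0\t150\t100\t5\t10\t160\t30\t180\t0.5\t30\n"
)
kept = filter_hits(example)
print([h["subject"] for h in kept])
```

Only the first made-up row passes, because its E-value is below 0.001 while the second row's is not.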

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)
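As a rough illustration of the "locus_tag AND genus_name" query form, the sketch below builds such a query string and a search URL. The endpoint and the `query` and `format` parameters are assumed to match Europe PMC's public REST search service; whether PaperBLAST issues its queries this way is not stated in the text, and the helper names are hypothetical.

```python
from urllib.parse import urlencode

# Assumption: Europe PMC's public REST search endpoint.
EUROPEPMC_SEARCH = "https://www.ebi.ac.uk/europepmc/webservices/rest/search"


def build_query(locus_tag, organism):
    """Combine a locus tag with the genus (first word of the organism name)."""
    genus_name = organism.split()[0]
    return f"{locus_tag} AND {genus_name}"


def build_search_url(locus_tag, organism):
    """Return a URL-encoded search URL; no network request is made here."""
    params = {"query": build_query(locus_tag, organism), "format": "json"}
    return EUROPEPMC_SEARCH + "?" + urlencode(params)


url = build_search_url("AT3G05090", "Arabidopsis thaliana")
print(url)
```

Requiring the genus name alongside the locus tag narrows the results to papers that plausibly discuss the gene in that organism, which is the purpose the text gives for this query form.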

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.
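One simple way to picture snippet selection is sketched below: take up to two windows of text around mentions of the identifier, preferring the full text and falling back to the abstract. This is an assumption-laden illustration; PaperBLAST's actual heuristics are not described here, and the function names, the window size, and the whole-word matching rule are all hypothetical.

```python
import re


def select_snippets(text, identifier, max_snippets=2, context=100):
    """Return up to max_snippets windows of text around the identifier."""
    pattern = re.compile(r"\b" + re.escape(identifier) + r"\b")
    snippets = []
    for match in pattern.finditer(text):
        start = max(0, match.start() - context)
        end = min(len(text), match.end() + context)
        snippets.append(text[start:end].strip())
        if len(snippets) == max_snippets:
            break
    return snippets


def snippets_for_article(identifier, full_text=None, abstract=None):
    """Prefer the full text; fall back to the abstract if it is unavailable."""
    source = full_text if full_text is not None else (abstract or "")
    return select_snippets(source, identifier)


# Made-up text for illustration only.
full_text = "We characterized AT3G05090 in seedlings. " * 3
print(snippets_for_article("AT3G05090", full_text=full_text))
print(snippets_for_article("AT3G05090", abstract="No identifier appears here."))
```

The second call returns an empty list, which mirrors the point made above: abstracts rarely contain unique identifiers such as locus tags, so the fallback often finds nothing.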

PaperBLAST also incorporates manually-curated protein functions from the curated sources listed above.

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

PaperBLAST has changed since the paper was written. Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's web site and is available in PubMed Central or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifiers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances.
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory