PaperBLAST – Find papers about a protein or its homologs

PaperBLAST

PaperBLAST Hits for 58 a.a. (RPRTTITAKQ...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmembrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Found 264 similar proteins in the literature:

NP_001179714 LIM/homeobox protein Lhx4 from Bos taurus
100% identity, 15% coverage

LHX4_HUMAN / Q969G2 LIM/homeobox protein Lhx4; LIM homeobox protein 4 from Homo sapiens (Human) (see 4 papers)
100% identity, 15% coverage

5hodA / Q969G2 Structure of lhx4 transcription factor complexed with DNA (see paper)
100% identity, 95% coverage

NP_001116445 LIM/homeobox protein Lhx4 from Danio rerio
98% identity, 15% coverage

XP_063128548 LIM/homeobox protein Lhx4 isoform X1 from Rattus norvegicus
100% identity, 15% coverage

XP_015145978 LIM/homeobox protein Lhx4 from Gallus gallus
100% identity, 13% coverage

XP_011508407 LIM/homeobox protein Lhx4 isoform X1 from Homo sapiens
100% identity, 18% coverage

XP_006529232 LIM/homeobox protein Lhx4 isoform X1 from Mus musculus
100% identity, 18% coverage

LHX3_XENLA / P36200 LIM/homeobox protein Lhx3; LIM homeobox protein 3; Homeobox protein LIM-3; xLIM-3 from Xenopus laevis (African clawed frog) (see paper)
98% identity, 15% coverage

LHX3_CHICK / P53412 LIM/homeobox protein Lhx3; LIM homeobox protein 3; Homeobox protein LIM-3 from Gallus gallus (Chicken) (see paper)
98% identity, 15% coverage

NP_571283 LIM/homeobox protein Lhx3 from Danio rerio
97% identity, 15% coverage

Q9VJ02 Lim3, isoform A from Drosophila melanogaster
93% identity, 13% coverage

NP_001246088 Lim3, isoform F from Drosophila melanogaster
NP_724161 Lim3, isoform B from Drosophila melanogaster
93% identity, 11% coverage

M9PD53 Lim3, isoform G from Drosophila melanogaster
93% identity, 10% coverage

XP_005213396 LIM/homeobox protein Lhx3 isoform X1 from Bos taurus
95% identity, 14% coverage

LHX3_PIG / O97581 LIM/homeobox protein Lhx3; LIM homeobox protein 3; Homeobox protein LIM-3; Homeobox protein P-LIM from Sus scrofa (Pig) (see paper)
95% identity, 15% coverage

LHX3_HALRO / Q25132 LIM/homeobox protein Lhx3; Hr-Lhx3; LIM homeobox protein 3; LIM/homeobox protein LIM; HrLIM from Halocynthia roretzi (Sea squirt) (Cynthia roretzi) (see 2 papers)
93% identity, 8% coverage

NP_001290229 LIM/homeobox protein Lhx3 from Sus scrofa
95% identity, 14% coverage

NP_001184116 LIM/homeobox protein Lhx3 from Canis lupus familiaris
95% identity, 14% coverage

LHX3_MOUSE / P50481 LIM/homeobox protein Lhx3; LIM homeobox protein 3; Homeobox protein LIM-3; Homeobox protein P-LIM from Mus musculus (Mouse) (see 7 papers)
NP_034841 LIM/homeobox protein Lhx3 isoform b from Mus musculus
95% identity, 14% coverage

LHX3_HUMAN / Q9UBR4 LIM/homeobox protein Lhx3; LIM homeobox protein 3 from Homo sapiens (Human) (see 5 papers)
NP_835258 LIM/homeobox protein Lhx3 isoform a from Homo sapiens
95% identity, 15% coverage

NP_001158395 lim domain homeobox 3/4 transcription factor from Saccoglossus kowalevskii
95% identity, 15% coverage

HM14_CAEEL / P20271 LIM/homeobox protein ceh-14; Homeobox protein ceh-14 from Caenorhabditis elegans (see 6 papers)
NP_509273 LIM/homeobox protein ceh-14 from Caenorhabditis elegans
88% identity, 17% coverage

Q9V472 DLim1 from Drosophila melanogaster
NP_572505 LIM homeobox 1, isoform A from Drosophila melanogaster
70% identity, 11% coverage

LHX1_DANRE / Q90476 LIM/homeobox protein Lhx1; LIM homeobox protein 1; Homeobox protein Lim-1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_571291 LIM/homeobox protein Lhx1 from Danio rerio
74% identity, 14% coverage

LIN11_CAEEL / P20154 Protein lin-11; Abnormal cell lineage protein 11 from Caenorhabditis elegans (see 4 papers)
NP_492696 Protein lin-11 from Caenorhabditis elegans
75% identity, 14% coverage

NP_571293 LIM/homeobox protein Lhx5 from Danio rerio
74% identity, 14% coverage

LHX1_MOUSE / P63006 LIM/homeobox protein Lhx1; LIM homeobox protein 1; Homeobox protein Lim-1 from Mus musculus (Mouse) (see 5 papers)
LHX1_RAT / P63007 LIM/homeobox protein Lhx1; LIM homeobox protein 1; Homeobox protein Lim-1; Rlim from Rattus norvegicus (Rat) (see 2 papers)
NP_032524 LIM/homeobox protein Lhx1 from Mus musculus
74% identity, 14% coverage

LHX1_HUMAN / P48742 LIM/homeobox protein Lhx1; LIM homeobox protein 1; Homeobox protein Lim-1; hLim-1 from Homo sapiens (Human) (see paper)
NP_005559 LIM/homeobox protein Lhx1 from Homo sapiens
74% identity, 14% coverage

LHX1_CHICK / P53411 LIM/homeobox protein Lhx1; LIM homeobox protein 1; Homeobox protein Lim-1 from Gallus gallus (Chicken) (see paper)
74% identity, 14% coverage

LHX1_XENLA / P29674 LIM/homeobox protein Lhx1; LIM homeobox protein 1; Homeobox protein Lim-1; Xlim1; x-Lhx1; xLIM-1 from Xenopus laevis (African clawed frog) (see 23 papers)
NP_001084128 LIM/homeobox protein Lhx1 from Xenopus laevis
74% identity, 14% coverage

NP_032525 LIM/homeobox protein Lhx5 from Mus musculus
74% identity, 14% coverage

NP_001259358 LIM homeobox 1, isoform B from Drosophila melanogaster
M9PJE4 LIM homeobox 1, isoform B from Drosophila melanogaster
70% identity, 16% coverage

NP_071758 LIM/homeobox protein Lhx5 from Homo sapiens
Q9H2C1 LIM/homeobox protein Lhx5 from Homo sapiens
74% identity, 14% coverage

LHX5_XENLA / P37137 LIM/homeobox protein Lhx5; LIM homeobox protein 5; Homeobox protein LIM-5; xLIM-5; xLIM-2A from Xenopus laevis (African clawed frog) (see paper)
NP_001084038 LIM/homeobox protein Lhx5 from Xenopus laevis
74% identity, 14% coverage

MEC3_CAEEL / P09088 Mechanosensory protein 3 from Caenorhabditis elegans (see 5 papers)
NP_001023111 Mechanosensory protein 3 from Caenorhabditis elegans
61% identity, 18% coverage

NP_729801 LIM homeobox transcription factor 1 alpha, isoform B from Drosophila melanogaster
Q9VTW5 LIM homeobox transcription factor 1 alpha, isoform B from Drosophila melanogaster
55% identity, 9% coverage

LIM6_CAEEL / Q21192 LIM/homeobox protein lim-6 from Caenorhabditis elegans (see 6 papers)
NP_001256980 LIM/homeobox protein lim-6 from Caenorhabditis elegans
61% identity, 18% coverage

LMX1A_HUMAN / Q8TE12 LIM homeobox transcription factor 1-alpha; LIM/homeobox protein 1.1; LMX-1.1; LIM/homeobox protein LMX1A from Homo sapiens (Human) (see 2 papers)
NP_001167540 LIM homeobox transcription factor 1-alpha from Homo sapiens
NP_796372 LIM homeobox transcription factor 1-alpha from Homo sapiens
58% identity, 15% coverage

NP_001020339 LIM homeobox transcription factor 1, beta a isoform 2 from Danio rerio
58% identity, 15% coverage

XP_017454198 LIM homeobox transcription factor 1-alpha isoform X1 from Rattus norvegicus
58% identity, 15% coverage

Q9JKU8 LIM homeobox transcription factor 1-alpha from Mus musculus
XP_017177173 LIM homeobox transcription factor 1-alpha isoform X1 from Mus musculus
58% identity, 15% coverage

XP_006497809 LIM homeobox transcription factor 1-beta isoform X1 from Mus musculus
58% identity, 14% coverage

LMX1B_HUMAN / O60663 LIM homeobox transcription factor 1-beta; LIM/homeobox protein 1.2; LMX-1.2; LIM/homeobox protein LMX1B from Homo sapiens (Human) (see 12 papers)
58% identity, 14% coverage

NP_001167617 LIM homeobox transcription factor 1-beta isoform 3 from Homo sapiens
58% identity, 14% coverage

8ik5C / Q8TE12 Transcription factor lmx1a homeobox domain in complex with wnt1 promoter
58% identity, 85% coverage

O88609 LIM homeobox transcription factor 1-beta from Mus musculus
58% identity, 14% coverage

P53413 LIM/homeobox protein LMX-1.2 from Gallus gallus
NP_990689 LIM/homeobox protein LMX-1.2 from Gallus gallus
58% identity, 14% coverage

LMX1B_XENLA / Q8UVR3 LIM homeobox transcription factor 1-beta.1; LIM homeobox protein 1b; Xlmx1b from Xenopus laevis (African clawed frog) (see 3 papers)
58% identity, 14% coverage

XP_018087415 LIM homeobox transcription factor 1-beta.1 isoform X1 from Xenopus laevis
58% identity, 15% coverage

CG4328, NP_648567 uncharacterized protein from Drosophila melanogaster
Q9VTW3 FI06571p from Drosophila melanogaster
53% identity, 9% coverage

LOC100644486 insulin gene enhancer protein ISL-1 from Bombus terrestris
48% identity, 12% coverage

HESXB_XENLA / Q91898 Homeobox expressed in ES cells 1-B; Homeobox protein ANF-1; XANF-1; Xanf1 from Xenopus laevis (African clawed frog) (see 9 papers)
NP_001156042 homeobox expressed in ES cells 1-B from Xenopus laevis
50% identity, 30% coverage

ISL2A_DANRE / P53406 Insulin gene enhancer protein isl-2a; Islet-2A; Insulin gene enhancer protein isl-2; Islet-2 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_571045 insulin gene enhancer protein isl-2a from Danio rerio
48% identity, 16% coverage

ISL2B_DANRE / P53407 Insulin gene enhancer protein isl-2b; Islet-2B; Insulin gene enhancer protein isl-3; Islet-3 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_571039 insulin gene enhancer protein isl-2b from Danio rerio
48% identity, 16% coverage

HESXA_XENLA / Q91617 Homeobox expressed in ES cells 1-A; Homeobox protein ANF-2; xANF-2 from Xenopus laevis (African clawed frog) (see paper)
50% identity, 31% coverage

NP_001158279 tailup from Tribolium castaneum
48% identity, 13% coverage

XP_005165321 insulin gene enhancer protein isl-1 isoform X1 from Danio rerio
48% identity, 16% coverage

ISL1_DANRE / P53405 Insulin gene enhancer protein isl-1; Islet-1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
48% identity, 16% coverage

ISL1_CHICK / P50211 Insulin gene enhancer protein ISL-1; Islet-1 from Gallus gallus (Chicken) (see paper)
NP_990745 insulin gene enhancer protein ISL-1 from Gallus gallus
48% identity, 16% coverage

ISL1_HUMAN / P61371 Insulin gene enhancer protein ISL-1; Islet-1 from Homo sapiens (Human) (see 2 papers)
ISL1_MOUSE / P61372 Insulin gene enhancer protein ISL-1; Islet-1 from Mus musculus (Mouse) (see 7 papers)
ISL1_MESAU / P61373 Insulin gene enhancer protein ISL-1; Islet-1 from Mesocricetus auratus (Golden hamster) (see paper)
ISL1_RAT / P61374 Insulin gene enhancer protein ISL-1; Islet-1 from Rattus norvegicus (Rat) (see 2 papers)
NP_059035 insulin gene enhancer protein ISL-1 from Rattus norvegicus
NP_002193 insulin gene enhancer protein ISL-1 from Homo sapiens
XP_004017051 insulin gene enhancer protein ISL-1 from Ovis aries
48% identity, 16% coverage

XP_006510819 insulin gene enhancer protein ISL-2 isoform X1 from Mus musculus
48% identity, 15% coverage

NP_001104188 ISL LIM homeobox 1 S homeolog from Xenopus laevis
48% identity, 16% coverage

ISL2_MOUSE / Q9CXV0 Insulin gene enhancer protein ISL-2; Islet-2 from Mus musculus (Mouse) (see paper)
48% identity, 16% coverage

T265_11894 hypothetical protein from Opisthorchis viverrini
44% identity, 22% coverage

NP_001016712 homeobox expressed in ES cells 1 from Xenopus tropicalis
46% identity, 30% coverage

B5LDT8 Lim1 (Fragment) from Trichoplax adhaerens
48% identity, 93% coverage

NP_476775 tailup, isoform A from Drosophila melanogaster
Q9VJ37 Tailup, isoform A from Drosophila melanogaster
46% identity, 10% coverage

P79775 Homeobox protein ANF-1 from Gallus gallus
46% identity, 29% coverage

XP_068078243 zinc finger homeobox protein 3 isoform X1 from Danio rerio
50% identity, 1% coverage

NP_001158238 zinc finger homeobox protein 3 isoform B from Homo sapiens
50% identity, 2% coverage

XP_006530648 zinc finger homeobox protein 3 isoform X1 from Mus musculus
50% identity, 1% coverage

ZFHX3_MOUSE / Q61329 Zinc finger homeobox protein 3; AT motif-binding factor 1; AT-binding transcription factor 1; Alpha-fetoprotein enhancer-binding protein; Zinc finger homeodomain protein 3; ZFH-3 from Mus musculus (Mouse) (see 5 papers)
50% identity, 1% coverage

ZFHX3_HUMAN / Q15911 Zinc finger homeobox protein 3; AT motif-binding factor 1; AT-binding transcription factor 1; Alpha-fetoprotein enhancer-binding protein; Zinc finger homeodomain protein 3; ZFH-3 from Homo sapiens (Human) (see 14 papers)
50% identity, 1% coverage

ZFHX2_HUMAN / Q9C0A1 Zinc finger homeobox protein 2; Zinc finger homeodomain protein 2; ZFH-2 from Homo sapiens (Human) (see paper)
50% identity, 2% coverage

ZFHX4_MOUSE / Q9JJN2 Zinc finger homeobox protein 4; Zinc finger homeodomain protein 4; ZFH-4 from Mus musculus (Mouse) (see 2 papers)
50% identity, 2% coverage

ZFHX4_HUMAN / Q86UP3 Zinc finger homeobox protein 4; Zinc finger homeodomain protein 4; ZFH-4 from Homo sapiens (Human) (see paper)
50% identity, 2% coverage

XP_036019316 zinc finger homeobox protein 4 isoform X1 from Mus musculus
50% identity, 2% coverage

XP_010816072 homeobox expressed in ES cells 1 isoform X4 from Bos taurus
45% identity, 31% coverage

ZFHX2_MOUSE / Q2MHN3 Zinc finger homeobox protein 2; Zinc finger homeodomain protein 5 from Mus musculus (Mouse) (see 3 papers)
50% identity, 2% coverage

HESX1_HUMAN / Q9UBX0 Homeobox expressed in ES cells 1; Homeobox protein ANF; hAnf from Homo sapiens (Human) (see 6 papers)
XP_005265583 homeobox expressed in ES cells 1 isoform X1 from Homo sapiens
45% identity, 30% coverage

P28167 Zinc finger protein 2 from Drosophila melanogaster
48% identity, 2% coverage

NP_001245425 Zn finger homeodomain 2, isoform B from Drosophila melanogaster
48% identity, 2% coverage

D4AEG9 Homeobox expressed in ES cells 1 from Rattus norvegicus
45% identity, 30% coverage

HESX1_MOUSE / Q61658 Homeobox expressed in ES cells 1; Anterior-restricted homeobox protein; Homeobox protein ANF; Rathke pouch homeo box from Mus musculus (Mouse) (see 2 papers)
45% identity, 30% coverage

L8HQC4 Zinc finger homeobox protein 2 from Bos mutus
50% identity, 2% coverage

LOC118272755 zinc finger homeobox protein 3 from Spodoptera frugiperda
48% identity, 2% coverage

XP_047287780 zinc finger homeobox protein 2 isoform X5 from Homo sapiens
50% identity, 2% coverage

GSBN_DROME / P09083 Protein gooseberry-neuro; BSH4; Protein gooseberry proximal from Drosophila melanogaster (Fruit fly) (see paper)
NP_523862 gooseberry-neuro from Drosophila melanogaster
45% identity, 12% coverage

FGSG_09019 hypothetical protein from Fusarium graminearum PH-1
49% identity, 8% coverage

NP_001025475 homeobox protein aristaless-like 4 from Bos taurus
48% identity, 14% coverage

NP_001073284 mix-type homeobox gene 2 from Danio rerio
51% identity, 18% coverage

NP_001009767 homeobox protein prophet of Pit-1 from Ovis aries
46% identity, 25% coverage

ALX1_DANRE / Q1LVQ7 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_001038539 ALX homeobox protein 1 from Danio rerio
53% identity, 17% coverage

ALX4_MOUSE / O35137 Homeobox protein aristaless-like 4; ALX-4 from Mus musculus (Mouse) (see paper)
NP_031468 homeobox protein aristaless-like 4 from Mus musculus
48% identity, 14% coverage

XP_006501135 LIM/homeobox protein Lhx8 isoform X1 from Mus musculus
41% identity, 17% coverage

NP_571015 homeobox protein MIXL1 from Danio rerio
45% identity, 17% coverage

O35652 LIM/homeobox protein Lhx8 from Mus musculus
41% identity, 15% coverage

8osbE / Q9H161 Twist1-tcf4-alx4 complex on specific DNA (see paper)
51% identity, 85% coverage

NP_001088995 paired box 7 L homeolog from Xenopus laevis
46% identity, 11% coverage

Q68G74 LIM/homeobox protein Lhx8 from Homo sapiens
NP_001001933 LIM/homeobox protein Lhx8 isoform 1 from Homo sapiens
41% identity, 16% coverage

DUXA_HUMAN / A6NLW8 Double homeobox protein A from Homo sapiens (Human) (see paper)
47% identity, 26% coverage

MIXL1_CHICK / O73592 Homeobox protein MIXL1; Homeodomain protein MIX; cMIX; MIX1 homeobox-like protein 1; Mix.1 homeobox-like protein from Gallus gallus (Chicken) (see 2 papers)
46% identity, 26% coverage

XP_018080096 paired box 7 L homeolog isoform X1 from Xenopus laevis
46% identity, 11% coverage

Q8MJI9 Homeobox protein prophet of Pit-1 from Bos taurus
46% identity, 25% coverage

AWH_DROME / Q8IRC7 LIM/homeobox protein Awh; Protein arrowhead from Drosophila melanogaster (Fruit fly) (see paper)
NP_728906 arrowhead, isoform B from Drosophila melanogaster
45% identity, 20% coverage

XP_015327641 homeobox protein prophet of Pit-1 isoform X1 from Bos taurus
46% identity, 25% coverage

NP_523907 arrowhead, isoform A from Drosophila melanogaster
45% identity, 26% coverage

PAX3H_CAEEL / G5ED66 Paired box protein 3 homolog from Caenorhabditis elegans (see paper)
NP_496189 Paired box protein 3 homolog from Caenorhabditis elegans
43% identity, 18% coverage

ZAG1_CAEEL / G5EBU4 Zinc finger E-box-binding homeobox protein zag-1; Zinc finger involved in axon guidance 1; ZAG-1 from Caenorhabditis elegans (see 4 papers)
NP_500424 Zinc finger E-box-binding homeobox protein zag-1 from Caenorhabditis elegans
41% identity, 9% coverage

Q91574 ALX homeobox protein 1 from Xenopus laevis
53% identity, 16% coverage

UNC42_CAEEL / L8E946 Homeobox protein unc-42; Uncoordinated protein 42 from Caenorhabditis elegans (see 5 papers)
48% identity, 20% coverage

NP_505519 Homeobox protein unc-42 from Caenorhabditis elegans
48% identity, 21% coverage

3a01F / Q06453 Crystal structure of aristaless and clawless homeodomains bound to dna (see paper)
49% identity, 87% coverage

XP_001340966 homeobox protein aristaless-like 4 from Danio rerio
48% identity, 15% coverage

LIM7_CAEEL / G5EC36 LIM/homeobox protein lim-7 from Caenorhabditis elegans (see 2 papers)
NP_491668 LIM/homeobox protein lim-7 from Caenorhabditis elegans
41% identity, 12% coverage

XP_001946004 LIM/homeobox protein Lhx9 from Acyrthosiphon pisum
46% identity, 10% coverage

ALX1_GEOFO / P0DMV5 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Geospiza fortis (Medium ground-finch) (see paper)
53% identity, 16% coverage

ALX1_HUMAN / Q15699 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Homo sapiens (Human) (see 4 papers)
NP_008913 ALX homeobox protein 1 from Homo sapiens
53% identity, 16% coverage

ALX1_MOUSE / Q8C8B0 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Mus musculus (Mouse) (see 2 papers)
NP_766141 ALX homeobox protein 1 isoform 1 from Mus musculus
53% identity, 16% coverage

ALX1_RAT / Q63087 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Rattus norvegicus (Rat) (see 4 papers)
53% identity, 16% coverage

M9PEI5 Arrowhead, isoform C from Drosophila melanogaster
45% identity, 20% coverage

NP_726006 retinal homeobox from Drosophila melanogaster
48% identity, 6% coverage

NP_001001263 homeobox protein prophet of Pit-1 from Sus scrofa
46% identity, 24% coverage

P97458 Homeobox protein prophet of Pit-1 from Mus musculus
NP_032962 homeobox protein prophet of Pit-1 from Mus musculus
46% identity, 25% coverage

MIXL1_HUMAN / Q9H2W2 Homeobox protein MIXL1; Homeodomain protein MIX; hMix; MIX1 homeobox-like protein 1; Mix.1 homeobox-like protein from Homo sapiens (Human) (see 4 papers)
45% identity, 24% coverage

NP_001284898 visual system homeobox 1, isoform B from Drosophila melanogaster
51% identity, 6% coverage

1fjlA / P06601 Homeodomain from the drosophila paired protein bound to a DNA oligonucleotide (see paper)
45% identity, 82% coverage

Q6BDC3 LIM homeobox 8 from Danio rerio
41% identity, 17% coverage

PROP1_HUMAN / O75360 Homeobox protein prophet of Pit-1; PROP-1; Pituitary-specific homeodomain factor from Homo sapiens (Human) (see 7 papers)
NP_006252 homeobox protein prophet of Pit-1 from Homo sapiens
46% identity, 25% coverage

LHX6_MOUSE / Q9R1R0 LIM/homeobox protein Lhx6; LIM homeobox protein 6; LIM/homeobox protein Lhx6.1 from Mus musculus (Mouse) (see 8 papers)
41% identity, 15% coverage

R7TKD0 Transcription factor Pax3/7 (Fragment) from Capitella teleta
47% identity, 21% coverage

XP_006234128 LIM/homeobox protein Lhx6 isoform X1 from Rattus norvegicus
41% identity, 15% coverage

NP_001098373 retinal homeobox protein Rx2 from Oryzias latipes
48% identity, 17% coverage

RHTO_06229 Homeobox domain containing protein, transcription factor from Rhodotorula toruloides NP11
46% identity, 9% coverage

K1QWY6 Paired box protein Pax-6 from Magallana gigas
45% identity, 16% coverage

CG9876 uncharacterized protein from Drosophila melanogaster
45% identity, 20% coverage

LOC658656, NP_001139341 apterous a from Tribolium castaneum
46% identity, 12% coverage

ALX4_HUMAN / Q9H161 Homeobox protein aristaless-like 4 from Homo sapiens (Human) (see 4 papers)
NP_068745 homeobox protein aristaless-like 4 from Homo sapiens
51% identity, 13% coverage

NP_571635 mix-type homeobox gene 1 from Danio rerio
41% identity, 19% coverage

F2Z897 Apterous A splicing isoform type B from Bombyx mori
46% identity, 14% coverage

NCU03593 homeobox domain-containing protein from Neurospora crassa OR74A
47% identity, 8% coverage

XP_011607069 paired box protein Pax-3 isoform X1 from Takifugu rubripes
45% identity, 11% coverage

XP_017213137 paired box protein Pax-3a isoform X1 from Danio rerio
45% identity, 11% coverage

LOC552251 homeobox protein aristaless from Apis mellifera
49% identity, 14% coverage

P29673 Protein apterous from Drosophila melanogaster
NP_724428 apterous, isoform A from Drosophila melanogaster
46% identity, 12% coverage

A1JVI8 Homeobox protein from Mus musculus
43% identity, 7% coverage

ALX3_HUMAN / O95076 Homeobox protein aristaless-like 3; Proline-rich transcription factor ALX3 from Homo sapiens (Human) (see paper)
NP_006483 homeobox protein aristaless-like 3 from Homo sapiens
46% identity, 16% coverage

LOC725574 LIM/homeobox protein Awh from Apis mellifera
43% identity, 17% coverage

XP_001949543 LIM/homeobox protein Lhx2 from Acyrthosiphon pisum
45% identity, 16% coverage

VSX1_MOUSE / Q91V10 Visual system homeobox 1; Homeodomain protein RINX; Retinal inner nuclear layer homeobox protein; Transcription factor VSX1 from Mus musculus (Mouse) (see 2 papers)
NP_473409 visual system homeobox 1 from Mus musculus
49% identity, 14% coverage

MIXL1_MOUSE / Q9WUI0 Homeobox protein MIXL1; Homeodomain protein MIX; mMix; MIX1 homeobox-like protein 1; Mix.1 homeobox-like protein from Mus musculus (Mouse) (see 11 papers)
43% identity, 24% coverage

T1G8F8 Uncharacterized protein from Helobdella robusta
43% identity, 19% coverage

XP_005167424 LIM/homeobox protein Lhx2b isoform X1 from Danio rerio
45% identity, 13% coverage

XP_012818027 visual system homeobox 1 isoform X1 from Xenopus tropicalis
49% identity, 15% coverage

TRF2_PYRO7 / G4N2B2 Pyriculol/pyriculariol biosynthesis cluster transcription factor 1 from Pyricularia oryzae (strain 70-15 / ATCC MYA-4617 / FGSC 8958) (Rice blast fungus) (Magnaporthe oryzae) (see paper)
45% identity, 8% coverage

NP_492586 Homeobox protein ceh-5 from Caenorhabditis elegans
42% identity, 43% coverage

LIM4_CAEEL / G5EEA1 LIM/homeobox protein lim-4 from Caenorhabditis elegans (see 5 papers)
NP_508669 LIM/homeobox protein lim-4 from Caenorhabditis elegans
43% identity, 16% coverage

NP_990100 visual system homeobox 1 from Gallus gallus
49% identity, 15% coverage

XP_695330 homeobox protein aristaless-like 3 isoform X2 from Danio rerio
46% identity, 15% coverage

NP_001287047 eyegone, isoform C from Drosophila melanogaster
50% identity, 8% coverage

3cmyA / P23760 Structure of a homeodomain in complex with DNA (see paper)
47% identity, 88% coverage

NP_031467 homeobox protein aristaless-like 3 from Mus musculus
46% identity, 16% coverage

CIMG_09071 homeobox domain-containing protein from Coccidioides immitis RS
44% identity, 9% coverage

VSX1_HUMAN / Q9NZR4 Visual system homeobox 1; Homeodomain protein RINX; Retinal inner nuclear layer homeobox protein; Transcription factor VSX1 from Homo sapiens (Human) (see 10 papers)
46% identity, 15% coverage

Q5ZNB2 Paired box protein 7 from Salvelinus alpinus
49% identity, 11% coverage

AL_DROME / Q06453 Homeobox protein aristaless from Drosophila melanogaster (Fruit fly) (see paper)
NP_722629 aristaless from Drosophila melanogaster
49% identity, 12% coverage

NP_477026 reversed polarity from Drosophila melanogaster
47% identity, 8% coverage

NP_001245266 paired box protein Pax-7a from Oncorhynchus mykiss
49% identity, 11% coverage

NP_001245265 paired box gene 7b from Oncorhynchus mykiss
49% identity, 11% coverage

NP_006874 short stature homeobox protein isoform SHOXb from Homo sapiens
47% identity, 24% coverage

NP_001033832 visual system homeobox 2, isoform A from Drosophila melanogaster
51% identity, 8% coverage

LOC102223853 dorsal root ganglia homeobox protein-like from Xiphophorus maculatus
45% identity, 18% coverage

PAX7_MOUSE / P47239 Paired box protein Pax-7 from Mus musculus (Mouse) (see 3 papers)
49% identity, 11% coverage

Q683Y9 Paired box protein 7 from Salmo salar
49% identity, 10% coverage

NP_001139621 paired box protein Pax-7b from Danio rerio
49% identity, 10% coverage

NP_001007013 homeobox protein aristaless-like 3 from Rattus norvegicus
46% identity, 16% coverage

XP_066838118 paired box protein Pax-7 isoform X1 from Anser cygnoides
49% identity, 10% coverage

NP_571400 paired box protein Pax-7a isoform PAX7C from Danio rerio
49% identity, 10% coverage

XP_009304561 paired box protein Pax-7a isoform X1 from Danio rerio
49% identity, 11% coverage

XP_006538694 paired box protein Pax-7 isoform X3 from Mus musculus
49% identity, 11% coverage

LOC536229, XP_015316176 paired box protein Pax-7 from Bos taurus
49% identity, 13% coverage

XP_006239325 paired box protein Pax-7 isoform X1 from Rattus norvegicus
49% identity, 10% coverage

XP_002274194 homeobox-leucine zipper protein REVOLUTA isoform X1 from Vitis vinifera
41% identity, 6% coverage

PAX7_HUMAN / P23759 Paired box protein Pax-7; HuP1 from Homo sapiens (Human) (see 3 papers)
NP_001128726 paired box protein Pax-7 isoform 3 from Homo sapiens
49% identity, 10% coverage

NP_990396 paired box protein Pax-7 from Gallus gallus
49% identity, 10% coverage

NP_001020793 short stature homeobox protein from Canis lupus familiaris
47% identity, 18% coverage

SHOX_HUMAN / O15266 Short stature homeobox protein; Pseudoautosomal homeobox-containing osteogenic protein; Short stature homeobox-containing protein from Homo sapiens (Human) (see 4 papers)
47% identity, 18% coverage

XP_020951119 paired box protein Pax-7 isoform X3 from Sus scrofa
49% identity, 11% coverage

NP_723721 paired, isoform B from Drosophila melanogaster
P06601 Segmentation protein paired from Drosophila melanogaster
41% identity, 9% coverage

UNC4_RAT / P97830 Homeobox protein unc-4 homolog; Homeobox protein Uncx4.1; Paired-type homeodomain transcription factor 1 from Rattus norvegicus (Rat) (see paper)
46% identity, 11% coverage

HDG11_ARATH / Q9FX31 Homeobox-leucine zipper protein HDG11; HD-ZIP protein HDG11; Homeodomain GLABRA 2-like protein 11; Homeodomain transcription factor HDG11; Protein ENHANCED DROUGHT TOLERANCE 1; Protein HOMEODOMAIN GLABROUS 11 from Arabidopsis thaliana (Mouse-ear cress) (see 7 papers)
AT1G73360, NP_177479 homeodomain GLABROUS 11 from Arabidopsis thaliana
NP_177479 HDG11 (HOMEODOMAIN GLABROUS 11); DNA binding / transcription factor from Arabidopsis thaliana
45% identity, 7% coverage

VSX1_CARAU / Q90277 Visual system homeobox 1; Homeobox protein VSX-1; Transcription factor VSX1 from Carassius auratus (Goldfish) (see paper)
45% identity, 16% coverage

NP_001315326 paired box protein Pax-3b from Danio rerio
45% identity, 12% coverage

A6NJT0 Homeobox protein unc-4 homolog from Homo sapiens
NP_001073930 homeobox protein unc-4 homolog from Homo sapiens
46% identity, 11% coverage

RAX2_BOVIN / Q7YRX0 Retina and anterior neural fold homeobox protein 2; Q50-type retinal homeobox protein; Retina and anterior neural fold homeobox-like protein 1 from Bos taurus (Bovine) (see paper)
48% identity, 29% coverage

XP_004251138 homeobox-leucine zipper protein REVOLUTA from Solanum lycopersicum
41% identity, 6% coverage

XP_008330042 LIM/homeobox protein Lhx9 isoform X1 from Cynoglossus semilaevis
45% identity, 14% coverage

LHX2_MOUSE / Q9Z0S2 LIM/homeobox protein Lhx2; Homeobox protein LH-2; LIM homeobox protein 2 from Mus musculus (Mouse) (see 2 papers)
45% identity, 14% coverage

8ejoA / A0A6I8NF41 Crystal structure of the homeodomain of platypus sdux in complex with DNA (see paper)
44% identity, 93% coverage

XP_012818036 short stature homeobox protein 2 isoform X1 from Xenopus tropicalis
47% identity, 18% coverage

NP_477330 PvuII-PstI homology 13 from Drosophila melanogaster
43% identity, 16% coverage

UNC4_MOUSE / O08934 Homeobox protein unc-4 homolog; Homeobox protein Uncx4.1 from Mus musculus (Mouse) (see 11 papers)
NP_038730 homeobox protein unc-4 homolog from Mus musculus
46% identity, 11% coverage

RX_HUMAN / Q9Y2V3 Retinal homeobox protein Rx; Retina and anterior neural fold homeobox protein from Homo sapiens (Human) (see 2 papers)
NP_038463 retinal homeobox protein Rx from Homo sapiens
48% identity, 16% coverage

XP_005167911 short stature homeobox protein isoform X1 from Danio rerio
47% identity, 17% coverage

RAX2_HUMAN / Q96IS3 Retina and anterior neural fold homeobox protein 2; Q50-type retinal homeobox protein; Retina and anterior neural fold homeobox-like protein 1 from Homo sapiens (Human) (see 2 papers)
NP_116142 retina and anterior neural fold homeobox protein 2 from Homo sapiens
48% identity, 29% coverage

VSX1_DANRE / O42250 Visual system homeobox 1; Transcription factor VSX1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_571408 visual system homeobox 1 from Danio rerio
45% identity, 16% coverage

DUX1_HUMAN / O43812 Double homeobox protein 1 from Homo sapiens (Human) (see 2 papers)
TC 1.I.1.1.3 / O43812 Nuclear Pore Complex, NPC with 86 protein components from Homo sapiens
44% identity, 32% coverage

NP_446130 retinal homeobox protein Rx from Rattus norvegicus
48% identity, 16% coverage

O35602 Retinal homeobox protein Rx from Mus musculus
NP_038861 retinal homeobox protein Rx from Mus musculus
48% identity, 16% coverage

NP_957490 short stature homeobox protein 2 from Danio rerio
47% identity, 18% coverage

AFUA_4G10220, Afu4g10220 homeobox transcription factor (RfeB), putative from Aspergillus fumigatus Af293
44% identity, 9% coverage

NP_733401 Zn finger homeodomain 1, isoform E from Drosophila melanogaster
40% identity, 5% coverage

HM10_CAEEL / P41935 Homeobox protein ceh-10 from Caenorhabditis elegans (see 7 papers)
45% identity, 15% coverage

XP_003029756 regulator of mushroom development from Schizophyllum commune H4-8
47% identity, 7% coverage

NP_001157150 short stature homeobox protein 2 isoform c from Homo sapiens
47% identity, 17% coverage

XP_006713790 short stature homeobox protein 2 isoform X1 from Homo sapiens
47% identity, 15% coverage

O60902 Short stature homeobox protein 2 from Homo sapiens
47% identity, 16% coverage

NP_001289286 short stature homeobox protein 2 isoform 2 from Mus musculus
47% identity, 17% coverage

NP_037160 short stature homeobox protein 2 from Rattus norvegicus
47% identity, 16% coverage

DUX5_HUMAN / Q96PT3 Double homeobox protein 5 from Homo sapiens (Human) (see paper)
44% identity, 28% coverage

SHOX2_MOUSE / P70390 Short stature homeobox protein 2; Homeobox protein Og12X; OG-12; Paired family homeodomain protein Prx3 from Mus musculus (Mouse) (see paper)
P70390 glutaredoxin-dependent peroxiredoxin (EC 1.11.1.25) from Mus musculus (see paper)
47% identity, 16% coverage

XP_023014225 paired mesoderm homeobox protein 2A-like from Leptinotarsa decemlineata
43% identity, 16% coverage

NP_001139175 paired like homeobox 2Ba from Danio rerio
47% identity, 34% coverage

NP_571120 homeobox protein engrailed-1a from Danio rerio
43% identity, 24% coverage

LOC100121225 paired mesoderm homeobox protein 2 from Nasonia vitripennis
45% identity, 21% coverage

NP_726607 eyeless, isoform B from Drosophila melanogaster
39% identity, 9% coverage

NP_571300 retinal homeobox protein Rx1 from Danio rerio
48% identity, 17% coverage

CC1G_01962 hypothetical protein from Coprinopsis cinerea okayama7#130
44% identity, 8% coverage

NP_001014693 eyeless, isoform D from Drosophila melanogaster
39% identity, 6% coverage

HOX29_ORYSI / A2WLR5 Homeobox-leucine zipper protein HOX29; HD-ZIP protein HOX29; Homeodomain transcription factor HOX29; OsHox29 from Oryza sativa subsp. indica (Rice) (see paper)
41% identity, 6% coverage

RXA_XENLA / O42201 Retinal homeobox protein Rx-A; Rx1A; Xrx1; Retina and anterior neural fold homeobox protein A from Xenopus laevis (African clawed frog) (see 2 papers)
NP_001081687 retinal homeobox protein Rx-A from Xenopus laevis
48% identity, 17% coverage

ATHB8_ARATH / Q39123 Homeobox-leucine zipper protein ATHB-8; HD-ZIP protein ATHB-8; Homeodomain transcription factor ATHB-8 from Arabidopsis thaliana (Mouse-ear cress) (see 7 papers)
AT4G32880, NP_195014 homeobox-leucine zipper protein ATHB-8 from Arabidopsis thaliana
NP_195014 ATHB-8 (HOMEOBOX GENE 8); DNA binding / transcription factor from Arabidopsis thaliana
39% identity, 6% coverage

XP_002936715 retinal homeobox protein Rx from Xenopus tropicalis
48% identity, 17% coverage

ZFH1_DROME / P28166 Zinc finger protein 1; Zinc finger homeodomain protein 1 from Drosophila melanogaster (Fruit fly) (see paper)
40% identity, 6% coverage

PAX6_DROME / O18381 Paired box protein Pax-6; Protein eyeless from Drosophila melanogaster (Fruit fly) (see paper)
39% identity, 7% coverage

RXB_XENLA / O42567 Retinal homeobox protein Rx-B; Retina and anterior neural fold homeobox protein B; Rx2A; Xrx2 from Xenopus laevis (African clawed frog) (see paper)
48% identity, 17% coverage

ISX_MOUSE / A1A546 Intestine-specific homeobox from Mus musculus (Mouse) (see 2 papers)
NP_082113 intestine-specific homeobox isoform 2 from Mus musculus
47% identity, 21% coverage

NP_571302 retinal homeobox protein Rx3 from Danio rerio
48% identity, 19% coverage

HOX29_ORYSJ / Q5QMZ9 Homeobox-leucine zipper protein HOX29; HD-ZIP protein HOX29; Homeodomain transcription factor HOX29; OSHB5; OsHox29 from Oryza sativa subsp. japonica (Rice) (see 2 papers)
41% identity, 6% coverage

Q6FKZ3 Candida glabrata strain CBS138 chromosome L complete sequence from Candida glabrata (strain ATCC 2001 / BCRC 20586 / JCM 3761 / NBRC 0622 / NRRL Y-65 / CBS 138)
37% identity, 10% coverage

A0A0B7A551 Uncharacterized protein (Fragment) from Arion vulgaris
45% identity, 12% coverage

NP_571459 aristaless-related homeobox protein from Danio rerio
49% identity, 12% coverage

LOC103708475 homeobox-leucine zipper protein ATHB-15-like from Phoenix dactylifera
39% identity, 6% coverage

GSB_DROME / P09082 Protein gooseberry; BSH9; Protein gooseberry distal from Drosophila melanogaster (Fruit fly) (see paper)
NP_523863 gooseberry from Drosophila melanogaster
39% identity, 13% coverage

NP_001084383 paired like homeobox 2B L homeolog from Xenopus laevis
43% identity, 19% coverage

PAX3A_XENLA / Q645N4 Paired box protein Pax-3-A; xPax3-A; Paired-domain transcription factor Pax3-A from Xenopus laevis (African clawed frog) (see 7 papers)
47% identity, 11% coverage

REV_ARATH / Q9SE43 Homeobox-leucine zipper protein REVOLUTA; HD-ZIP protein REV; Homeodomain transcription factor REV; Protein AMPHIVASAL VASCULAR BUNDLE 1; Protein INTERFASCICULAR FIBERLESS 1 from Arabidopsis thaliana (Mouse-ear cress) (see 12 papers)
AT5G60690, NP_200877 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein from Arabidopsis thaliana
NP_200877 REV (REVOLUTA); DNA binding / lipid binding / transcription factor from Arabidopsis thaliana
39% identity, 6% coverage

NP_001086450 aristaless related homeobox L homeolog from Xenopus laevis
49% identity, 10% coverage

Q05917 Homeobox protein engrailed-2 from Gallus gallus
NP_001254648 homeobox protein engrailed-2 from Gallus gallus
41% identity, 19% coverage

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 798,070 different protein sequences to 1,261,478 scientific articles. Searches against EuropePMC were last performed on May 12 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.
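
The E-value cutoff described above can be illustrated with a minimal sketch. It assumes the BLAST results are available as BLAST+ tabular output (`-outfmt 6`, default 12 columns); the function name `parse_hits` and the example rows are hypothetical and are not taken from PaperBLAST's own code.

```python
E_VALUE_CUTOFF = 1e-3  # the E < 0.001 threshold stated in the text


def parse_hits(tabular_text, cutoff=E_VALUE_CUTOFF):
    """Return (subject_id, percent_identity, e_value) for hits below the cutoff.

    Assumes BLAST+ tabular output (-outfmt 6) with the default columns:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send
    evalue bitscore
    """
    hits = []
    for line in tabular_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        if len(fields) < 12:
            continue  # skip malformed rows
        subject_id = fields[1]
        percent_identity = float(fields[2])
        e_value = float(fields[10])
        if e_value < cutoff:
            hits.append((subject_id, percent_identity, e_value))
    return hits


# Made-up rows for illustration only; these are not real PaperBLAST results.
example = "\n".join([
    "query\thitA\t100.0\t58\t0\t0\t1\t58\t150\t207\t1e-35\t120",
    "query\thitB\t47.0\t55\t29\t0\t2\t56\t10\t64\t2e-9\t50.1",
    "query\thitC\t30.0\t40\t28\t0\t5\t44\t3\t42\t0.5\t20.0",
])
kept = parse_hits(example)
```

Here `hitC` is dropped because its E-value (0.5) is not below the cutoff, while the other two rows are kept.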

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)
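
As a rough illustration of the "locus_tag AND genus_name" query form, the sketch below builds such a query string from a locus tag and an organism name. The helper `build_query` is hypothetical, and taking the first word of the organism name as the genus is an assumption; the sketch does not contact EuropePMC and may differ from how PaperBLAST actually constructs its queries.

```python
def build_query(locus_tag, organism):
    """Build a query of the form 'locus_tag AND genus_name'.

    Assumption: the genus is the first whitespace-separated word of the
    organism name (for example, 'Aspergillus' in 'Aspergillus fumigatus Af293').
    """
    words = organism.split()
    if not locus_tag or not words:
        raise ValueError("both a locus tag and an organism name are required")
    genus = words[0]
    return f"{locus_tag} AND {genus}"


# Locus tags and organisms taken from the hit list above.
queries = [
    build_query("AFUA_4G10220", "Aspergillus fumigatus Af293"),
    build_query("AT4G32880", "Arabidopsis thaliana"),
]
```

Pairing the locus tag with the genus is what keeps a short or ambiguous tag from matching papers about unrelated organisms.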

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.
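
The snippet step can be sketched as follows. This is only one plausible approach under stated assumptions: a fixed character window around each mention, at most two non-overlapping snippets, and a fall-back to the abstract when no full text is available. The function names and the window size are hypothetical; PaperBLAST's actual selection heuristics are not described here and may differ.

```python
import re


def select_snippets(text, identifier, max_snippets=2, window=80):
    """Return up to max_snippets text windows that contain the identifier."""
    # Match the identifier only as a whole token, not inside a longer word.
    pattern = re.compile(r"(?<!\w)" + re.escape(identifier) + r"(?!\w)")
    snippets = []
    last_end = -1
    for match in pattern.finditer(text):
        start = max(0, match.start() - window)
        end = min(len(text), match.end() + window)
        if start < last_end:
            continue  # overlaps the previous snippet, so skip this mention
        snippets.append(text[start:end].strip())
        last_end = end
        if len(snippets) == max_snippets:
            break
    return snippets


def snippets_for_article(identifier, full_text=None, abstract=None):
    """Prefer the full text; fall back to the abstract if it is unavailable."""
    source = full_text if full_text is not None else abstract
    if not source:
        return []
    return select_snippets(source, identifier)
```

For example, calling `snippets_for_article("AT4G32880", abstract="...")` on an abstract that never mentions the locus tag returns an empty list, which mirrors the point that identifiers are rarely provided in abstracts.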

PaperBLAST also incorporates manually-curated protein functions from the curated databases named above.

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

Many of the changes to PaperBLAST since the paper was written are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's web site and is available in PubMed Central or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifiers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances.
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory