PaperBLAST – Find papers about a protein or its homologs

PaperBLAST

PaperBLAST Hits for 86 a.a. (AGSDSEEGLL...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmembrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Found 250 similar proteins in the literature:

ARX_MOUSE / O35085 Homeobox protein ARX; Aristaless-related homeobox from Mus musculus (Mouse) (see 3 papers)
NP_031518 homeobox protein ARX from Mus musculus
99% identity, 15% coverage

NP_001093644 homeobox protein ARX from Rattus norvegicus
99% identity, 15% coverage

ARX_HUMAN / Q96QS3 Homeobox protein ARX; Aristaless-related homeobox from Homo sapiens (Human) (see 9 papers)
NP_620689 homeobox protein ARX from Homo sapiens
99% identity, 15% coverage

NP_001086450 aristaless related homeobox L homeolog from Xenopus laevis
97% identity, 16% coverage

NP_001079329 aristaless related homeobox S homeolog from Xenopus laevis
95% identity, 16% coverage

NP_571459 aristaless-related homeobox protein from Danio rerio
94% identity, 19% coverage

AL_DROME / Q06453 Homeobox protein aristaless from Drosophila melanogaster (Fruit fly) (see paper)
NP_722629 aristaless from Drosophila melanogaster
74% identity, 20% coverage

LOC552251 homeobox protein aristaless from Apis mellifera
82% identity, 18% coverage

NP_477330 PvuII-PstI homology 13 from Drosophila melanogaster
78% identity, 20% coverage

LOC102223853 dorsal root ganglia homeobox protein-like from Xiphophorus maculatus
69% identity, 25% coverage

NP_001025475 homeobox protein aristaless-like 4 from Bos taurus
77% identity, 19% coverage

ALX4_MOUSE / O35137 Homeobox protein aristaless-like 4; ALX-4 from Mus musculus (Mouse) (see paper)
NP_031468 homeobox protein aristaless-like 4 from Mus musculus
77% identity, 19% coverage

ALX4_HUMAN / Q9H161 Homeobox protein aristaless-like 4 from Homo sapiens (Human) (see 4 papers)
NP_068745 homeobox protein aristaless-like 4 from Homo sapiens
77% identity, 18% coverage

XP_023014225 paired mesoderm homeobox protein 2A-like from Leptinotarsa decemlineata
76% identity, 19% coverage

A6NNA5 Dorsal root ganglia homeobox protein from Homo sapiens
67% identity, 32% coverage

XP_063131133 dorsal root ganglia homeobox protein isoform X1 from Rattus norvegicus
70% identity, 30% coverage

DRGX_MOUSE / Q8BYH0 Dorsal root ganglia homeobox protein; Dorsal root ganglion 11; Homeobox protein DRG11; Paired-related homeobox protein-like 1 from Mus musculus (Mouse) (see 3 papers)
XP_006518475 dorsal root ganglia homeobox protein isoform X2 from Mus musculus
70% identity, 30% coverage

XP_001340966 homeobox protein aristaless-like 4 from Danio rerio
80% identity, 18% coverage

NP_726006 retinal homeobox from Drosophila melanogaster
65% identity, 8% coverage

ARXH_CAEEL / Q21836 Homeobox ARX homolog alr-1; Aristaless-related homeobox alr-1 from Caenorhabditis elegans (see 4 papers)
NP_509860 Homeobox ARX homolog alr-1 from Caenorhabditis elegans
66% identity, 22% coverage

XP_005156907 dorsal root ganglia homeobox protein isoform X1 from Danio rerio
71% identity, 26% coverage

NP_788420 homeobrain from Drosophila melanogaster
74% identity, 18% coverage

Q62798 Dorsal root ganglia homeobox protein from Rattus norvegicus
69% identity, 30% coverage

ALX3_HUMAN / O95076 Homeobox protein aristaless-like 3; Proline-rich transcription factor ALX3 from Homo sapiens (Human) (see paper)
NP_006483 homeobox protein aristaless-like 3 from Homo sapiens
76% identity, 19% coverage

NP_001007013 homeobox protein aristaless-like 3 from Rattus norvegicus
76% identity, 19% coverage

NP_031467 homeobox protein aristaless-like 3 from Mus musculus
76% identity, 19% coverage

XP_017213137 paired box protein Pax-3a isoform X1 from Danio rerio
75% identity, 15% coverage

Q91574 ALX homeobox protein 1 from Xenopus laevis
68% identity, 22% coverage

ALX1_GEOFO / P0DMV5 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Geospiza fortis (Medium ground-finch) (see paper)
68% identity, 23% coverage

ALX1_MOUSE / Q8C8B0 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Mus musculus (Mouse) (see 2 papers)
NP_766141 ALX homeobox protein 1 isoform 1 from Mus musculus
68% identity, 23% coverage

ALX1_HUMAN / Q15699 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Homo sapiens (Human) (see 4 papers)
NP_008913 ALX homeobox protein 1 from Homo sapiens
68% identity, 23% coverage

ALX1_RAT / Q63087 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Rattus norvegicus (Rat) (see 4 papers)
68% identity, 23% coverage

PAX3B_XENLA / Q0IH87 Paired box protein Pax-3-B; xPax3-B; Paired-domain transcription factor Pax3-B from Xenopus laevis (African clawed frog) (see 7 papers)
74% identity, 16% coverage

XP_006245193 paired box protein Pax-3 isoform X1 from Rattus norvegicus
74% identity, 16% coverage

NP_001120838 paired box protein Pax-3 isoform PAX3i from Homo sapiens
74% identity, 16% coverage

Q8BRF1 Paired box 3 from Mus musculus
NP_001152992 paired box protein Pax-3 isoform b from Mus musculus
74% identity, 15% coverage

PAX3A_XENLA / Q645N4 Paired box protein Pax-3-A; xPax3-A; Paired-domain transcription factor Pax3-A from Xenopus laevis (African clawed frog) (see 7 papers)
74% identity, 15% coverage

NP_571300 retinal homeobox protein Rx1 from Danio rerio
71% identity, 21% coverage

XP_001495022 paired box protein Pax-3 isoform X2 from Equus caballus
74% identity, 16% coverage

NP_001193747 paired box protein Pax-3 from Bos taurus
74% identity, 15% coverage

PAX3_HUMAN / P23760 Paired box protein Pax-3; HuP2 from Homo sapiens (Human) (see 26 papers)
NP_852122 paired box protein Pax-3 isoform PAX3 from Homo sapiens
74% identity, 16% coverage

PAX3_MOUSE / P24610 Paired box protein Pax-3 from Mus musculus (Mouse) (see 3 papers)
74% identity, 16% coverage

NP_001014818 paired like homeobox 2Bb from Danio rerio
77% identity, 23% coverage

XP_695330 homeobox protein aristaless-like 3 isoform X2 from Danio rerio
69% identity, 20% coverage

ALX1_DANRE / Q1LVQ7 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_001038539 ALX homeobox protein 1 from Danio rerio
69% identity, 23% coverage

NP_996953 paired mesoderm homeobox protein 2A from Danio rerio
77% identity, 23% coverage

NP_446321 paired mesoderm homeobox protein 2A from Rattus norvegicus
77% identity, 23% coverage

NP_032913 paired mesoderm homeobox protein 2A from Mus musculus
Q62066 Paired mesoderm homeobox protein 2A from Mus musculus
77% identity, 23% coverage

CEH17_CAEEL / G5EC89 Homeobox protein ceh-17 from Caenorhabditis elegans (see 2 papers)
78% identity, 27% coverage

O35602 Retinal homeobox protein Rx from Mus musculus
NP_038861 retinal homeobox protein Rx from Mus musculus
70% identity, 20% coverage

RXB_XENLA / O42567 Retinal homeobox protein Rx-B; Retina and anterior neural fold homeobox protein B; Rx2A; Xrx2 from Xenopus laevis (African clawed frog) (see paper)
73% identity, 20% coverage

RXA_XENLA / O42201 Retinal homeobox protein Rx-A; Rx1A; Xrx1; Retina and anterior neural fold homeobox protein A from Xenopus laevis (African clawed frog) (see 2 papers)
NP_001081687 retinal homeobox protein Rx-A from Xenopus laevis
73% identity, 20% coverage

NP_001084383 paired like homeobox 2B L homeolog from Xenopus laevis
77% identity, 22% coverage

PHX2B_MOUSE / O35690 Paired mesoderm homeobox protein 2B; Neuroblastoma Phox; NBPhox; PHOX2B homeodomain protein; Paired-like homeobox 2B from Mus musculus (Mouse) (see paper)
PHX2B_HUMAN / Q99453 Paired mesoderm homeobox protein 2B; Neuroblastoma Phox; NBPhox; PHOX2B homeodomain protein; Paired-like homeobox 2B from Homo sapiens (Human) (see 3 papers)
NP_003915 paired mesoderm homeobox protein 2B from Homo sapiens
NP_032914 paired mesoderm homeobox protein 2B from Mus musculus
77% identity, 20% coverage

NP_989600 paired box protein Pax-3 isoform a from Gallus gallus
74% identity, 15% coverage

PHX2A_HUMAN / O14813 Paired mesoderm homeobox protein 2A; ARIX1 homeodomain protein; Aristaless homeobox protein homolog; Paired-like homeobox 2A from Homo sapiens (Human) (see paper)
NP_005160 paired mesoderm homeobox protein 2A isoform 1 from Homo sapiens
77% identity, 23% coverage

XP_015140956 paired mesoderm homeobox protein 2B from Gallus gallus
77% identity, 22% coverage

XP_011607069 paired box protein Pax-3 isoform X1 from Takifugu rubripes
73% identity, 15% coverage

NP_571400 paired box protein Pax-7a isoform PAX7C from Danio rerio
71% identity, 15% coverage

XP_009304561 paired box protein Pax-7a isoform X1 from Danio rerio
71% identity, 15% coverage

NP_001088995 paired box 7 L homeolog from Xenopus laevis
65% identity, 17% coverage

PAX7_MOUSE / P47239 Paired box protein Pax-7 from Mus musculus (Mouse) (see 3 papers)
71% identity, 15% coverage

XP_006538694 paired box protein Pax-7 isoform X3 from Mus musculus
71% identity, 15% coverage

XP_006239325 paired box protein Pax-7 isoform X1 from Rattus norvegicus
71% identity, 15% coverage

XP_018080096 paired box 7 L homeolog isoform X1 from Xenopus laevis
65% identity, 17% coverage

3a01F / Q06453 Crystal structure of aristaless and clawless homeodomains bound to dna (see paper)
82% identity, 71% coverage

NP_446130 retinal homeobox protein Rx from Rattus norvegicus
70% identity, 20% coverage

XP_066838118 paired box protein Pax-7 isoform X1 from Anser cygnoides
71% identity, 14% coverage

RX_HUMAN / Q9Y2V3 Retinal homeobox protein Rx; Retina and anterior neural fold homeobox protein from Homo sapiens (Human) (see 2 papers)
NP_038463 retinal homeobox protein Rx from Homo sapiens
70% identity, 20% coverage

A0A0B7A551 Uncharacterized protein (Fragment) from Arion vulgaris
71% identity, 16% coverage

NP_001098373 retinal homeobox protein Rx2 from Oryzias latipes
65% identity, 24% coverage

NP_001139621 paired box protein Pax-7b from Danio rerio
71% identity, 15% coverage

LOC536229, XP_015316176 paired box protein Pax-7 from Bos taurus
71% identity, 18% coverage

Ci-Rx / CAC34833.1 Ci-Rx protein from Ciona intestinalis (see paper)
63% identity, 9% coverage

XP_018669748 prx1 protein isoform X1 from Ciona intestinalis
63% identity, 10% coverage

PAX7_HUMAN / P23759 Paired box protein Pax-7; HuP1 from Homo sapiens (Human) (see 3 papers)
NP_001128726 paired box protein Pax-7 isoform 3 from Homo sapiens
65% identity, 17% coverage

XP_020951119 paired box protein Pax-7 isoform X3 from Sus scrofa
65% identity, 17% coverage

8osbE / Q9H161 Twist1-tcf4-alx4 complex on specific DNA (see paper)
84% identity, 71% coverage

Q5ZNB2 Paired box protein 7 from Salvelinus alpinus
65% identity, 17% coverage

NP_990396 paired box protein Pax-7 from Gallus gallus
65% identity, 16% coverage

NP_001245266 paired box protein Pax-7a from Oncorhynchus mykiss
65% identity, 17% coverage

Q683Y9 Paired box protein 7 from Salmo salar
71% identity, 15% coverage

UNC4_MOUSE / O08934 Homeobox protein unc-4 homolog; Homeobox protein Uncx4.1 from Mus musculus (Mouse) (see 11 papers)
NP_038730 homeobox protein unc-4 homolog from Mus musculus
66% identity, 14% coverage

NP_001245265 paired box gene 7b from Oncorhynchus mykiss
65% identity, 17% coverage

A6NJT0 Homeobox protein unc-4 homolog from Homo sapiens
NP_001073930 homeobox protein unc-4 homolog from Homo sapiens
73% identity, 12% coverage

NP_571302 retinal homeobox protein Rx3 from Danio rerio
64% identity, 25% coverage

XP_002936715 retinal homeobox protein Rx from Xenopus tropicalis
73% identity, 20% coverage

UNC4_RAT / P97830 Homeobox protein unc-4 homolog; Homeobox protein Uncx4.1; Paired-type homeodomain transcription factor 1 from Rattus norvegicus (Rat) (see paper)
66% identity, 14% coverage

VSX1_DANRE / O42250 Visual system homeobox 1; Transcription factor VSX1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_571408 visual system homeobox 1 from Danio rerio
71% identity, 19% coverage

XP_015149815 homeobox protein unc-4 homolog isoform X1 from Gallus gallus
73% identity, 13% coverage

VSX1_CARAU / Q90277 Visual system homeobox 1; Homeobox protein VSX-1; Transcription factor VSX1 from Carassius auratus (Goldfish) (see paper)
71% identity, 19% coverage

UNC4_CAEEL / P29506 Homeobox protein unc-4; Homeobox protein ceh-4; Uncoordinated protein 4 from Caenorhabditis elegans (see 8 papers)
NP_496138 Homeobox protein unc-4 from Caenorhabditis elegans
68% identity, 28% coverage

NP_001315326 paired box protein Pax-3b from Danio rerio
76% identity, 14% coverage

VSX1_HUMAN / Q9NZR4 Visual system homeobox 1; Homeodomain protein RINX; Retinal inner nuclear layer homeobox protein; Transcription factor VSX1 from Homo sapiens (Human) (see 10 papers)
68% identity, 18% coverage

R7TKD0 Transcription factor Pax3/7 (Fragment) from Capitella teleta
68% identity, 28% coverage

RAX2_HUMAN / Q96IS3 Retina and anterior neural fold homeobox protein 2; Q50-type retinal homeobox protein; Retina and anterior neural fold homeobox-like protein 1 from Homo sapiens (Human) (see 2 papers)
NP_116142 retina and anterior neural fold homeobox protein 2 from Homo sapiens
59% identity, 41% coverage

VSX1_MOUSE / Q91V10 Visual system homeobox 1; Homeodomain protein RINX; Retinal inner nuclear layer homeobox protein; Transcription factor VSX1 from Mus musculus (Mouse) (see 2 papers)
NP_473409 visual system homeobox 1 from Mus musculus
68% identity, 18% coverage

NP_990100 visual system homeobox 1 from Gallus gallus
67% identity, 19% coverage

XP_012818027 visual system homeobox 1 isoform X1 from Xenopus tropicalis
68% identity, 19% coverage

RAX2_BOVIN / Q7YRX0 Retina and anterior neural fold homeobox protein 2; Q50-type retinal homeobox protein; Retina and anterior neural fold homeobox-like protein 1 from Bos taurus (Bovine) (see paper)
59% identity, 41% coverage

NP_723721 paired, isoform B from Drosophila melanogaster
P06601 Segmentation protein paired from Drosophila melanogaster
64% identity, 12% coverage

Bm8528 Uncharacterized protein from Brugia malayi
73% identity, 35% coverage

NP_001079212 pituitary homeobox 3 from Xenopus laevis
65% identity, 25% coverage

PITX3_HUMAN / O75364 Pituitary homeobox 3; Homeobox protein PITX3; Paired-like homeodomain transcription factor 3 from Homo sapiens (Human) (see 2 papers)
NP_005020 pituitary homeobox 3 from Homo sapiens
64% identity, 25% coverage

PITX3_MOUSE / O35160 Pituitary homeobox 3; Homeobox protein PITX3; Paired-like homeodomain transcription factor 3 from Mus musculus (Mouse) (see 6 papers)
NP_032878 pituitary homeobox 3 from Mus musculus
64% identity, 25% coverage

PITX3_RAT / P81062 Pituitary homeobox 3; Homeobox protein PITX3; Paired-like homeodomain transcription factor 3 from Rattus norvegicus (Rat) (see paper)
XP_006231540 pituitary homeobox 3 isoform X1 from Rattus norvegicus
64% identity, 25% coverage

XP_006526827 pituitary homeobox 3 isoform X1 from Mus musculus
64% identity, 19% coverage

XP_005164261 homeobox protein unc-4 homolog from Danio rerio
66% identity, 15% coverage

NP_001082023 paired like homeodomain 3 L homeolog from Xenopus laevis
65% identity, 25% coverage

NP_001014693 eyeless, isoform D from Drosophila melanogaster
54% identity, 9% coverage

XP_421631 pituitary homeobox 3 isoform X1 from Gallus gallus
65% identity, 25% coverage

PITX3_DANRE / Q6QU75 Pituitary homeobox 3; Bicoid-like homeodomain transcription factor Pitx3; Homeobox protein PITX3; Paired-like homeodomain transcription factor 3 from Danio rerio (Zebrafish) (Brachydanio rerio) (see 3 papers)
NP_991238 pituitary homeobox 3 from Danio rerio
65% identity, 25% coverage

NP_726607 eyeless, isoform B from Drosophila melanogaster
54% identity, 13% coverage

PAX6_DROME / O18381 Paired box protein Pax-6; Protein eyeless from Drosophila melanogaster (Fruit fly) (see paper)
54% identity, 9% coverage

Q25411 Pax6-like protein from Lineus sanguineus
61% identity, 20% coverage

NP_001158383 paired box 6 from Saccoglossus kowalevskii
66% identity, 14% coverage

T1F6U6 Paired box protein Pax-6 from Helobdella robusta
61% identity, 10% coverage

VSX1_XENLA / Q0P031 Visual system homeobox 1; Transcription factor vsx1; Xvsx1 from Xenopus laevis (African clawed frog) (see 2 papers)
66% identity, 19% coverage

NP_001290114 paired box protein Pax-7 from Meleagris gallopavo
58% identity, 15% coverage

UNC4_DANRE / Q50D79 Homeobox protein unc-4 homolog; Homeobox protein Uncx4.1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_001018616 homeobox protein unc-4 homolog from Danio rerio
58% identity, 18% coverage

ISX_MOUSE / A1A546 Intestine-specific homeobox from Mus musculus (Mouse) (see 2 papers)
NP_082113 intestine-specific homeobox isoform 2 from Mus musculus
59% identity, 30% coverage

XP_018111896 paired box 6 L homeolog isoform X1 from Xenopus laevis
60% identity, 16% coverage

NP_001020793 short stature homeobox protein from Canis lupus familiaris
67% identity, 25% coverage

Q5IGV4 Homeodomain transcription factor PaxC from Nematostella vectensis
64% identity, 15% coverage

GSBN_DROME / P09083 Protein gooseberry-neuro; BSH4; Protein gooseberry proximal from Drosophila melanogaster (Fruit fly) (see paper)
NP_523862 gooseberry-neuro from Drosophila melanogaster
69% identity, 15% coverage

V3ZQV3 Uncharacterized protein (Fragment) from Lottia gigantea
XP_009066032 hypothetical protein from Lottia gigantea
62% identity, 18% coverage

SHOX_HUMAN / O15266 Short stature homeobox protein; Pseudoautosomal homeobox-containing osteogenic protein; Short stature homeobox-containing protein from Homo sapiens (Human) (see 4 papers)
67% identity, 25% coverage

V4AMZ8 Uncharacterized protein (Fragment) from Lottia gigantea
70% identity, 23% coverage

XP_012818036 short stature homeobox protein 2 isoform X1 from Xenopus tropicalis
71% identity, 23% coverage

T1G400 Uncharacterized protein from Helobdella robusta
68% identity, 16% coverage

NP_006874 short stature homeobox protein isoform SHOXb from Homo sapiens
67% identity, 32% coverage

PAX6_CHICK / P47237 Paired box protein Pax-6 from Gallus gallus (Chicken) (see paper)
65% identity, 30% coverage

NP_001091013 paired box gene 6 from Canis lupus familiaris
60% identity, 17% coverage

NP_001231127 paired box protein Pax-6 isoform 1 from Mus musculus
NP_001231129 paired box protein Pax-6 isoform 1 from Mus musculus
60% identity, 17% coverage

O60902 Short stature homeobox protein 2 from Homo sapiens
68% identity, 21% coverage

NP_037160 short stature homeobox protein 2 from Rattus norvegicus
68% identity, 21% coverage

HM08_CAEEL / Q94398 Homeobox protein ceh-8 from Caenorhabditis elegans (see paper)
60% identity, 26% coverage

SHOX2_MOUSE / P70390 Short stature homeobox protein 2; Homeobox protein Og12X; OG-12; Paired family homeodomain protein Prx3 from Mus musculus (Mouse) (see paper)
P70390 glutaredoxin-dependent peroxiredoxin (EC 1.11.1.25) from Mus musculus (see paper)
68% identity, 21% coverage

XP_012307699 paired box protein Pax-6 isoform X2 from Aotus nancymaae
60% identity, 17% coverage

VSX2_MOUSE / Q61412 Visual system homeobox 2; Ceh-10 homeodomain-containing homolog; Homeobox protein CHX10 from Mus musculus (Mouse) (see 8 papers)
72% identity, 16% coverage

NP_571537 visual system homeobox 2 from Danio rerio
72% identity, 15% coverage

PAX6_HUMAN / P26367 Paired box protein Pax-6; Aniridia type II protein; Oculorhombin from Homo sapiens (Human) (see 26 papers)
PAX6_MOUSE / P63015 Paired box protein Pax-6; Oculorhombin from Mus musculus (Mouse) (see 6 papers)
NP_001121084 paired box protein Pax-6 isoform a from Homo sapiens
NP_000271 paired box protein Pax-6 isoform a from Homo sapiens
NP_001253186 paired box protein Pax-6 from Macaca mulatta
60% identity, 17% coverage

NP_001289286 short stature homeobox protein 2 isoform 2 from Mus musculus
69% identity, 21% coverage

NP_001157150 short stature homeobox protein 2 isoform c from Homo sapiens
69% identity, 21% coverage

VSX2_DANRE / O42477 Visual system homeobox 2; Ceh-10 homeodomain-containing homolog; Homeobox protein ALX; Homeobox protein CHX10; Transcription factor VSX2 from Danio rerio (Zebrafish) (Brachydanio rerio) (see 2 papers)
72% identity, 15% coverage

VSX2_CHICK / Q9IAL1 Visual system homeobox 2; Ceh-10 homeodomain-containing homolog; Homeobox protein CHX10 from Gallus gallus (Chicken) (see paper)
NP_990099 visual system homeobox 2 from Gallus gallus
72% identity, 15% coverage

VSX2_HUMAN / P58304 Visual system homeobox 2; Ceh-10 homeodomain-containing homolog; Homeobox protein CHX10 from Homo sapiens (Human) (see 8 papers)
NP_878314 visual system homeobox 2 from Homo sapiens
72% identity, 16% coverage

UNC4_DROME / O77215 Homeobox protein unc-4; Paired-like homeodomain protein unc-4; DPHD-1 from Drosophila melanogaster (Fruit fly) (see paper)
74% identity, 10% coverage

PAX6_RAT / P63016 Paired box protein Pax-6; Oculorhombin from Rattus norvegicus (Rat) (see 4 papers)
60% identity, 17% coverage

XP_006713790 short stature homeobox protein 2 isoform X1 from Homo sapiens
69% identity, 20% coverage

NP_001288356 visual system homeobox 2 isoform 1 from Mus musculus
72% identity, 15% coverage

XP_017449515 visual system homeobox 2 isoform X1 from Rattus norvegicus
72% identity, 15% coverage

HM10_CAEEL / P41935 Homeobox protein ceh-10 from Caenorhabditis elegans (see 7 papers)
61% identity, 20% coverage

DMBX1_MOUSE / Q91ZK4 Diencephalon/mesencephalon homeobox protein 1; Diencephalon/mesencephalon-expressed brain homeobox gene 1 protein; Orthodenticle homolog 3; Paired-like homeobox protein DMBX1; Paired-type homeobox Atx from Mus musculus (Mouse) (see 8 papers)
XP_017175447 diencephalon/mesencephalon homeobox protein 1 isoform X3 from Mus musculus
65% identity, 18% coverage

DMX1B_DANRE / Q566X8 Diencephalon/mesencephalon homeobox protein 1-B from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_001017625 diencephalon/mesencephalon homeobox protein 1-B from Danio rerio
65% identity, 18% coverage

LOC107439785 diencephalon/mesencephalon homeobox protein 1-A from Parasteatoda tepidariorum
58% identity, 31% coverage

NP_001139175 paired like homeobox 2Ba from Danio rerio
67% identity, 43% coverage

3cmyA / P23760 Structure of a homeodomain in complex with DNA (see paper)
78% identity, 67% coverage

XP_005168876 paired box protein Pax-6b isoform X1 from Danio rerio
60% identity, 16% coverage

NP_001290437 intestine-specific homeobox from Homo sapiens
62% identity, 27% coverage

XP_065401022 paired mesoderm homeobox protein 2B from Macaca fascicularis
53% identity, 19% coverage

DMBX1_CHICK / F1NEA7 Diencephalon/mesencephalon homeobox protein 1 from Gallus gallus (Chicken) (see 2 papers)
65% identity, 19% coverage

XP_005167911 short stature homeobox protein isoform X1 from Danio rerio
65% identity, 24% coverage

DMX1A_DANRE / Q8JI10 Diencephalon/mesencephalon homeobox protein 1-A; Paired homeobox protein 1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see 2 papers)
65% identity, 18% coverage

DMBX1_HUMAN / Q8NFW5 Diencephalon/mesencephalon homeobox protein 1; Orthodenticle homolog 3; Paired-like homeobox protein DMBX1 from Homo sapiens (Human) (see 2 papers)
65% identity, 18% coverage

NP_571379 paired box protein Pax-6 from Danio rerio
60% identity, 16% coverage

NP_996314 Ptx1, isoform C from Drosophila melanogaster
57% identity, 14% coverage

PITX_DROME / O18400 Pituitary homeobox homolog Ptx1; D-PTX1 from Drosophila melanogaster (Fruit fly) (see paper)
57% identity, 15% coverage

LOC101486458 visual system homeobox 2 from Maylandia zebra
70% identity, 15% coverage

T1G8F8 Uncharacterized protein from Helobdella robusta
64% identity, 25% coverage

VAB3_CAEEL / G5EDS1 Paired box protein 6 homolog; Homeobox and paired domain-containing protein vab-3; Protein male abnormal 18; Variable abnormal morphology protein 3 from Caenorhabditis elegans (see 14 papers)
NP_001024570 Paired box protein 6 homolog from Caenorhabditis elegans
63% identity, 14% coverage

CG9876 uncharacterized protein from Drosophila melanogaster
62% identity, 25% coverage

NP_477026 reversed polarity from Drosophila melanogaster
75% identity, 8% coverage

NP_957490 short stature homeobox protein 2 from Danio rerio
67% identity, 23% coverage

T1FMW8 Uncharacterized protein from Helobdella robusta
63% identity, 14% coverage

NP_524638 twin of eyeless, isoform A from Drosophila melanogaster
58% identity, 13% coverage

NP_001033832 visual system homeobox 2, isoform A from Drosophila melanogaster
64% identity, 10% coverage

NP_523389 Ods-site homeobox from Drosophila melanogaster
67% identity, 16% coverage

1fjlA / P06601 Homeodomain from the drosophila paired protein bound to a DNA oligonucleotide (see paper)
71% identity, 69% coverage

Smp_126560 putative orthopedia homeobox protein from Schistosoma mansoni
73% identity, 5% coverage

OTX1B_DANRE / Q91994 Homeobox protein OTX1 B; zOtx1; Orthodenticle homolog 1 B from Danio rerio (Zebrafish) (Brachydanio rerio) (see 3 papers)
59% identity, 21% coverage

NP_571325 homeobox protein OTX1 B from Danio rerio
59% identity, 21% coverage

XP_013834407 pituitary homeobox 2 isoform X1 from Sus scrofa
61% identity, 22% coverage

XP_005207658 pituitary homeobox 2 isoform X1 from Bos taurus
61% identity, 22% coverage

PITX2_XENLA / Q9PWR3 Pituitary homeobox 2; Homeobox protein PITX2; Paired-like homeodomain transcription factor 2; xPtx2 from Xenopus laevis (African clawed frog) (see 2 papers)
60% identity, 22% coverage

PITX2_CHICK / O93385 Pituitary homeobox 2; Homeobox protein PITX2; cPITX2; Paired-like homeodomain transcription factor 2 from Gallus gallus (Chicken) (see paper)
62% identity, 21% coverage

NP_990341 pituitary homeobox 2 from Gallus gallus
62% identity, 21% coverage

PITX2_MOUSE / P97474 Pituitary homeobox 2; ALL1-responsive protein ARP1; BRX1 homeoprotein; Bicoid-related homeobox protein 1; Homeobox protein PITX2; Orthodenticle-like homeobox 2; Paired-like homeodomain transcription factor 2; Solurshin from Mus musculus (Mouse) (see 8 papers)
NP_035228 pituitary homeobox 2 isoform b from Mus musculus
61% identity, 22% coverage

PITX2_HUMAN / Q99697 Pituitary homeobox 2; ALL1-responsive protein ARP1; Homeobox protein PITX2; Paired-like homeodomain transcription factor 2; RIEG bicoid-related homeobox transcription factor; Solurshin from Homo sapiens (Human) (see 12 papers)
61% identity, 22% coverage

XP_005681362 pituitary homeobox 2 isoform X2 from Capra hircus
NP_001191328 pituitary homeobox 2 isoform a from Homo sapiens
XP_051676055 pituitary homeobox 2 isoform X2 from Oryctolagus cuniculus
61% identity, 26% coverage

NP_062207 pituitary homeobox 2 isoform 2 from Rattus norvegicus
61% identity, 26% coverage

XP_015134823 paired mesoderm homeobox protein 2 isoform X1 from Gallus gallus
68% identity, 24% coverage

pax-6B / CAC85262.2 Pax-6B protein from Dugesia japonica (see paper)
49% identity, 13% coverage

PITX2_DANRE / Q9W5Z2 Pituitary homeobox 2; Homeobox protein PITX2; Paired-like homeodomain transcription factor 2 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
66% identity, 20% coverage

OTX1_HUMAN / P32242 Homeobox protein OTX1; Orthodenticle homolog 1 from Homo sapiens (Human) (see paper)
NP_001186699 homeobox protein OTX1 from Homo sapiens
59% identity, 19% coverage

PRRX2_HUMAN / Q99811 Paired mesoderm homeobox protein 2; Paired-related homeobox protein 2; PRX-2 from Homo sapiens (Human) (see paper)
NP_057391 paired mesoderm homeobox protein 2 from Homo sapiens
68% identity, 23% coverage

XP_005157364 pituitary homeobox 2 isoform X1 from Danio rerio
66% identity, 24% coverage

UNC42_CAEEL / L8E946 Homeobox protein unc-42; Uncoordinated protein 42 from Caenorhabditis elegans (see 5 papers)
68% identity, 22% coverage

NP_505519 Homeobox protein unc-42 from Caenorhabditis elegans
68% identity, 23% coverage

XP_018087945 paired like homeodomain 2 S homeolog isoform X2 from Xenopus laevis
66% identity, 20% coverage

NP_037241 homeobox protein OTX1 from Rattus norvegicus
59% identity, 19% coverage

G7YLP5 Visual system homeobox 1 from Clonorchis sinensis
62% identity, 24% coverage

PRRX1_CHICK / Q05437 Paired mesoderm homeobox protein 1; GMHOX; Homeobox protein MHOX; Paired-related homeobox protein 1; PRX-1 from Gallus gallus (Chicken) (see 2 papers)
68% identity, 23% coverage

ptx1 / CAA04801.1 Ptx1 homeodomain protein from Drosophila melanogaster (see paper)
55% identity, 14% coverage

PRRX1_MOUSE / P63013 Paired mesoderm homeobox protein 1; Homeobox protein K-2; Muscle homeobox protein; MHox; Paired-related homeobox protein 1; PRX-1 from Mus musculus (Mouse) (see 4 papers)
NP_035257 paired mesoderm homeobox protein 1 isoform a from Mus musculus
NP_722543 paired mesoderm homeobox protein 1 from Rattus norvegicus
68% identity, 23% coverage

PRRX1_HUMAN / P54821 Paired mesoderm homeobox protein 1; Homeobox protein PHOX1; Paired-related homeobox protein 1; PRX-1 from Homo sapiens (Human) (see 3 papers)
68% identity, 23% coverage

NP_001080981 pituitary homeobox 1 from Xenopus laevis
66% identity, 21% coverage

Q63410 Homeobox protein OTX1 from Rattus norvegicus
59% identity, 19% coverage

XP_014951176 pituitary homeobox 1 from Ovis aries
61% identity, 22% coverage

XP_006711451 paired mesoderm homeobox protein 1 isoform X1 from Homo sapiens
68% identity, 29% coverage

XP_017207930 paired mesoderm homeobox protein 1b isoform X2 from Danio rerio
68% identity, 21% coverage

PITX1_MOUSE / P70314 Pituitary homeobox 1; Hindlimb-expressed homeobox protein backfoot; Homeobox protein P-OTX; Homeobox protein PITX1; Paired-like homeodomain transcription factor 1; Pituitary OTX-related factor from Mus musculus (Mouse) (see 3 papers)
XP_006517220 pituitary homeobox 1 isoform X1 from Mus musculus
61% identity, 22% coverage

GSB_DROME / P09082 Protein gooseberry; BSH9; Protein gooseberry distal from Drosophila melanogaster (Fruit fly) (see paper)
NP_523863 gooseberry from Drosophila melanogaster
67% identity, 15% coverage

NP_001286652 orthopedia, isoform I from Drosophila melanogaster
67% identity, 15% coverage

OTP_DROME / P56672 Homeobox protein orthopedia from Drosophila melanogaster (Fruit fly) (see paper)
67% identity, 13% coverage

PITX1_HUMAN / P78337 Pituitary homeobox 1; Hindlimb-expressed homeobox protein backfoot; Homeobox protein PITX1; Paired-like homeodomain transcription factor 1 from Homo sapiens (Human) (see 4 papers)
NP_002644 pituitary homeobox 1 from Homo sapiens
64% identity, 20% coverage

K1QWY6 Paired box protein Pax-6 from Magallana gigas
61% identity, 20% coverage

XP_015145717 paired mesoderm homeobox protein 1 isoform X2 from Gallus gallus
68% identity, 27% coverage

Smp_160670 putative paired box protein pax-6 from Schistosoma mansoni
63% identity, 5% coverage

CRX_MOUSE / O54751 Cone-rod homeobox protein from Mus musculus (Mouse) (see paper)
NP_031796 cone-rod homeobox protein isoform 1 from Mus musculus
67% identity, 19% coverage

Smp_163140 pituitary homeobox protein-related from Schistosoma mansoni
57% identity, 8% coverage

CRX_HUMAN / O43186 Cone-rod homeobox protein from Homo sapiens (Human) (see 10 papers)
NP_000545 cone-rod homeobox protein from Homo sapiens
67% identity, 19% coverage

PITX1_CHICK / P56673 Pituitary homeobox 1; Homeobox protein PITX1; Paired-like homeodomain transcription factor 1; cPTX1 from Gallus gallus (Chicken) (see 3 papers)
64% identity, 21% coverage

LOC100121225 paired mesoderm homeobox protein 2 from Nasonia vitripennis
64% identity, 24% coverage

NP_001284898 visual system homeobox 1, isoform B from Drosophila melanogaster
64% identity, 7% coverage

O96756 DtPax-6 protein from Girardia tigrina
54% identity, 12% coverage

NP_001161157 pituitary homeobox 1 from Gallus gallus
64% identity, 21% coverage

LOC107444630 homeobox protein unc-4 homolog from Parasteatoda tepidariorum
65% identity, 15% coverage

HMOC_DROME / P22810 Homeotic protein ocelliless; Protein orthodenticle from Drosophila melanogaster (Fruit fly) (see 3 papers)
NP_001259345 ocelliless, isoform G from Drosophila melanogaster
62% identity, 12% coverage

XP_033105362 homeobox protein OTX-like isoform X1 from Anneissia japonica
57% identity, 23% coverage

NP_001027662 Otx from Ciona intestinalis
62% identity, 14% coverage

OTP_PARLI / O76971 Homeobox protein orthopedia; Orthopedia-related; PlOtp from Paracentrotus lividus (Common sea urchin) (see 2 papers)
71% identity, 14% coverage

OTP_HELTB / Q6SR68 Homeobox protein orthopedia from Heliocidaris tuberculata (Sea urchin) (see paper)
71% identity, 14% coverage

BARH1_DROME / Q24255 Homeobox protein B-H1; Homeobox protein BarH1 from Drosophila melanogaster (Fruit fly) (see 5 papers)
NP_523387 BarH1, isoform A from Drosophila melanogaster
44% identity, 14% coverage

UNC30_CAEEL / P52906 Homeobox protein unc-30; Uncoordinated protein 30 from Caenorhabditis elegans (see 4 papers)
NP_001021277 Homeobox protein unc-30 from Caenorhabditis elegans
56% identity, 23% coverage

OTP_SACKO / Q7YTC2 Homeobox protein orthopedia from Saccoglossus kowalevskii (Acorn worm) (see paper)
71% identity, 16% coverage

OTXH_CAEEL / Q9U2Z0 Homeobox protein ttx-1; Abnormal thermotaxis protein 1; OTX homeobox homolog ttx-1 from Caenorhabditis elegans (see 4 papers)
56% identity, 18% coverage

LOC724282 homeobox protein orthopedia from Apis mellifera
65% identity, 15% coverage

NP_001287047 eyegone, isoform C from Drosophila melanogaster
63% identity, 9% coverage

NP_001024213 Homeobox protein ttx-1 from Caenorhabditis elegans
56% identity, 21% coverage

XP_018117181 orthodenticle homeobox 1 L homeolog isoform X1 from Xenopus laevis
64% identity, 17% coverage

ESX1_HUMAN / Q8N693 Homeobox protein ESX1; Extraembryonic, spermatogenesis, homeobox 1 from Homo sapiens (Human) (see 3 papers)
NP_703149 homeobox protein ESX1 from Homo sapiens
59% identity, 16% coverage

XP_012823826 homeobox protein OTX2 isoform X1 from Xenopus tropicalis
64% identity, 19% coverage

LOC109470978 homeobox protein goosecoid-like from Branchiostoma belcheri
62% identity, 25% coverage

XP_020958123 homeobox protein OTX2 isoform X1 from Sus scrofa
64% identity, 20% coverage

NP_001027541 homeobox protein OTX isoform beta from Strongylocentrotus purpuratus
64% identity, 20% coverage

XP_015142758 homeobox protein OTX2 isoform X1 from Gallus gallus
64% identity, 20% coverage

XP_011243291 homeobox protein OTX2 isoform X2 from Mus musculus
64% identity, 20% coverage

OTX5B_XENLA / Q9PT61 Homeobox protein otx5-B; Orthodenticle homolog 5-B; XOtx5b from Xenopus laevis (African clawed frog) (see 5 papers)
NP_001081916 homeobox protein otx5-B from Xenopus laevis
64% identity, 20% coverage

OTX2A_XENLA / Q91813 Homeobox protein OTX2-A; xOTX2-A; Orthodenticle 2-A; Orthodenticle-A-like protein A from Xenopus laevis (African clawed frog) (see 3 papers)
XP_018087427 homeobox protein OTX2-A isoform X1 from Xenopus laevis
64% identity, 20% coverage

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 798,070 different protein sequences to 1,261,478 scientific articles. Searches against EuropePMC were last performed on May 12 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.
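As a rough illustration of the E < 0.001 cutoff, the sketch below filters BLAST tabular output, assuming the standard 12-column `-outfmt 6` layout. The function name `filter_hits` and the sample rows are hypothetical; PaperBLAST's own code may process BLAST output differently.

```python
from typing import Iterable, List, Tuple

E_VALUE_CUTOFF = 1e-3  # threshold stated in the text (E < 0.001)


def filter_hits(
    lines: Iterable[str], cutoff: float = E_VALUE_CUTOFF
) -> List[Tuple[str, float, float]]:
    """Return (subject_id, percent_identity, e_value) for hits with E < cutoff.

    Assumes BLAST tabular output (-outfmt 6), tab-separated columns:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
    """
    hits = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        if len(fields) < 12:
            continue  # not a complete tabular row
        subject, pident, evalue = fields[1], float(fields[2]), float(fields[10])
        if evalue < cutoff:
            hits.append((subject, pident, evalue))
    return hits


# Made-up rows, for illustration only (not real PaperBLAST output).
sample = [
    "query\tsubjA\t67.0\t60\t20\t0\t1\t60\t100\t159\t2e-20\t85.0",
    "query\tsubjB\t30.0\t40\t28\t0\t5\t44\t10\t49\t0.5\t25.0",
]
kept = filter_hits(sample)
```

Only the first sample row passes the cutoff; the second has E = 0.5 and is dropped.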

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)
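The "locus_tag AND genus_name" query form can be sketched as follows. This is a minimal sketch, assuming the query is sent to Europe PMC's public REST search endpoint; the helper names `build_query` and `build_search_url` are hypothetical, and PaperBLAST's actual scripts may construct and submit queries differently.

```python
from urllib.parse import urlencode

# Europe PMC's public REST search endpoint. Whether PaperBLAST uses this
# exact endpoint and these parameters is an assumption.
EUROPEPMC_SEARCH = "https://www.ebi.ac.uk/europepmc/webservices/rest/search"


def build_query(locus_tag: str, genus_name: str) -> str:
    """Combine an identifier with the genus name, as described in the text."""
    return f"{locus_tag} AND {genus_name}"


def build_search_url(locus_tag: str, genus_name: str, page_size: int = 25) -> str:
    """Return a search URL for the combined query (no request is sent here)."""
    params = {
        "query": build_query(locus_tag, genus_name),
        "format": "json",
        "pageSize": page_size,
    }
    return f"{EUROPEPMC_SEARCH}?{urlencode(params)}"


# Example using an identifier and genus that appear in the hit list above.
url = build_search_url("Smp_160670", "Schistosoma")
```

Requiring the genus name alongside the identifier reduces the chance that a short or ambiguous locus tag matches an unrelated paper.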

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.
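One simple way to pick snippets is to take a window of words around each mention of the identifier. The sketch below illustrates that idea only; the function `select_snippets`, its window size, and its overlap rule are assumptions, not PaperBLAST's actual heuristics.

```python
import re
from typing import List


def select_snippets(
    text: str, identifier: str, window: int = 10, max_snippets: int = 2
) -> List[str]:
    """Return up to max_snippets word windows centred on mentions of identifier."""
    words = text.split()
    # Match the identifier only as a whole token, not as part of a longer one.
    pattern = re.compile(r"(?<!\w)" + re.escape(identifier) + r"(?!\w)")
    snippets: List[str] = []
    last_end = -1  # index of the last word covered by the previous snippet
    for i, word in enumerate(words):
        if i <= last_end:
            continue  # this mention is already inside the previous snippet
        if pattern.search(word):
            start = max(0, i - window)
            end = min(len(words), i + window + 1)
            snippets.append(" ".join(words[start:end]))
            last_end = end - 1
            if len(snippets) == max_snippets:
                break
    return snippets


# Made-up sentence, for illustration only.
example = "We found that Smp_160670 is expressed in the eye."
found = select_snippets(example, "Smp_160670", window=2)
```

The whole-token match keeps a shorter identifier from matching inside a longer one, which matters for locus tags that share a prefix.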

PaperBLAST also incorporates manually-curated protein functions from the curated databases listed above.

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

PaperBLAST has changed since the paper was written. Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's website and is available in PubMed Central or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifiers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances.
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory