PaperBLAST – Find papers about a protein or its homologs

 

PaperBLAST

PaperBLAST Hits for 86 a.a. (NDLKASPTLG...)

Other sequence analysis tools:

Find functional residues: SitesBLAST

Search for conserved domains

Find the best match in UniProt

Compare to protein structures

Predict transmenbrane helices: Phobius

Predict protein localization: PSORTb

Find homologs in fast.genomics

Fitness BLAST: loading...

Found 250 similar proteins in the literature:

VSX1_HUMAN / Q9NZR4 Visual system homeobox 1; Homeodomain protein RINX; Retinal inner nuclear layer homeobox protein; Transcription factor VSX1 from Homo sapiens (Human) (see 10 papers)
99% identity, 24% coverage

VSX1_MOUSE / Q91V10 Visual system homeobox 1; Homeodomain protein RINX; Retinal inner nuclear layer homeobox protein; Transcription factor VSX1 from Mus musculus (Mouse) (see 2 papers)
NP_473409 visual system homeobox 1 from Mus musculus
92% identity, 24% coverage

XP_012818027 visual system homeobox 1 isoform X1 from Xenopus tropicalis
90% identity, 25% coverage

VSX1_XENLA / Q0P031 Visual system homeobox 1; Transcription factor vsx1; Xvsx1 from Xenopus laevis (African clawed frog) (see 2 papers)
87% identity, 25% coverage

VSX1_DANRE / O42250 Visual system homeobox 1; Transcription factor VSX1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_571408 visual system homeobox 1 from Danio rerio
86% identity, 25% coverage

NP_990100 visual system homeobox 1 from Gallus gallus
86% identity, 25% coverage

VSX1_CARAU / Q90277 Visual system homeobox 1; Homeobox protein VSX-1; Transcription factor VSX1 from Carassius auratus (Goldfish) (see paper)
92% identity, 22% coverage

VSX2_MOUSE / Q61412 Visual system homeobox 2; Ceh-10 homeodomain-containing homolog; Homeobox protein CHX10 from Mus musculus (Mouse) (see 8 papers)
88% identity, 19% coverage

VSX2_CHICK / Q9IAL1 Visual system homeobox 2; Ceh-10 homeodomain-containing homolog; Homeobox protein CHX10 from Gallus gallus (Chicken) (see paper)
NP_990099 visual system homeobox 2 from Gallus gallus
88% identity, 18% coverage

VSX2_HUMAN / P58304 Visual system homeobox 2; Ceh-10 homeodomain-containing homolog; Homeobox protein CHX10 from Homo sapiens (Human) (see 8 papers)
NP_878314 visual system homeobox 2 from Homo sapiens
88% identity, 19% coverage

NP_571537 visual system homeobox 2 from Danio rerio
88% identity, 18% coverage

VSX2_DANRE / O42477 Visual system homeobox 2; Ceh-10 homeodomain-containing homolog; Homeobox protein ALX; Homeobox protein CHX10; Transcription factor VSX2 from Danio rerio (Zebrafish) (Brachydanio rerio) (see 2 papers)
88% identity, 17% coverage

NP_001288356 visual system homeobox 2 isoform 1 from Mus musculus
88% identity, 18% coverage

XP_017449515 visual system homeobox 2 isoform X1 from Rattus norvegicus
88% identity, 18% coverage

NP_001033832 visual system homeobox 2, isoform A from Drosophila melanogaster
81% identity, 11% coverage

LOC101486458 visual system homeobox 2 from Maylandia zebra
87% identity, 18% coverage

HM10_CAEEL / P41935 Homeobox protein ceh-10 from Caenorhabditis elegans (see 7 papers)
76% identity, 22% coverage

G7YLP5 Visual system homeobox 1 from Clonorchis sinensis
61% identity, 32% coverage

NP_955457 visual system homeobox 1 isoform b from Homo sapiens
98% identity, 25% coverage

XP_016883327 visual system homeobox 1 isoform X2 from Homo sapiens
98% identity, 27% coverage

NP_001284898 visual system homeobox 1, isoform B from Drosophila melanogaster
75% identity, 9% coverage

ALX1_DANRE / Q1LVQ7 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_001038539 ALX homeobox protein 1 from Danio rerio
66% identity, 23% coverage

ALX1_GEOFO / P0DMV5 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Geospiza fortis (Medium ground-finch) (see paper)
66% identity, 23% coverage

XP_695330 homeobox protein aristaless-like 3 isoform X2 from Danio rerio
72% identity, 19% coverage

Q91574 ALX homeobox protein 1 from Xenopus laevis
66% identity, 22% coverage

XP_001340966 homeobox protein aristaless-like 4 from Danio rerio
72% identity, 17% coverage

ALX1_HUMAN / Q15699 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Homo sapiens (Human) (see 4 papers)
NP_008913 ALX homeobox protein 1 from Homo sapiens
66% identity, 23% coverage

ALX1_MOUSE / Q8C8B0 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Mus musculus (Mouse) (see 2 papers)
NP_766141 ALX homeobox protein 1 isoform 1 from Mus musculus
66% identity, 23% coverage

ALX1_RAT / Q63087 ALX homeobox protein 1; Cartilage homeoprotein 1; CART-1 from Rattus norvegicus (Rat) (see 4 papers)
66% identity, 23% coverage

ALX4_MOUSE / O35137 Homeobox protein aristaless-like 4; ALX-4 from Mus musculus (Mouse) (see paper)
NP_031468 homeobox protein aristaless-like 4 from Mus musculus
72% identity, 17% coverage

ALX4_HUMAN / Q9H161 Homeobox protein aristaless-like 4 from Homo sapiens (Human) (see 4 papers)
NP_068745 homeobox protein aristaless-like 4 from Homo sapiens
72% identity, 16% coverage

NP_001025475 homeobox protein aristaless-like 4 from Bos taurus
72% identity, 17% coverage

NP_726006 retinal homeobox from Drosophila melanogaster
66% identity, 7% coverage

NP_001098373 retinal homeobox protein Rx2 from Oryzias latipes
68% identity, 20% coverage

AL_DROME / Q06453 Homeobox protein aristaless from Drosophila melanogaster (Fruit fly) (see paper)
NP_722629 aristaless from Drosophila melanogaster
66% identity, 18% coverage

NP_001079329 aristaless related homeobox S homeolog from Xenopus laevis
69% identity, 12% coverage

NP_001007013 homeobox protein aristaless-like 3 from Rattus norvegicus
70% identity, 20% coverage

NP_031467 homeobox protein aristaless-like 3 from Mus musculus
70% identity, 20% coverage

NP_001093644 homeobox protein ARX from Rattus norvegicus
68% identity, 12% coverage

ALX3_HUMAN / O95076 Homeobox protein aristaless-like 3; Proline-rich transcription factor ALX3 from Homo sapiens (Human) (see paper)
NP_006483 homeobox protein aristaless-like 3 from Homo sapiens
70% identity, 20% coverage

ARX_HUMAN / Q96QS3 Homeobox protein ARX; Aristaless-related homeobox from Homo sapiens (Human) (see 9 papers)
NP_620689 homeobox protein ARX from Homo sapiens
68% identity, 12% coverage

ARX_MOUSE / O35085 Homeobox protein ARX; Aristaless-related homeobox from Mus musculus (Mouse) (see 3 papers)
NP_031518 homeobox protein ARX from Mus musculus
68% identity, 12% coverage

NP_571300 retinal homeobox protein Rx1 from Danio rerio
67% identity, 19% coverage

NP_001086450 aristaless related homeobox L homeolog from Xenopus laevis
69% identity, 12% coverage

NP_477330 PvuII-PstI homology 13 from Drosophila melanogaster
64% identity, 19% coverage

NP_571459 aristaless-related homeobox protein from Danio rerio
69% identity, 14% coverage

LOC552251 homeobox protein aristaless from Apis mellifera
73% identity, 16% coverage

UNC4_RAT / P97830 Homeobox protein unc-4 homolog; Homeobox protein Uncx4.1; Paired-type homeodomain transcription factor 1 from Rattus norvegicus (Rat) (see paper)
70% identity, 12% coverage

NP_001088995 paired box 7 L homeolog from Xenopus laevis
62% identity, 14% coverage

XP_018080096 paired box 7 L homeolog isoform X1 from Xenopus laevis
62% identity, 14% coverage

XP_002936715 retinal homeobox protein Rx from Xenopus tropicalis
61% identity, 23% coverage

PAX7_HUMAN / P23759 Paired box protein Pax-7; HuP1 from Homo sapiens (Human) (see 3 papers)
NP_001128726 paired box protein Pax-7 isoform 3 from Homo sapiens
62% identity, 14% coverage

XP_020951119 paired box protein Pax-7 isoform X3 from Sus scrofa
62% identity, 14% coverage

RXB_XENLA / O42567 Retinal homeobox protein Rx-B; Retina and anterior neural fold homeobox protein B; Rx2A; Xrx2 from Xenopus laevis (African clawed frog) (see paper)
67% identity, 20% coverage

UNC4_MOUSE / O08934 Homeobox protein unc-4 homolog; Homeobox protein Uncx4.1 from Mus musculus (Mouse) (see 11 papers)
NP_038730 homeobox protein unc-4 homolog from Mus musculus
69% identity, 12% coverage

Q5ZNB2 Paired box protein 7 from Salvelinus alpinus
62% identity, 14% coverage

RXA_XENLA / O42201 Retinal homeobox protein Rx-A; Rx1A; Xrx1; Retina and anterior neural fold homeobox protein A from Xenopus laevis (African clawed frog) (see 2 papers)
NP_001081687 retinal homeobox protein Rx-A from Xenopus laevis
67% identity, 20% coverage

NP_990396 paired box protein Pax-7 from Gallus gallus
62% identity, 14% coverage

NP_001245265 paired box gene 7b from Oncorhynchus mykiss
62% identity, 14% coverage

Q683Y9 Paired box protein 7 from Salmo salar
62% identity, 14% coverage

XP_009304561 paired box protein Pax-7a isoform X1 from Danio rerio
62% identity, 14% coverage

NP_001245266 paired box protein Pax-7a from Oncorhynchus mykiss
62% identity, 14% coverage

XP_066838118 paired box protein Pax-7 isoform X1 from Anser cygnoides
62% identity, 14% coverage

NP_571400 paired box protein Pax-7a isoform PAX7C from Danio rerio
62% identity, 14% coverage

O35602 Retinal homeobox protein Rx from Mus musculus
NP_038861 retinal homeobox protein Rx from Mus musculus
67% identity, 19% coverage

NP_446130 retinal homeobox protein Rx from Rattus norvegicus
67% identity, 19% coverage

Bm8528 Uncharacterized protein from Brugia malayi
68% identity, 35% coverage

RAX2_HUMAN / Q96IS3 Retina and anterior neural fold homeobox protein 2; Q50-type retinal homeobox protein; Retina and anterior neural fold homeobox-like protein 1 from Homo sapiens (Human) (see 2 papers)
NP_116142 retina and anterior neural fold homeobox protein 2 from Homo sapiens
60% identity, 40% coverage

NP_001139621 paired box protein Pax-7b from Danio rerio
62% identity, 14% coverage

XP_006538694 paired box protein Pax-7 isoform X3 from Mus musculus
62% identity, 14% coverage

XP_006239325 paired box protein Pax-7 isoform X1 from Rattus norvegicus
62% identity, 14% coverage

PAX7_MOUSE / P47239 Paired box protein Pax-7 from Mus musculus (Mouse) (see 3 papers)
62% identity, 14% coverage

NP_571302 retinal homeobox protein Rx3 from Danio rerio
67% identity, 22% coverage

A6NJT0 Homeobox protein unc-4 homolog from Homo sapiens
NP_001073930 homeobox protein unc-4 homolog from Homo sapiens
69% identity, 11% coverage

RX_HUMAN / Q9Y2V3 Retinal homeobox protein Rx; Retina and anterior neural fold homeobox protein from Homo sapiens (Human) (see 2 papers)
NP_038463 retinal homeobox protein Rx from Homo sapiens
67% identity, 18% coverage

RAX2_BOVIN / Q7YRX0 Retina and anterior neural fold homeobox protein 2; Q50-type retinal homeobox protein; Retina and anterior neural fold homeobox-like protein 1 from Bos taurus (Bovine) (see paper)
60% identity, 40% coverage

XP_011607069 paired box protein Pax-3 isoform X1 from Takifugu rubripes
55% identity, 17% coverage

Ci-Rx / CAC34833.1 Ci-Rx protein from Ciona intestinalis (see paper)
64% identity, 8% coverage

LOC102223853 dorsal root ganglia homeobox protein-like from Xiphophorus maculatus
62% identity, 23% coverage

XP_018669748 prx1 protein isoform X1 from Ciona intestinalis
64% identity, 8% coverage

NP_001084383 paired like homeobox 2B L homeolog from Xenopus laevis
64% identity, 22% coverage

NP_989600 paired box protein Pax-3 isoform a from Gallus gallus
55% identity, 17% coverage

XP_023014225 paired mesoderm homeobox protein 2A-like from Leptinotarsa decemlineata
64% identity, 18% coverage

XP_017213137 paired box protein Pax-3a isoform X1 from Danio rerio
55% identity, 17% coverage

8osbE / Q9H161 Twist1-tcf4-alx4 complex on specific DNA (see paper)
71% identity, 69% coverage

ARXH_CAEEL / Q21836 Homeobox ARX homolog alr-1; Aristaless-related homeobox alr-1 from Caenorhabditis elegans (see 4 papers)
NP_509860 Homeobox ARX homolog alr-1 from Caenorhabditis elegans
63% identity, 18% coverage

XP_001495022 paired box protein Pax-3 isoform X2 from Equus caballus
55% identity, 17% coverage

NP_001120838 paired box protein Pax-3 isoform PAX3i from Homo sapiens
55% identity, 17% coverage

XP_005164261 homeobox protein unc-4 homolog from Danio rerio
69% identity, 13% coverage

NP_001014818 paired like homeobox 2Bb from Danio rerio
64% identity, 23% coverage

NP_001193747 paired box protein Pax-3 from Bos taurus
55% identity, 17% coverage

XP_015149815 homeobox protein unc-4 homolog isoform X1 from Gallus gallus
69% identity, 12% coverage

XP_006245193 paired box protein Pax-3 isoform X1 from Rattus norvegicus
55% identity, 17% coverage

Q8BRF1 Paired box 3 from Mus musculus
NP_001152992 paired box protein Pax-3 isoform b from Mus musculus
55% identity, 17% coverage

PAX3_HUMAN / P23760 Paired box protein Pax-3; HuP2 from Homo sapiens (Human) (see 26 papers)
NP_852122 paired box protein Pax-3 isoform PAX3 from Homo sapiens
55% identity, 17% coverage

XP_018111896 paired box 6 L homeolog isoform X1 from Xenopus laevis
66% identity, 14% coverage

PAX3B_XENLA / Q0IH87 Paired box protein Pax-3-B; xPax3-B; Paired-domain transcription factor Pax3-B from Xenopus laevis (African clawed frog) (see 7 papers)
55% identity, 17% coverage

PAX3_MOUSE / P24610 Paired box protein Pax-3 from Mus musculus (Mouse) (see 3 papers)
55% identity, 17% coverage

PAX3A_XENLA / Q645N4 Paired box protein Pax-3-A; xPax3-A; Paired-domain transcription factor Pax3-A from Xenopus laevis (African clawed frog) (see 7 papers)
55% identity, 17% coverage

LOC536229, XP_015316176 paired box protein Pax-7 from Bos taurus
62% identity, 18% coverage

V3ZQV3 Uncharacterized protein (Fragment) from Lottia gigantea
XP_009066032 hypothetical protein from Lottia gigantea
67% identity, 16% coverage

NP_001091013 paired box gene 6 from Canis lupus familiaris
66% identity, 15% coverage

NP_001231127 paired box protein Pax-6 isoform 1 from Mus musculus
NP_001231129 paired box protein Pax-6 isoform 1 from Mus musculus
66% identity, 15% coverage

PAX6_HUMAN / P26367 Paired box protein Pax-6; Aniridia type II protein; Oculorhombin from Homo sapiens (Human) (see 26 papers)
PAX6_MOUSE / P63015 Paired box protein Pax-6; Oculorhombin from Mus musculus (Mouse) (see 6 papers)
NP_001121084 paired box protein Pax-6 isoform a from Homo sapiens
NP_000271 paired box protein Pax-6 isoform a from Homo sapiens
NP_001253186 paired box protein Pax-6 from Macaca mulatta
66% identity, 15% coverage

NP_001158383 paired box 6 from Saccoglossus kowalevskii
66% identity, 14% coverage

XP_012307699 paired box protein Pax-6 isoform X2 from Aotus nancymaae
66% identity, 15% coverage

PAX6_RAT / P63016 Paired box protein Pax-6; Oculorhombin from Rattus norvegicus (Rat) (see 4 papers)
66% identity, 15% coverage

UNC4_CAEEL / P29506 Homeobox protein unc-4; Homeobox protein ceh-4; Uncoordinated protein 4 from Caenorhabditis elegans (see 8 papers)
NP_496138 Homeobox protein unc-4 from Caenorhabditis elegans
63% identity, 27% coverage

3a01F / Q06453 Crystal structure of aristaless and clawless homeodomains bound to dna (see paper)
73% identity, 70% coverage

XP_005156907 dorsal root ganglia homeobox protein isoform X1 from Danio rerio
62% identity, 22% coverage

NP_446321 paired mesoderm homeobox protein 2A from Rattus norvegicus
64% identity, 23% coverage

NP_032913 paired mesoderm homeobox protein 2A from Mus musculus
Q62066 Paired mesoderm homeobox protein 2A from Mus musculus
64% identity, 23% coverage

PHX2B_MOUSE / O35690 Paired mesoderm homeobox protein 2B; Neuroblastoma Phox; NBPhox; PHOX2B homeodomain protein; Paired-like homeobox 2B from Mus musculus (Mouse) (see paper)
PHX2B_HUMAN / Q99453 Paired mesoderm homeobox protein 2B; Neuroblastoma Phox; NBPhox; PHOX2B homeodomain protein; Paired-like homeobox 2B from Homo sapiens (Human) (see 3 papers)
NP_003915 paired mesoderm homeobox protein 2B from Homo sapiens
NP_032914 paired mesoderm homeobox protein 2B from Mus musculus
64% identity, 20% coverage

XP_005168876 paired box protein Pax-6b isoform X1 from Danio rerio
66% identity, 14% coverage

PHX2A_HUMAN / O14813 Paired mesoderm homeobox protein 2A; ARIX1 homeodomain protein; Aristaless homeobox protein homolog; Paired-like homeobox 2A from Homo sapiens (Human) (see paper)
NP_005160 paired mesoderm homeobox protein 2A isoform 1 from Homo sapiens
64% identity, 23% coverage

NP_001014693 eyeless, isoform D from Drosophila melanogaster
64% identity, 7% coverage

XP_015140956 paired mesoderm homeobox protein 2B from Gallus gallus
64% identity, 22% coverage

T1F6U6 Paired box protein Pax-6 from Helobdella robusta
62% identity, 9% coverage

PAX6_DROME / O18381 Paired box protein Pax-6; Protein eyeless from Drosophila melanogaster (Fruit fly) (see paper)
64% identity, 7% coverage

NP_996953 paired mesoderm homeobox protein 2A from Danio rerio
64% identity, 23% coverage

NP_571379 paired box protein Pax-6 from Danio rerio
66% identity, 14% coverage

T1G400 Uncharacterized protein from Helobdella robusta
66% identity, 15% coverage

PAX6_CHICK / P47237 Paired box protein Pax-6 from Gallus gallus (Chicken) (see paper)
66% identity, 30% coverage

NP_726607 eyeless, isoform B from Drosophila melanogaster
64% identity, 10% coverage

CEH17_CAEEL / G5EC89 Homeobox protein ceh-17 from Caenorhabditis elegans (see 2 papers)
61% identity, 27% coverage

DRGX_MOUSE / Q8BYH0 Dorsal root ganglia homeobox protein; Dorsal root ganglion 11; Homeobox protein DRG11; Paired-related homeobox protein-like 1 from Mus musculus (Mouse) (see 3 papers)
XP_006518475 dorsal root ganglia homeobox protein isoform X2 from Mus musculus
62% identity, 24% coverage

A6NNA5 Dorsal root ganglia homeobox protein from Homo sapiens
62% identity, 24% coverage

XP_063131133 dorsal root ganglia homeobox protein isoform X1 from Rattus norvegicus
62% identity, 24% coverage

VAB3_CAEEL / G5EDS1 Paired box protein 6 homolog; Homeobox and paired domain-containing protein vab-3; Protein male abnormal 18; Variable abnormal morphology protein 3 from Caenorhabditis elegans (see 14 papers)
NP_001024570 Paired box protein 6 homolog from Caenorhabditis elegans
66% identity, 14% coverage

A0A0B7A551 Uncharacterized protein (Fragment) from Arion vulgaris
58% identity, 16% coverage

NP_001315326 paired box protein Pax-3b from Danio rerio
54% identity, 18% coverage

UNC4_DANRE / Q50D79 Homeobox protein unc-4 homolog; Homeobox protein Uncx4.1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_001018616 homeobox protein unc-4 homolog from Danio rerio
67% identity, 13% coverage

HM08_CAEEL / Q94398 Homeobox protein ceh-8 from Caenorhabditis elegans (see paper)
58% identity, 26% coverage

DMBX1_CHICK / F1NEA7 Diencephalon/mesencephalon homeobox protein 1 from Gallus gallus (Chicken) (see 2 papers)
63% identity, 17% coverage

DMX1B_DANRE / Q566X8 Diencephalon/mesencephalon homeobox protein 1-B from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
NP_001017625 diencephalon/mesencephalon homeobox protein 1-B from Danio rerio
63% identity, 17% coverage

Q62798 Dorsal root ganglia homeobox protein from Rattus norvegicus
61% identity, 24% coverage

DMBX1_MOUSE / Q91ZK4 Diencephalon/mesencephalon homeobox protein 1; Diencephalon/mesencephalon-expressed brain homeobox gene 1 protein; Orthodenticle homolog 3; Paired-like homeobox protein DMBX1; Paired-type homeobox Atx from Mus musculus (Mouse) (see 8 papers)
XP_017175447 diencephalon/mesencephalon homeobox protein 1 isoform X3 from Mus musculus
63% identity, 17% coverage

NP_788420 homeobrain from Drosophila melanogaster
63% identity, 15% coverage

DMX1A_DANRE / Q8JI10 Diencephalon/mesencephalon homeobox protein 1-A; Paired homeobox protein 1 from Danio rerio (Zebrafish) (Brachydanio rerio) (see 2 papers)
63% identity, 16% coverage

DMBX1_HUMAN / Q8NFW5 Diencephalon/mesencephalon homeobox protein 1; Orthodenticle homolog 3; Paired-like homeobox protein DMBX1 from Homo sapiens (Human) (see 2 papers)
63% identity, 16% coverage

V4AMZ8 Uncharacterized protein (Fragment) from Lottia gigantea
62% identity, 22% coverage

NP_001290437 intestine-specific homeobox from Homo sapiens
58% identity, 29% coverage

NP_524638 twin of eyeless, isoform A from Drosophila melanogaster
62% identity, 12% coverage

Q25411 Pax6-like protein from Lineus sanguineus
66% identity, 17% coverage

LOC107439785 diencephalon/mesencephalon homeobox protein 1-A from Parasteatoda tepidariorum
60% identity, 27% coverage

ISX_MOUSE / A1A546 Intestine-specific homeobox from Mus musculus (Mouse) (see 2 papers)
NP_082113 intestine-specific homeobox isoform 2 from Mus musculus
54% identity, 32% coverage

Smp_163140 pituitary homeobox protein-related from Schistosoma mansoni
56% identity, 8% coverage

GSBN_DROME / P09083 Protein gooseberry-neuro; BSH4; Protein gooseberry proximal from Drosophila melanogaster (Fruit fly) (see paper)
NP_523862 gooseberry-neuro from Drosophila melanogaster
57% identity, 18% coverage

R7TKD0 Transcription factor Pax3/7 (Fragment) from Capitella teleta
58% identity, 27% coverage

NP_505519 Homeobox protein unc-42 from Caenorhabditis elegans
65% identity, 23% coverage

NP_723721 paired, isoform B from Drosophila melanogaster
P06601 Segmentation protein paired from Drosophila melanogaster
55% identity, 12% coverage

UNC42_CAEEL / L8E946 Homeobox protein unc-42; Uncoordinated protein 42 from Caenorhabditis elegans (see 5 papers)
65% identity, 22% coverage

NP_001290114 paired box protein Pax-7 from Meleagris gallopavo
49% identity, 14% coverage

T1FMW8 Uncharacterized protein from Helobdella robusta
62% identity, 12% coverage

T1G8F8 Uncharacterized protein from Helobdella robusta
66% identity, 22% coverage

NP_477026 reversed polarity from Drosophila melanogaster
71% identity, 8% coverage

CRX_MOUSE / O54751 Cone-rod homeobox protein from Mus musculus (Mouse) (see paper)
NP_031796 cone-rod homeobox protein isoform 1 from Mus musculus
65% identity, 19% coverage

Smp_160670 putative paired box protein pax-6 from Schistosoma mansoni
63% identity, 5% coverage

CRX_HUMAN / O43186 Cone-rod homeobox protein from Homo sapiens (Human) (see 10 papers)
NP_000545 cone-rod homeobox protein from Homo sapiens
65% identity, 19% coverage

UNC4_DROME / O77215 Homeobox protein unc-4; Paired-like homeodomain protein unc-4; DPHD-1 from Drosophila melanogaster (Fruit fly) (see paper)
62% identity, 10% coverage

CG9876 uncharacterized protein from Drosophila melanogaster
51% identity, 26% coverage

3cmyA / P23760 Structure of a homeodomain in complex with DNA (see paper)
64% identity, 67% coverage

XP_033105362 homeobox protein OTX-like isoform X1 from Anneissia japonica
57% identity, 21% coverage

ESX1_HUMAN / Q8N693 Homeobox protein ESX1; Extraembryonic, spermatogenesis, homeobox 1 from Homo sapiens (Human) (see 3 papers)
NP_703149 homeobox protein ESX1 from Homo sapiens
60% identity, 15% coverage

Q5IGV4 Homeodomain transcription factor PaxC from Nematostella vectensis
60% identity, 15% coverage

LOC122268388 visual system homeobox 2 from Parasteatoda tepidariorum
63% identity, 16% coverage

NP_523389 Ods-site homeobox from Drosophila melanogaster
62% identity, 16% coverage

PITX3_DANRE / Q6QU75 Pituitary homeobox 3; Bicoid-like homeodomain transcription factor Pitx3; Homeobox protein PITX3; Paired-like homeodomain transcription factor 3 from Danio rerio (Zebrafish) (Brachydanio rerio) (see 3 papers)
NP_991238 pituitary homeobox 3 from Danio rerio
58% identity, 22% coverage

NP_996314 Ptx1, isoform C from Drosophila melanogaster
56% identity, 12% coverage

PITX_DROME / O18400 Pituitary homeobox homolog Ptx1; D-PTX1 from Drosophila melanogaster (Fruit fly) (see paper)
56% identity, 12% coverage

NP_001082023 paired like homeodomain 3 L homeolog from Xenopus laevis
58% identity, 22% coverage

NP_001079212 pituitary homeobox 3 from Xenopus laevis
58% identity, 22% coverage

NP_001139175 paired like homeobox 2Ba from Danio rerio
56% identity, 41% coverage

1fjlA / P06601 Homeodomain from the drosophila paired protein bound to a DNA oligonucleotide (see paper)
63% identity, 69% coverage

OTXH_CAEEL / Q9U2Z0 Homeobox protein ttx-1; Abnormal thermotaxis protein 1; OTX homeobox homolog ttx-1 from Caenorhabditis elegans (see 4 papers)
56% identity, 16% coverage

PITX1_CHICK / P56673 Pituitary homeobox 1; Homeobox protein PITX1; Paired-like homeodomain transcription factor 1; cPTX1 from Gallus gallus (Chicken) (see 3 papers)
58% identity, 21% coverage

NP_001001263 homeobox protein prophet of Pit-1 from Sus scrofa
58% identity, 31% coverage

PROP1_HUMAN / O75360 Homeobox protein prophet of Pit-1; PROP-1; Pituitary-specific homeodomain factor from Homo sapiens (Human) (see 7 papers)
NP_006252 homeobox protein prophet of Pit-1 from Homo sapiens
58% identity, 31% coverage

UNC30_CAEEL / P52906 Homeobox protein unc-30; Uncoordinated protein 30 from Caenorhabditis elegans (see 4 papers)
NP_001021277 Homeobox protein unc-30 from Caenorhabditis elegans
57% identity, 20% coverage

XP_421631 pituitary homeobox 3 isoform X1 from Gallus gallus
58% identity, 22% coverage

NP_001009767 homeobox protein prophet of Pit-1 from Ovis aries
62% identity, 27% coverage

Q8MJI9 Homeobox protein prophet of Pit-1 from Bos taurus
62% identity, 27% coverage

NP_001161157 pituitary homeobox 1 from Gallus gallus
58% identity, 21% coverage

XP_015327641 homeobox protein prophet of Pit-1 isoform X1 from Bos taurus
61% identity, 27% coverage

NP_001024213 Homeobox protein ttx-1 from Caenorhabditis elegans
56% identity, 19% coverage

Smp_124010 putative homeobox protein otx from Schistosoma mansoni
53% identity, 59% coverage

GSC_DANRE / P53544 Homeobox protein goosecoid; ZGSC from Danio rerio (Zebrafish) (Brachydanio rerio) (see 2 papers)
61% identity, 27% coverage

Smp_126560 putative orthopedia homeobox protein from Schistosoma mansoni
53% identity, 6% coverage

K1QWY6 Paired box protein Pax-6 from Magallana gigas
58% identity, 17% coverage

Q63410 Homeobox protein OTX1 from Rattus norvegicus
50% identity, 19% coverage

NP_037241 homeobox protein OTX1 from Rattus norvegicus
50% identity, 19% coverage

GSC_MOUSE / Q02591 Homeobox protein goosecoid from Mus musculus (Mouse) (see 2 papers)
NP_034481 homeobox protein goosecoid from Mus musculus
61% identity, 25% coverage

GSC_HUMAN / P56915 Homeobox protein goosecoid from Homo sapiens (Human) (see paper)
NP_776248 homeobox protein goosecoid from Homo sapiens
61% identity, 25% coverage

OTX1_HUMAN / P32242 Homeobox protein OTX1; Orthodenticle homolog 1 from Homo sapiens (Human) (see paper)
NP_001186699 homeobox protein OTX1 from Homo sapiens
50% identity, 19% coverage

PITX1_HUMAN / P78337 Pituitary homeobox 1; Hindlimb-expressed homeobox protein backfoot; Homeobox protein PITX1; Paired-like homeodomain transcription factor 1 from Homo sapiens (Human) (see 4 papers)
NP_002644 pituitary homeobox 1 from Homo sapiens
58% identity, 20% coverage

O96756 DtPax-6 protein from Girardia tigrina
51% identity, 13% coverage

PITX1_MOUSE / P70314 Pituitary homeobox 1; Hindlimb-expressed homeobox protein backfoot; Homeobox protein P-OTX; Homeobox protein PITX1; Paired-like homeodomain transcription factor 1; Pituitary OTX-related factor from Mus musculus (Mouse) (see 3 papers)
XP_006517220 pituitary homeobox 1 isoform X1 from Mus musculus
58% identity, 20% coverage

XP_014951176 pituitary homeobox 1 from Ovis aries
58% identity, 20% coverage

P97458 Homeobox protein prophet of Pit-1 from Mus musculus
NP_032962 homeobox protein prophet of Pit-1 from Mus musculus
62% identity, 27% coverage

GSCB_XENLA / P53546 Homeobox protein goosecoid isoform B from Xenopus laevis (African clawed frog) (see 3 papers)
NP_001081278 homeobox protein goosecoid isoform B from Xenopus laevis
59% identity, 26% coverage

GSCA_XENLA / P29454 Homeobox protein goosecoid isoform A from Xenopus laevis (African clawed frog) (see 3 papers)
58% identity, 28% coverage

LOC107444630 homeobox protein unc-4 homolog from Parasteatoda tepidariorum
62% identity, 14% coverage

NP_001080981 pituitary homeobox 1 from Xenopus laevis
58% identity, 21% coverage

XP_012823826 homeobox protein OTX2 isoform X1 from Xenopus tropicalis
52% identity, 23% coverage

NP_957490 short stature homeobox protein 2 from Danio rerio
55% identity, 23% coverage

NP_001287047 eyegone, isoform C from Drosophila melanogaster
56% identity, 9% coverage

XP_006526827 pituitary homeobox 3 isoform X1 from Mus musculus
58% identity, 16% coverage

XP_018085948 homeobox protein goosecoid from Xenopus laevis
58% identity, 23% coverage

PITX3_MOUSE / O35160 Pituitary homeobox 3; Homeobox protein PITX3; Paired-like homeodomain transcription factor 3 from Mus musculus (Mouse) (see 6 papers)
NP_032878 pituitary homeobox 3 from Mus musculus
58% identity, 21% coverage

PITX3_RAT / P81062 Pituitary homeobox 3; Homeobox protein PITX3; Paired-like homeodomain transcription factor 3 from Rattus norvegicus (Rat) (see paper)
XP_006231540 pituitary homeobox 3 isoform X1 from Rattus norvegicus
58% identity, 21% coverage

NP_001020793 short stature homeobox protein from Canis lupus familiaris
56% identity, 23% coverage

SHOX_HUMAN / O15266 Short stature homeobox protein; Pseudoautosomal homeobox-containing osteogenic protein; Short stature homeobox-containing protein from Homo sapiens (Human) (see 4 papers)
51% identity, 25% coverage

XP_005681362 pituitary homeobox 2 isoform X2 from Capra hircus
NP_001191328 pituitary homeobox 2 isoform a from Homo sapiens
XP_051676055 pituitary homeobox 2 isoform X2 from Oryctolagus cuniculus
52% identity, 27% coverage

NP_062207 pituitary homeobox 2 isoform 2 from Rattus norvegicus
52% identity, 27% coverage

NP_006874 short stature homeobox protein isoform SHOXb from Homo sapiens
48% identity, 36% coverage

PITX3_HUMAN / O75364 Pituitary homeobox 3; Homeobox protein PITX3; Paired-like homeodomain transcription factor 3 from Homo sapiens (Human) (see 2 papers)
NP_005020 pituitary homeobox 3 from Homo sapiens
58% identity, 21% coverage

XP_020958123 homeobox protein OTX2 isoform X1 from Sus scrofa
52% identity, 22% coverage

PITX2_MOUSE / P97474 Pituitary homeobox 2; ALL1-responsive protein ARP1; BRX1 homeoprotein; Bicoid-related homeobox protein 1; Homeobox protein PITX2; Orthodenticle-like homeobox 2; Paired-like homeodomain transcription factor 2; Solurshin from Mus musculus (Mouse) (see 8 papers)
NP_035228 pituitary homeobox 2 isoform b from Mus musculus
52% identity, 23% coverage

XP_012818036 short stature homeobox protein 2 isoform X1 from Xenopus tropicalis
56% identity, 22% coverage

XP_005207658 pituitary homeobox 2 isoform X1 from Bos taurus
52% identity, 23% coverage

SHOX2_MOUSE / P70390 Short stature homeobox protein 2; Homeobox protein Og12X; OG-12; Paired family homeodomain protein Prx3 from Mus musculus (Mouse) (see paper)
P70390 glutaredoxin-dependent peroxiredoxin (EC 1.11.1.25) from Mus musculus (see paper)
56% identity, 20% coverage

XP_013834407 pituitary homeobox 2 isoform X1 from Sus scrofa
52% identity, 23% coverage

O60902 Short stature homeobox protein 2 from Homo sapiens
56% identity, 20% coverage

pax-6B / CAC85262.2 Pax-6B protein from Dugesia japonica (see paper)
54% identity, 11% coverage

NP_037160 short stature homeobox protein 2 from Rattus norvegicus
56% identity, 20% coverage

PITX2_HUMAN / Q99697 Pituitary homeobox 2; ALL1-responsive protein ARP1; Homeobox protein PITX2; Paired-like homeodomain transcription factor 2; RIEG bicoid-related homeobox transcription factor; Solurshin from Homo sapiens (Human) (see 12 papers)
52% identity, 23% coverage

XP_018087945 paired like homeodomain 2 S homeolog isoform X2 from Xenopus laevis
56% identity, 20% coverage

NP_001157150 short stature homeobox protein 2 isoform c from Homo sapiens
56% identity, 21% coverage

PITX2_XENLA / Q9PWR3 Pituitary homeobox 2; Homeobox protein PITX2; Paired-like homeodomain transcription factor 2; xPtx2 from Xenopus laevis (African clawed frog) (see 2 papers)
56% identity, 20% coverage

XP_006713790 short stature homeobox protein 2 isoform X1 from Homo sapiens
56% identity, 19% coverage

XP_018084329 homeobox protein OTX2-B isoform X1 from Xenopus laevis
52% identity, 22% coverage

XP_005167911 short stature homeobox protein isoform X1 from Danio rerio
50% identity, 24% coverage

NP_001289286 short stature homeobox protein 2 isoform 2 from Mus musculus
56% identity, 21% coverage

XP_005157364 pituitary homeobox 2 isoform X1 from Danio rerio
56% identity, 24% coverage

PAX4_HUMAN / O43316 Paired box protein Pax-4 from Homo sapiens (Human) (see 3 papers)
55% identity, 17% coverage

PITX2_DANRE / Q9W5Z2 Pituitary homeobox 2; Homeobox protein PITX2; Paired-like homeodomain transcription factor 2 from Danio rerio (Zebrafish) (Brachydanio rerio) (see paper)
56% identity, 20% coverage

MIXL1_CHICK / O73592 Homeobox protein MIXL1; Homeodomain protein MIX; cMIX; MIX1 homeobox-like protein 1; Mix.1 homeobox-like protein from Gallus gallus (Chicken) (see 2 papers)
61% identity, 28% coverage

NP_990341 pituitary homeobox 2 from Gallus gallus
56% identity, 19% coverage

PITX2_CHICK / O93385 Pituitary homeobox 2; Homeobox protein PITX2; cPITX2; Paired-like homeodomain transcription factor 2 from Gallus gallus (Chicken) (see paper)
56% identity, 19% coverage

NP_001259080 twin of eyeless, isoform C from Drosophila melanogaster
58% identity, 12% coverage

NP_001027662 Otx from Ciona intestinalis
54% identity, 14% coverage

OTX1B_DANRE / Q91994 Homeobox protein OTX1 B; zOtx1; Orthodenticle homolog 1 B from Danio rerio (Zebrafish) (Brachydanio rerio) (see 3 papers)
52% identity, 21% coverage

NP_571325 homeobox protein OTX1 B from Danio rerio
52% identity, 21% coverage

XP_065401022 paired mesoderm homeobox protein 2B from Macaca fascicularis
44% identity, 19% coverage

LOC109470978 homeobox protein goosecoid-like from Branchiostoma belcheri
61% identity, 23% coverage

P32115 Paired box protein Pax-4 from Mus musculus
55% identity, 17% coverage

XP_014952574 homeobox protein OTX2 isoform X2 from Ovis aries
56% identity, 20% coverage

XP_015142758 homeobox protein OTX2 isoform X1 from Gallus gallus
56% identity, 20% coverage

NP_851848 orthodenticle homolog 5 from Danio rerio
58% identity, 20% coverage

NP_001153398 paired box protein Pax-4 isoform 3 from Mus musculus
55% identity, 18% coverage

New Search

For advice on how to use these tools together, see Interactive tools for functional annotation of bacterial genomes.

Statistics

The PaperBLAST database links 798,070 different protein sequences to 1,261,478 scientific articles. Searches against EuropePMC were last performed on May 12 2025.

How It Works

PaperBLAST builds a database of protein sequences that are linked to scientific articles. These links come from automated text searches against the articles in EuropePMC and from manually-curated information from GeneRIF, UniProtKB/Swiss-Prot, BRENDA, CAZy (as made available by dbCAN), BioLiP, CharProtDB, MetaCyc, EcoCyc, TCDB, REBASE, the Fitness Browser, and a subset of the European Nucleotide Archive with the /experiment tag. Given this database and a protein sequence query, PaperBLAST uses protein-protein BLAST to find similar sequences with E < 0.001.

To build the database, we query EuropePMC with locus tags, with RefSeq protein identifiers, and with UniProt accessions. We obtain the locus tags from RefSeq or from MicrobesOnline. We use queries of the form "locus_tag AND genus_name" to try to ensure that the paper is actually discussing that gene. Because EuropePMC indexes most recent biomedical papers, even if they are not open access, some of the links may be to papers that you cannot read or that our computers cannot read. We query each of these identifiers that appears in the open access part of EuropePMC, as well as every locus tag that appears in the 500 most-referenced genomes, so that a gene may appear in the PaperBLAST results even though none of the papers that mention it are open access. We also incorporate text-mined links from EuropePMC that link open access articles to UniProt or RefSeq identifiers. (This yields some additional links because EuropePMC uses different heuristics for their text mining than we do.)

For every article that mentions a locus tag, a RefSeq protein identifier, or a UniProt accession, we try to select one or two snippets of text that refer to the protein. If we cannot get access to the full text, we try to select a snippet from the abstract, but unfortunately, unique identifiers such as locus tags are rarely provided in abstracts.

PaperBLAST also incorporates manually-curated protein functions:

Except for GeneRIF and ENA, the curated entries include a short curated description of the protein's function. For entries from BioLiP, the protein's function may not be known beyond binding to the ligand. Many of these entries also link to articles in PubMed.

For more information see the PaperBLAST paper (mSystems 2017) or the code. You can download PaperBLAST's database here.

Changes to PaperBLAST since the paper was written:

Many of these changes are described in Interactive tools for functional annotation of bacterial genomes.

Secrets

PaperBLAST cannot provide snippets for many of the papers that are published in non-open-access journals. This limitation applies even if the paper is marked as "free" on the publisher's web site and is available in PubmedCentral or EuropePMC. If a journal that you publish in is marked as "secret," please consider publishing elsewhere.

Omissions from the PaperBLAST Database

Many important articles are missing from PaperBLAST, either because the article's full text is not in EuropePMC (as for many older articles), or because the paper does not mention a protein identifier such as a locus tag, or because of PaperBLAST's heuristics. If you notice an article that characterizes a protein's function but is missing from PaperBLAST, please notify the curators at UniProt or add an entry to GeneRIF. Entries in either of these databases will eventually be incorporated into PaperBLAST. Note that to add an entry to UniProt, you will need to find the UniProt identifier for the protein. If the protein is not already in UniProt, you can ask them to create an entry. To add an entry to GeneRIF, you will need an NCBI Gene identifier, but unfortunately many prokaryotic proteins in RefSeq do not have corresponding Gene identifers.

References

PaperBLAST: Text-mining papers for information about homologs.
M. N. Price and A. P. Arkin (2017). mSystems, 10.1128/mSystems.00039-17.

Europe PMC in 2017.
M. Levchenko et al (2017). Nucleic Acids Research, 10.1093/nar/gkx1005.

Gene indexing: characterization and analysis of NLM's GeneRIFs.
J. A. Mitchell et al (2003). AMIA Annu Symp Proc 2003:460-464.

UniProt: the universal protein knowledgebase.
The UniProt Consortium (2016). Nucleic Acids Research, 10.1093/nar/gkw1099.

BRENDA in 2017: new perspectives and new tools in BRENDA.
S. Placzek et al (2017). Nucleic Acids Research, 10.1093/nar/gkw952.

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12.
I. M. Keeseler et al (2016). Nucleic Acids Research, 10.1093/nar/gkw1003.

The MetaCyc database of metabolic pathways and enzymes.
R. Caspi et al (2018). Nucleic Acids Research, 10.1093/nar/gkx935.

CharProtDB: a database of experimentally characterized protein annotations.
R. Madupu et al (2012). Nucleic Acids Research, 10.1093/nar/gkr1133.

The carbohydrate-active enzymes database (CAZy) in 2013.
V. Lombard et al (2014). Nucleic Acids Research, 10.1093/nar/gkt1178.

The Transporter Classification Database (TCDB): recent advances
M. H. Saier, Jr. et al (2016). Nucleic Acids Research, 10.1093/nar/gkv1103.

REBASE - a database for DNA restriction and modification: enzymes, genes and genomes.
R. J. Roberts et al (2015). Nucleic Acids Research, 10.1093/nar/gku1046.

Deep annotation of protein function across diverse bacteria from mutant phenotypes.
M. N. Price et al (2016). bioRxiv, 10.1101/072470.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory