SitesBLAST – Find functional sites

 

SitesBLAST

Comparing 3609295 FitnessBrowser__Dino:3609295 to proteins with known functional sites using BLASTp with E ≤ 0.001.


Found 12 hits to proteins with known functional sites

Q5SKW9 Glycine cleavage system H protein from Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8) (see paper)
50% identity, 96% coverage: 5:121/122 of query aligns to 8:124/128 of Q5SKW9

query   YYTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                                                          |
Q5SKW9  FYTKTHEWALPEGDTVLVGITDYAQDALGDVVYVELPEVGRVVEKGEAVAVVESVKTASDIYAPVAGEIVEVNLALEKTPELVNQDPYGEGWIFRLKPRDMGDLDELLDAGGYQEVL
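
The coverage figure in each hit header appears to follow from the query range: the aligned query span divided by the query length. The page does not state the formula, so the following is a minimal Python sketch under that assumption, with a hypothetical helper name `query_coverage`:

```python
import re

def query_coverage(header: str) -> int:
    """Return percent query coverage from text like '5:121/122 of query'.

    Assumes coverage = aligned query span / query length. This is an
    inference from the numbers on the page, not a documented formula.
    """
    m = re.search(r"(\d+):(\d+)/(\d+) of query", header)
    if m is None:
        raise ValueError("no query range found")
    start, end, length = map(int, m.groups())
    # Ranges are treated as 1-based and inclusive, hence the +1.
    return round(100 * (end - start + 1) / length)

print(query_coverage("5:121/122 of query aligns to 8:124/128 of Q5SKW9"))
```

For the Q5SKW9 hit this gives 117 aligned query residues out of 122, which rounds to the 96% shown in the header.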

Q9HDV9 Putative glycine cleavage system H protein, mitochondrial; Glycine decarboxylase complex subunit H from Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast) (see paper)
50% identity, 98% coverage: 3:121/122 of query aligns to 44:162/169 of Q9HDV9

query   TTYYTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVK-GDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                                                                                           x
Q9HDV9  TKHFTKEHEWVKVDGDVGTVGITSYAANALGEVVFVEL-PEPETTVSVGDGIGAVESVKSASDVYSPVSGTVTSINESLGDSPDKVSSSPEEEGWICKIKLSSPDELKSLLNDESYAQFC

3ab9A Crystal structure of lipoylated E. coli H-protein (reduced form) (see paper)
51% identity, 95% coverage: 6:121/122 of query aligns to 8:124/127 of 3ab9A

query   YTDDHEWIEVE-DDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                                     x|
3ab9A   YSKEHEWLRKEADGTYTVGITEHAQELLGDMVFVDLPEVGATVSAGDDCAVAESVKAASDIYAPVSGEIVAVNDALSDSPELVNSEPYAGGWIFKIKASDESELESLLDATAYEALL
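
In the alignments on this page, a `|` in the sites row coincides with a column where the query has the same residue as the hit at a functional site, and an `x` with a column where the two differ. A minimal Python sketch of mapping marked columns back to sequence coordinates, using columns 31 to 38 of the 3ab9A alignment. The function name `marked_sites` is hypothetical, and the start positions 35 (query) and 38 (3ab9A) are assumptions derived from the ranges in the hit header and the single query gap before this stretch:

```python
def marked_sites(query, sites, subject, q_start, s_start):
    """List (query_pos, query_res, subject_pos, subject_res, marker)
    for every column that carries a '|' or 'x' marker.

    Positions are 1-based. A '-' is a gap and does not advance the
    position counter of the sequence that contains it.
    """
    out = []
    q_pos, s_pos = q_start, s_start
    for q, mark, s in zip(query, sites, subject):
        if mark in "|x":
            out.append((q_pos if q != "-" else None, q,
                        s_pos if s != "-" else None, s, mark))
        if q != "-":
            q_pos += 1
        if s != "-":
            s_pos += 1
    return out

rows = marked_sites("VVFIELQP", "    x|  ", "MVFVDLPE", q_start=35, s_start=38)
for row in rows:
    print(row)
```

Under these assumptions the two marked columns are query E39 against D42 of 3ab9A (`x`) and query L40 against L43 (`|`).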

P16048 Glycine cleavage system H protein, mitochondrial from Pisum sativum (Garden pea) (Lathyrus oleraceus) (see paper)
46% identity, 95% coverage: 6:121/122 of query aligns to 43:158/165 of P16048

query   YTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                                                         |
P16048  YAPSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVKATSDVNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELESLLGAKEYTKFC

1htpA Refined structures at 2 angstroms and 2.2 angstroms of the two forms of the H-protein, a lipoamide-containing protein of the glycine decarboxylase complex (see paper)
46% identity, 95% coverage: 6:121/122 of query aligns to 9:124/131 of 1htpA

query   YTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites      x||                   x                            |
1htpA   YAPSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVKATSDVNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELESLLGAKEYTKFC

Sites not aligning to the query:

1hpcB Refined structures at 2 angstroms and 2.2 angstroms of the two forms of the H-protein, a lipoamide-containing protein of the glycine decarboxylase (see paper)
46% identity, 95% coverage: 6:121/122 of query aligns to 9:124/131 of 1hpcB

query   YTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                            x                            |
1hpcB   YAPSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVKATSDVNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELESLLGAKEYTKFC

1hpcA Refined structures at 2 angstroms and 2.2 angstroms of the two forms of the H-protein, a lipoamide-containing protein of the glycine decarboxylase (see paper)
46% identity, 95% coverage: 6:121/122 of query aligns to 9:124/131 of 1hpcA

query   YTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                            x                            |
1hpcA   YAPSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVKATSDVNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELESLLGAKEYTKFC

1dxmA Reduced form of the H protein from glycine decarboxylase complex (see paper)
46% identity, 95% coverage: 6:121/122 of query aligns to 9:124/131 of 1dxmA

query   YTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                            x|                           |
1dxmA   YAPSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVKATSDVNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELESLLGAKEYTKFC

Sites not aligning to the query:

P23434 Glycine cleavage system H protein, mitochondrial; Lipoic acid-containing protein from Homo sapiens (Human) (see 4 papers)
43% identity, 95% coverage: 6:121/122 of query aligns to 53:168/173 of P23434

query   YTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites       |               x  x|x||x||xxx|xxx|xxxx|x||x|xx|||||||xxxx|xx||xx|x|x||x|x|xxx|xxxxxxx||x|xx||x|x||x||xxx|x|xxx|
P23434  FTEKHEWVTTENGIGTVGISNFAQEALGDVVYCSLPEVGTKLNKQDEFGALESVKAASELYSPLSGEVTEINEALAENPGLVNKSCYEDGWLIKMTLSNPSELDELMSEEAYEKYI

Sites not aligning to the query:

P11183 Glycine cleavage system H protein, mitochondrial; Lipoic acid-containing protein from Gallus gallus (Chicken) (see paper)
43% identity, 95% coverage: 6:121/122 of query aligns to 44:159/164 of P11183

query   YTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                                                         |
P11183  FTDKHEWISVENGIGTVGISNFAQEALGDVVYCSLPEIGTKLNKDDEFGALESVKAASELYSPLTGEVTDINAALADNPGLVNKSCYQDGWLIKMTVEKPAELDELMSEDAYEKYI

Sites not aligning to the query:

P20821 Glycine cleavage system H protein, mitochondrial; Lipoic acid-containing protein from Bos taurus (Bovine) (see 3 papers)
41% identity, 95% coverage: 6:121/122 of query aligns to 53:168/173 of P20821

query   YTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAALI
sites                                                         |
P20821  FTEKHEWVTTENGVGTVGISNFAQEALGDVVYCSLPEVGTKLNKQEEFGALESVKAASELYSPLSGEVTEINKALAENPGLVNKSCYEDGWLIKMTFSNPSELDELMSEEAYEKYI

Sites not aligning to the query:

A0A0H3JT43 Glycine cleavage system H-like protein; GcvH-L from Staphylococcus aureus (strain Mu50 / ATCC 700699) (see paper)
33% identity, 75% coverage: 9:100/122 of query aligns to 6:96/110 of A0A0H3JT43

query       DHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVKAASDIFAPVTGEILEANAALVETPAELN-EDPEGNSWLYKI
sites                                                       |  |
A0A0H3JT43  NYLWVEKVGDLYVFSMTPELQDDIGTVGYVEFVSPDEVKV-DDEIVSIEASKTVIDVQTPLSGTIIERNTKAEEEPTILNSEKPEEN-WLFKL

Query Sequence

>3609295 FitnessBrowser__Dino:3609295
MPTTYYTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVK
AASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAAL
IG
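
The query ranges in the hit headers appear to be 1-based and inclusive, so they can be checked against this sequence directly. A minimal Python sketch, with hypothetical helper names `read_fasta` and `subsequence`:

```python
def read_fasta(text):
    """Parse a single FASTA record and return (header, sequence)."""
    lines = [ln.strip() for ln in text.strip().splitlines()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line")
    return lines[0][1:], "".join(lines[1:])

def subsequence(seq, start, end):
    """Return residues start..end, treating both ends as 1-based and inclusive."""
    return seq[start - 1:end]

record = """>3609295 FitnessBrowser__Dino:3609295
MPTTYYTDDHEWIEVEDDTATIGITKHAAEQLGEVVFIELQPEGETFVKGDEIGVVESVK
AASDIFAPVTGEILEANAALVETPAELNEDPEGNSWLYKIKLSDPGELSELLDAEGYAAL
IG
"""
header, seq = read_fasta(record)
print(len(seq))                  # 122, the query length used in the hit headers
print(subsequence(seq, 5, 12))   # first residues of the 5:121 range
print(subsequence(seq, 60, 60))  # K, the residue marked '|' in several hits
```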


SitesBLAST's Database

SitesBLAST's database includes (1) SwissProt entries with experimentally-supported functional features; and (2) protein structures with bound ligands, from the BioLip database.

by Morgan Price, Arkin group
Lawrence Berkeley National Laboratory