IUPAC Sequence Codes

The nucleic acid codes supported are:

 

The amino acid codes supported are:

Nucleic Acid Code Meaning Amino Acid Code Meaning
A
Adenosine
A
Alanine
C
Cytidine
B
Aspartic acid or Asparagine
G
Guanine
C
Cysteine
T
Thymidine
D
Aspartic acid
U
Uracil
E
Glutamic acid
R
G A (puRine)
F
Phenylalanine
Y
T C (pYrimidine)
G
Glycine
K
G T (Ketone)
H
Histidine
M
A C (aMino group)
I
Isoleucine
S
G C (Strong interaction)
K
Lysine
W
A T (Weak interaction)
L
Leucine
B
G T C (not A) (B comes after A)
M
Methionine
D
G A T (not C) (D comes after C)
N
Asparagine
H
A C T (not G) (H comes after G)
P
Proline
V
G C A (not T, not U) (V comes after U)
Q
Glutamine
N
A G C T (aNy)
R
Arginine
X
masked
S
Serine
-
gap of indeterminate length
T
Threonine
   
U
Selenocysteine
   
V
Valine
   
W
Tryptophan
   
Y
Tyrosine
   
Z
Glutamic acid or Glutamine
   
X
any
   
*
translation stop
   
-
gap of indeterminate length