The Kabat numbering scheme is a widely adopted standard for numbering the residues in an antibody in a consistent manner. However the scheme does have its problems!
First, since the numbering scheme was developed from (fairly limited) sequence data, the position at which insertions occur in CDR-L1 and CDR-H1 does not match the structural insertion position. Thus topologically equivalent residues in these loops do not get the same number.
Second, the numbering adopts a rigid specification. For example in the potentially very long CDR-H3, insertions are numbered between residue H100 and H101 with letters up to K (i.e. H100, H100A ... H100K, H101). If there are more residues than that, there is no standard way of numbering them. Such situations occur at other positions too.
The numbering throughout the chains follows.
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 27A 27B 27C 27D 27E 27F 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 95A 95B 95C 95D 95E 95F 96 97 98 99 100 101 102 103 104 105 106 106A 107 108 109
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 35A 35B 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 52A 52B 52C 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 82A 82B 82C 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 100A 100B 100C 100D 100E 100F 100G 100H 100I 100J 100K 101 102 103 104 105 106 107 108 109 110 111 112 113
The pre-1989 Chothia numbering (the structurally correct version) throughout the chains follows.
Note That in their latest paper (Al-Lazikani et al., (1997) JMB 273,927-948), Chothia's group returns to using residue 30 as the insertion site in CDR-L1!
0 1 2 3 4 5 6 7 8 9
10 11 12 13 14 15 16 17 18 19
20 21 22 23 24 25 26 27 28 29
30
30A 30B 30C 30D 30E 30F
31 32 33 34 35 36 37 38 39
40 41 42 43 44 45 46 47 48 49
50 51 52 53 54 55 56 57 58 59
60 61 62 63 64 65 66 67 68 69
70 71 72 73 74 75 76 77 78 79
80 81 82 83 84 85 86 87 88 89
90 91 92 93 94 95
95A 95B 95C 95D 95E 95F 96 97 98 99
100 101 102 103 104 105 106
106A 107 108 109
0 1 2 3 4 5 6 7 8 9
10 11 12 13 14 15 16 17 18 19
20 21 22 23 24 25 26 27 28 29
30 31
31A 31B
32 33 34 35 36 37 38 39
40 41 42 43 44 45 46 47 48 49
50 51 52
52A 52B 52C 53 54 55 56 57 58 59
60 61 62 63 64 65 66 67 68 69
70 71 72 73 74 75 76 77 78 79
80 81 82
82A 82B 82C 83 84 85 86 87 88 89
90 91 92 93 94 95 96 97 98 99
100
100A 100B 100C 100D 100E 100F 100G 100H 100I 100J
100K 101 102 103 104 105 106 107 108 109
110 111 112 113
A number if definitions of the CDRs are commonly in use:
Loop Kabat AbM Chothia Contact
---- ----- --- ------- -------
L1 L24 -- L34 L24 -- L34 L24 -- L34 L30 -- L36
L2 L50 -- L56 L50 -- L56 L50 -- L56 L46 -- L55
L3 L89 -- L97 L89 -- L97 L89 -- L97 L89 -- L96
H1 H31 -- H35B H26 -- H35B H26 -- H32..34 H30 -- H35B
(Kabat Numbering)
H1 H31 -- H35 H26 -- H35 H26 -- H32 H30 -- H35
(Chothia Numbering)
H2 H50 -- H65 H50 -- H58 H52 -- H56 H47 -- H58
H3 H95 -- H102 H95 -- H102 H95 -- H102 H93 -- H101
Note that the end of the Chothia CDR-H1 loop when numbered using the Kabat numbering convention varies between H32 and H34 depending on the length of the loop. (This is because the Kabat numbering scheme places the insertions at H35A and H35B.)
This diagram illustrates the alternative definitions for CDR-H1. The Kabat and Chothia numbering schemes are shown horizontally and the Kabat, Chothia, AbM and Contact definitions of the CDRs are shown with arrows above and below the two numbering schemes.
Following an analysis of the contacts between antibody and antigen in the complex structures available in the Protein Databank, we have generated a set of mean contact data. The full method by which these results were obtained is described in the following paper: MacCallum, R. M., Martin, A. C. R. and Thornton, J. T. Antibody-antigen interactions: Contact analysis and binding site topography. J. Mol. Biol. 262, 732-745.
Briefly, we have analysed the number of contacts made at each position, defining contact as burial by > 1 square Angstrom change in solvent accessibility. These data give a simple measure of how likely a residue is to be involved in antigen contact.
Second, we have calculated the mean percentage burial over the accessible residues.
Click here for an image showing a composite combining site containing all CDR conformations coloured by contact propensity.
The table presents the chain name, residue number (N.B. This is pre-1989 Chothia Numbering), the number of contacts and the mean percent burial. The data are available by clicking here.
An alternative simplified view is presented as a list of CDR residues making contact in each antibody with summary data for each CDR.
Start - Approx residue 24
Residue before is always a Cys
Residue after is always a Trp. Typically TRP-TYR-GLN, but also,
TRP-LEU-GLN, TRP-PHE-GLN, TRP-TYR-LEU
Length 10 to 17 residues
Start - always 16 residues after the end of L1
Residues before generally ILE-TYR, but also, VAL-TYR, ILE-LYS, ILE-PHE
Length always 7 residues (except 7FAB which has a deletion in this region)
Start - always 33 residues after end of L2 (except 7FAB which has the
deletion at the end of CDR-L2)
Residue before is always Cys
Residues after always PHE-GLY-XXX-GLY
Length 7 to 11 residues
Start - Approx residue 26 (always 4 after a CYS) [Chothia / AbM defintion]
Kabat definition starts 5 residues later
Residues before always CYS-XXX-XXX-XXX
Residues after always a TRP. Typically TRP-VAL, but also,
TRP-ILE, TRP-ALA
Length 10 to 12 residues (AbM definition)
Chothia definition excludes the last 4 residues
Start - always 15 residues after the end of Kabat / AbM definition) of CDR-H1
Residues before typically LEU-GLU-TRP-ILE-GLY, but a number of variations
Residues after LYS/ARG-LEU/ILE/VAL/PHE/THR/ALA-THR/SER/ILE/ALA
Length Kabat definition 16 to 19 residues (AbM definition ends 7 residues
earlier)
Start - always 33 residues after end of CDR-H2 (always 2 after a CYS)
Residues before always CYS-XXX-XXX (typically CYS-ALA-ARG)
Residues after always TRP-GLY-XXX-GLY
Length 3 to 25(!) residues