DiscoTope: Structure-based Antibody Prediction |
DiscoTope is a method for predicting discontinuous epitopes from 3D structures of proteins in PDB format (see Reference at the end of the tutorial).
Step1- Enter the structure either by entering its PDB id (Each structure in the PDB is represented by a 4 character identifier of the form [0-9][a-z,0-9][a-z,0-9][a-z,0-9][a-z,0-9]. For example, 4HHB, 9INS are identification codes for PDB entries for hemoglobin and insulin) Or, by uploading a Structure File in PDB format from your computer.
Step2- Enter the Chain id for the protein chain of interest.
Step3 -Click on submit.
On the following screen, you will be able to see the predicted discontinuous epitopes as a chart of DiscoTope score vs. residue id. Here, you can change the threshold for predictions, view results in table or 3D viewer, save the prediction result or start with a new prediction.
Chart view
To change the threshold value, enter a different threshold and click on 'Change'. The default value is -7.7, which corresponds to a specificity of 75%. Higher values correspond to higher specificity. A specificity of 0.75 means that 25% of the nonepitope residues were predicted as part of epitopes. A sensitivity of 0.47 means that 47% of the epitope residues were predicted as part of epitopes.
In a benchmark containing more than 75 antigen/antibody complexes, the following relations were found:
| Score | Sensitivity | Specificity |
|---|---|---|
| >-3.1 | 0.16 | 0.95 |
| >-4.7 | 0.24 | 0.90 |
| >-6.0 | 0.32 | 0.85 |
| >-6.9 | 0.40 | 0.80 |
| >-7.7 | 0.47 | 0.75 |
In the chart, predictions above the threshold (red line) are positive predictions (displayed in green) and predictions below the threshold are negative prediction (displayed in orange).
Table view
Table view lists following columns:
Chain id: The chain id of the protein chain used in prediction. (specified by the user)
Residue id: PDB Residue id
Residue Name: Name of the residue
Contact Number: The residue contact number is the number of Cα atoms in the antigen within a distance of 10 Å of the residue's Cα atom. A low contact number correlates with localization of the residue close to the surface or in protruding regions of the antigen's structures.
Propensity Score: This score tells you about the probability/tendency of being part of an epitope for that particular residue. The propensity is reflected in amino acid epitope log-odds ratios, which were calculated on a set of 75 antigens. The propensity score is calculated by sequentially averaging epitope log-odds ratios within a window of 9 residues. Then the scores are summed up based on the proximity in the 3D structure of the antigen. For any given residue, the sequentially averaged log-odds scores from all residues within 10Å are summed to gives the propensity score.
DiscoTope Score: This score is calculated by combining the contact numbers with propensity score. DiscoTope score above the threshold value indicates positive predictions and that below the threshold value indicates negative predictions.
Positive predictions are displayed in red.
3D view
The 3d view uses Jmol to display the structure with positive predictions highlighted in yellow. The side chain of each predicted residue is shown. You can rotate, zoom and manipulate the structure by using different buttons on the mouse. The table lists the predicted epitope residues along with their chain id, residue id, contact number, propensity score and DiscoTope score. Clicking on the CPK button in each residue will highlight this residue in CPK on the 3D viewer.