This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| tutorials:workshop [2017/10/24 08:57] – Add structure scan picture and result link sayoni | tutorials:workshop [2019/06/19 15:02] (current) – sillitoe | ||
|---|---|---|---|
| Line 42: | Line 42: | ||
| **You are going to look at how the CATH database can help us in identifying the function of a particular protein structure.** | **You are going to look at how the CATH database can help us in identifying the function of a particular protein structure.** | ||
| - | The PDB structure, 4i6g, is an X-ray crystallography-solved structure for which the function has yet to be determined. However, this can be inferred by comparing the protein with other proteins of known function. | + | The PDB structure, 4i6g, is an X-ray crystallography-solved structure for which the function has yet to be determined. However, this can be inferred by comparing the protein with other proteins of known function. |
| The CATHEDRAL server can be found **[[http:// | The CATHEDRAL server can be found **[[http:// | ||
| Line 52: | Line 52: | ||
| {{ : | {{ : | ||
| - | Each chain of the PDB can be submitted for structural scans separately. Submit **chain A** of the uploaded PDB to the structural scan by clicking on ' | + | Each chain of the PDB can be submitted for structural scans separately. Submit **chain A** of the uploaded PDB to the structural scan by clicking on ' |
| A total of 528 matching structures in CATH v4.1 have been found, with scores ranging from very good (in green) through to very poor (in red). | A total of 528 matching structures in CATH v4.1 have been found, with scores ranging from very good (in green) through to very poor (in red). | ||
| Line 72: | Line 72: | ||
| If you wish to explore other structural domains within a given S35 cluster, clicking on 'Show related domains' | If you wish to explore other structural domains within a given S35 cluster, clicking on 'Show related domains' | ||
| - | Please also visit the link to PDBsum on the domain page. PDBsum is a resource that stores information about all the protein files deposited in the PDB to learn more about the structural and functional characteristics of these domains. | + | Please also visit the link to PDBsum on the domain page. **PDBsum** is a resource that stores information about all the protein files deposited in the PDB to learn more about the structural and functional characteristics of these domains. |
| ==== The HUP Superfamily ==== | ==== The HUP Superfamily ==== | ||
| Line 78: | Line 78: | ||
| We are now going to look more closely at the CATH superfamily in which 1dnpA01 is classified. This is the HUP domain superfamily (CATH code 3.40.50.620), | We are now going to look more closely at the CATH superfamily in which 1dnpA01 is classified. This is the HUP domain superfamily (CATH code 3.40.50.620), | ||
| - | The CATH webpage for the HUP superfamily can be accessed [[http:// | + | The CATH webpage for the HUP superfamily can be accessed |
| {{: | {{: | ||
| Line 84: | Line 84: | ||
| Section 1 is a menu that you click on to navigate the site. From here you can explore the structural and functional features of the superfamily, | Section 1 is a menu that you click on to navigate the site. From here you can explore the structural and functional features of the superfamily, | ||
| - | A concise summary for the superfamily in the form of some useful statistics can be seen in section 9. It gives information on, for example, the number of domains, structural clusters and functional terms. For the HUP superfamily, | + | A concise summary for the superfamily in the form of some useful statistics can be seen in section 9. It gives information on, for example, the number of domains, structural clusters and functional terms. For the HUP superfamily, |
| - | An indication of just how structurally diverse the HUP family is shown in section 6. Here, you can scroll | + | An indication of just how structurally diverse the HUP family is shown in section 6. Here, you can scroll |
| The box below shows a 3D structural superposition between the smallest (2pfsA01) and largest domain (1wkbA01) displayed using the program Jmol. What you see initially is a wireframe representation of the superposition, | The box below shows a 3D structural superposition between the smallest (2pfsA01) and largest domain (1wkbA01) displayed using the program Jmol. What you see initially is a wireframe representation of the superposition, | ||
| Line 105: | Line 105: | ||
| === Investigating the Structural and Functional diversity within the HUP Superfamily using CATH === | === Investigating the Structural and Functional diversity within the HUP Superfamily using CATH === | ||
| - | This brings us into the next part of this tutorial in which we are going to explore the structural and functional diversity of the HUP superfamily using CATH. The structure and function of a protein | + | This brings us to the next part of this tutorial in which we are going to explore the structural and functional diversity of the HUP superfamily using CATH. The structure and function of a protein |
| {{tutorials: | {{tutorials: | ||
| Line 111: | Line 111: | ||
| The HUP superfamily is known to be particularly functionally diverse. Here, we concentrate our efforts on looking at two domains | The HUP superfamily is known to be particularly functionally diverse. Here, we concentrate our efforts on looking at two domains | ||
| - | [[http:// | + | **[[http:// |
| {{tutorials: | {{tutorials: | ||
| Line 118: | Line 118: | ||
| // | // | ||
| - | There is a link at the bottom of the page to an overview | + | There is a link at the bottom of the page to an overview |
| level (i.e. different EC3 numbers), which is suggestive of changes in chemistry throughout this superfamily (click [[http:// | level (i.e. different EC3 numbers), which is suggestive of changes in chemistry throughout this superfamily (click [[http:// | ||
| Line 124: | Line 124: | ||
| If you then go back to the list of MACiE entries and click on the entries for our example domains (M0299, Pantothenate synthetase, EC 6.3.2.1 for 1od6A00 and M0235, Arginyl-tRNA synthetase, EC 6.1.1.19 for 1f7uA01), you can see the overall reactions for these enzymes. It can be seen that both 1od6A00 and 1f7uA01 are ligases, but they have different substrates and form different products. | If you then go back to the list of MACiE entries and click on the entries for our example domains (M0299, Pantothenate synthetase, EC 6.3.2.1 for 1od6A00 and M0235, Arginyl-tRNA synthetase, EC 6.1.1.19 for 1f7uA01), you can see the overall reactions for these enzymes. It can be seen that both 1od6A00 and 1f7uA01 are ligases, but they have different substrates and form different products. | ||
| - | It is clear from these results that the HUP superfamily is associated with a significant number of different enzyme reaction mechanisms. There are a number of possible reasons for this functional diversity. To explore how these enzymes may have evolved different functions, we can look for structural changes within the family. Here, we compare the structures of our two HUP domain examples using our in-house structural comparison algorithm called SSAP. | + | It is clear from these results that the HUP superfamily is associated with a significant number of different enzyme reaction mechanisms. There are a number of possible reasons for this functional diversity. To explore how these enzymes may have evolved different functions, we can look for structural changes within the family. Here, we compare the structures of our two HUP domain examples using our in-house structural comparison algorithm called |
| Whilst the CATHEDRAL algorithm you used at the beginning of the tutorial is fast and allows you to search all structures in CATH, SSAP is a slower and slightly more accurate method for comparing two protein structures. | Whilst the CATHEDRAL algorithm you used at the beginning of the tutorial is fast and allows you to search all structures in CATH, SSAP is a slower and slightly more accurate method for comparing two protein structures. | ||
| - | SSAP takes two structures and calculates how similar they are in structure, residue-by-residue. Similarity is measured by the SSAP score. This score ranges from 0 to 100; a score of 100 would indicate that the two structures were effectively identical. Please click [[http:// | + | SSAP takes two structures and calculates how similar they are in structure, residue-by-residue. Similarity is measured by the SSAP score. This score ranges from 0 to 100; a score of 100 would indicate that the two structures were effectively identical. Please click **[[http:// |
| From this superposition we can see that the two domains are significantly different in structure. This structural divergence is also clearly highlighted by their SSAP score of 58.77 and an RMSD of 8.15Å. | From this superposition we can see that the two domains are significantly different in structure. This structural divergence is also clearly highlighted by their SSAP score of 58.77 and an RMSD of 8.15Å. | ||
| Line 140: | Line 140: | ||
| The superposition shows that, although there is a structural core common to both structures, 1f7uA01 has some considerable structural embellishments not seen in 1od6A00. There are also noticeable shifts in the positions of the catalytic site residues. | The superposition shows that, although there is a structural core common to both structures, 1f7uA01 has some considerable structural embellishments not seen in 1od6A00. There are also noticeable shifts in the positions of the catalytic site residues. | ||
| - | 2DSEC ([[http:// | + | **2DSEC** ([[http:// |
| The 2DSEC plot for the HUP examples 1f7uA01 and 1od6A00 is shown below: | The 2DSEC plot for the HUP examples 1f7uA01 and 1od6A00 is shown below: | ||
| Line 148: | Line 148: | ||
| The 2DSEC plot confirms the findings of the SSAP superposition; | The 2DSEC plot confirms the findings of the SSAP superposition; | ||
| - | Recruitment of different domain partners can also result in changes in protein function. There is a link to a third party application called Archschema ([[http:// | + | Recruitment of different domain partners can also result in changes in protein function. There is a link to a third party application called |
| {{: | {{: | ||
| Line 155: | Line 155: | ||
| Now that we have an idea of the scale of the number of domain partners associated with the family as a whole, we will now return to comparing our two HUP examples using a different resource. Gene3D assigns CATH domains to genes and annotates them with functional and structural information. We are going to use Gene3D to compare the MDAs of our examples. Multi-chain architectures show all the domains contained within a protein chain. | Now that we have an idea of the scale of the number of domain partners associated with the family as a whole, we will now return to comparing our two HUP examples using a different resource. Gene3D assigns CATH domains to genes and annotates them with functional and structural information. We are going to use Gene3D to compare the MDAs of our examples. Multi-chain architectures show all the domains contained within a protein chain. | ||
| - | + | Next, go to the Gene3D v14 website protein search page [[http:// | |
| - | + | ||
| - | Next go to the Gene3D v14 website protein search page [[http:// | + | |
| Line 188: | Line 186: | ||
| Clicking on the ' | Clicking on the ' | ||
| - | For this tutorial, we are most interested in comparing the reaction mechanisms associated with two relatives having different functions. For example, 1h7oA00, Aminolevulinate dehydratase (EC 4.2.1.24) and 1d3gA00, Dihydroorotate oxidase (EC 1.3.3.1). Have a look for the reaction mechanisms corresponding to these ECs in the Catalytic Machinery Similarities table and draw your own conclusion. For more information on this comparison, click on the link within the table. This takes you to a page that compares the two reaction mechanisms side by side. | + | For this tutorial, we are most interested in comparing the reaction mechanisms associated with two relatives having different functions. For example, |
| So, how are these changes in mechanisms mediated? | So, how are these changes in mechanisms mediated? | ||
| Line 194: | Line 192: | ||
| Firstly, we can explore whether there are any significant structural differences between the domains associated with these functions. | Firstly, we can explore whether there are any significant structural differences between the domains associated with these functions. | ||
| - | Within a CATH superfamily, | + | Within a CATH superfamily, |
| If we go back to the homepage for the 3.20.20.70 superfamily, | If we go back to the homepage for the 3.20.20.70 superfamily, | ||
| Line 230: | Line 228: | ||
| The next thing we can look at is whether or not there are local changes, particularly around the active site, for example, residue mutations in the site and changes in catalytic residues. Taking 1h7oA00 and 1d3gA00 as our examples, we can go back to their respective functional family pages and look at the multiple alignments for those families. Highly conserved residues are highlighted in the alignment (as shown above) and the structure and you can compare them side by side to observe any differences. We are currently in the process of adding in catalytic residue information to the FunFam pages so that conserved residue and catalytic residue information can be viewed on the FunFam MSA and the representative structure. | The next thing we can look at is whether or not there are local changes, particularly around the active site, for example, residue mutations in the site and changes in catalytic residues. Taking 1h7oA00 and 1d3gA00 as our examples, we can go back to their respective functional family pages and look at the multiple alignments for those families. Highly conserved residues are highlighted in the alignment (as shown above) and the structure and you can compare them side by side to observe any differences. We are currently in the process of adding in catalytic residue information to the FunFam pages so that conserved residue and catalytic residue information can be viewed on the FunFam MSA and the representative structure. | ||
| - | We can also use SSAP to create a superposition of our two proteins and then compare the position of functional residues. Just type 1h7oA00 as protein 1 and 1d3gA00 as protein 2. An interactive | + | We can also use [[http:// |
| - | + | ||
| - | {{: | + | |
| + | {{: | ||
| The [[http:// | The [[http:// | ||
| - | Once you have your catalytic residues, highlight them on your RasMol superposition using the following commands - **select n1, n2, n3** etc (where n//x// denotes a catalytic residue number, for example, 17) then **spacefill** and then select a color - for example type **colour purple** if you want the catalytic residues for one of the proteins to be purple. | + | A jmol of the SSAP superposition has been provided with the catalytic residues of the domains highlighted. Here, 1h7oA00 is in pink, with its catalytic residues red and 1d3gA00 light blue with its catalytic residues blue |
| - | + | ||
| - | A jmol of the SSAP superposition has been provided | + | |
| <jsmol 1h7o_2 : | <jsmol 1h7o_2 : | ||
| Line 245: | Line 240: | ||
| </ | </ | ||
| - | It can clearly be seen that the catalytic residues of these two domains are in different 3D locations in the active site. An SSAP alignment of the two domains is below which highlights catalytic residues according to their properties. Aromatic residues are in red, polar residues in green and those with a positive charge are in purple. | + | It can clearly be seen that the catalytic residues of these two domains are in different 3D locations in the active site. A SSAP alignment of the two domains is below which highlights catalytic residues according to their properties. Aromatic residues are in red, polar residues in green and those with a positive charge are in purple. |
| {{: | {{: | ||