online.universita.zanichelli.it - in primo piano -...

26
Bioinformatics Exercises Over the last two decades, information has been gaining increasing importance in both teaching and learning biochemistry. The most obvious case is the sequencing of the human genome and many other complete genomes. In 1990, the determination of the sequence of a protein was often the topic of a full publication in a peer-reviewed journal such as Science, Nature, or The Journal of Biological Chemistry. Now entire genomes are the topic of individual research papers. The term "bioinformatics" is a catch-all phrase which generally refers to the use computers and computer science approaches to the study of biological systems. The main chapters where this information is discussed in the text are chapters 3 (Nucleotides, Nucleic Acids and Genetic Information), 5 (Proteins: Primary Structure), 6 (Proteins: Three- Dimensional Structure), 12 (Enzyme Kinetics, Inhibition and Regulation) and 13 (Introduction to Metabolism). Here we provide exercises appropriate to these chapters aimed at introducing the techniques of bioinformatics that involve the use of computers, Internet-accessible databases and the tools that have been developed to “mine” those databases. General principles 1. Open ended questions. The exercises may include some questions that have definite answers, but in many cases there will also be questions which may be answered in a number of ways, depending on the approach you take or the topic you select. 2. Stable Internet Resources. As much as possible, the exercises will be based on well established, stable web sites. If it is necessary to use less reliable sites and/or resources, attempts have been made to provide multiple sites that perform similar functions. 3. Here are the stable online resources that will be used most frequently: a. Genbank (http://www.ncbi.nlm.nih.gov/) b. Protein Data Bank (http://www.rcsb.org) c. Expasy Proteomics Server (http://us.expasy.org/) d. European Bioinformatics Institute (http://www.ebi.ac.uk/) e. Pfam (http://www.sanger.ac.uk/Software/Pfam/) f. SCOP (http://scop.mrc-lmb.cam.ac.uk/scop/) g. CATH (http://www.biochem.ucl.ac.uk/bsm/cath/) h. PubMed (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi)

Upload: vuongdat

Post on 17-May-2018

217 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

Bioinformatics Exercises

Over the last two decades, information has been gaining increasing importance in both teaching and learning biochemistry. The most obvious case is the sequencing of the human genome and many other complete genomes. In 1990, the determination of the sequence of a protein was often the topic of a full publication in a peer-reviewed journal such as Science, Nature, or The Journal of Biological Chemistry. Now entire genomes are the topic of individual research papers. The term "bioinformatics" is a catch-all phrase which generally refers to the use computers and computer science approaches to the study of biological systems. The main chapters where this information is discussed in the text are chapters 3 (Nucleotides, Nucleic Acids and Genetic Information), 5 (Proteins: Primary Structure), 6 (Proteins: Three-Dimensional Structure), 12 (Enzyme Kinetics, Inhibition and Regulation) and 13 (Introduction to Metabolism). Here we provide exercises appropriate to these chapters aimed at introducing the techniques of bioinformatics that involve the use of computers, Internet-accessible databases and the tools that have been developed to “mine” those databases.

General principles

1. Open ended questions. The exercises may include some questions that have definite answers, but in many cases there will also be questions which may be answered in a number of ways, depending on the approach you take or the topic you select.

2. Stable Internet Resources. As much as possible, the exercises will be based on well established, stable web sites. If it is necessary to use less reliable sites and/or resources, attempts have been made to provide multiple sites that perform similar functions.

3. Here are the stable online resources that will be used most frequently:

a. Genbank (http://www.ncbi.nlm.nih.gov/)b. Protein Data Bank (http://www.rcsb.org)c. Expasy Proteomics Server (http://us.expasy.org/)d. European Bioinformatics Institute (http://www.ebi.ac.uk/)e. Pfam (http://www.sanger.ac.uk/Software/Pfam/)f. SCOP (http://scop.mrc-lmb.cam.ac.uk/scop/)g. CATH (http://www.biochem.ucl.ac.uk/bsm/cath/)h. PubMed (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi)i. PubMed Central (http://www.pubmedcentral.nih.gov/)

4. Answer key. Where a definite answer is known, it will be provided in an answer key. For more open-ended questions, a typical correct answer will be presented.

5. Historical perspective. If historical resources are available online (including PubMed), there may be questions designed to help students identify some of the historical roots of biochemistry and molecular biology.

Project 4: Structural Alignment and Protein Folding

Page 2: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

Protein Folding is described in Section 6-5 in Fundamentals of Biochemistry (4th ed). The famous “Levinthal paradox” is presented along with the current thinking about different aspects of protein folding. You are encouraged to review the textbook before proceeding.

Background

The textbook describes the role of energy and entropy in protein folding. The role of kinetics in protein folding in represented in Figure 6-41, while the thermodynamics of protein folding can be described by the famous folding funnel diagram in Figure 6-42. Remember that protein folding is guided by a series of constraints that involve kinetics, thermodynamics, and function.

1. Kinetics. Once proteins are synthesized, they tend to fold rapidly; other processes such as disulfide isomerization and chaperone-assisted folding are important here, but those will not be the focus of these exercises.

2. Thermodynamics. The folding funnel presents the idea that a protein goes through one intermediate or a series of intermediates until it reaches a more stable (lower energy) form. This is not usually the thermodynamically most stable form of the protein.

3. Function. To function properly, proteins must have the correct conformation or fold, which means that they need to fold rapidly (to avoid getting selected for destruction) to the functional state.

You are certainly free to skip around this exercise, but you will probably understand it best if you go through it in a linear fashion. The exercise begins with installation of some software which you will use to explore some simple molecules and see how much variation can happen in just a small structure. The second half of the exercise is about the bienniel CASP competition for protein structure prediction, including a literature exploration exercise and an exploration of the 3D structural alignment options found on the Protein Data Bank web site (http://www.rcsb.org/pdb/home/home.do).

Software installation

To begin, you will need to visit the ChemAxon web site and install the Marvin Suite (http://www.chemaxon.com/products/marvin/marvinsketch/) on your computer. The Marvin Suite is free for everyone. We will be using a few more advanced features found in the Marvin Suite. These features are free for students and faculty members at universities and colleges, but you will need to apply for an academic license (http://www.chemaxon.com/free-software/academic-license/), a process that may take a day or so, to unlock these features.

I will include some screen shots of the installation process to assist you. This software was chosen for this exercise because it is free and available for Macintosh, Windows and Linux platforms. Your instructor may prefer other software, such as ChemSketch, a free

Page 3: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

chemical drawing program for the Windows operating system from ACD Labs (http://www.acdlabs.com/resources/freeware/chemsketch/) or one of several commercial software products. If that is the case, your instructor will be able to lead you through this exercise.

Figure 1. The opening screen for downloading the Marvin Suite (http://www.chemaxon.com/products/marvin/marvinsketch/)

Click on the Download Marvin Suite button. This will take you to a page where you need to decide what kind of user you are. Unless you have good reason to do otherwise, choose “End User”. You will then go to the download page. For Windows, your best bet is probably simply the “Windows Installer” since most Windows installations already include Java. There is only one option for the Mac (OS X Installer). Linux, like Windows usually includes Java, so your best bet is “Linux Installer”. When you select your preferred installer, you will need to register with ChemAxon to complete the download. The remainder of this installation is shown in Mac OS X (Snow Leopard), but the installation should be very similar on other platforms and operating systems.

For Mac OS X, the download is “marvinbeans-5.5.1.0-macos.dmg” as of 23 July 2011. The filename may change slightly over time but the installation should proceed as described below. When you double-click on the file, a window should pop up that contains “ChemAxon Marvin Beans Installer”.

Page 4: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

Double click on the the icon or filename and the installer wizard will appear.

Click the Next > button to get the License Agreement page.

Page 5: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

Click on the “I accept the agreement” radio button, but click on Next > to select the directory for the application. Again you are encouraged to use the default choice unless you have a specific reason to do otherwise.

Click on the Next > to go the Select Additional Tasks window and choose the “Create desktop icons” if you would like them.

Page 6: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

The installation process will then proceed and may take a few minutes. The image below was captured early in the installation process.

The Finish screen will appear next and you just need to click the Finish button to complete the installation.

Page 7: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

If you chose to the icons appear on your desktop, they should look like this.

1. Alignment of Small Molecules

Ethane: 2D and 3D

Open MarvinSketch by double clicking on the icon. You will see a window that looks like this.

Page 8: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

Let’s start with something very simple - ethane. Use the line tool (2nd icon on the left hand side of the window - the default selection) and click in the middle of the screen to make ethane to get this image.

Page 9: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

Use the select tool (upper left hand corner) to pick the ethane molecule. Move to the dropdown menu and select Structure...Add...Add Explicit Hydrogens to get this image.

Page 10: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

From the dropdown menu, select File...Save As.. and call the file ethane_2D.mrv and save it as a ChemAxon Marvin Document / MRV (*.mrv), the first option under the File Format menu.

Select the ethane with the Selection tool again. From the dropdown menu select View...Transform...Rotate in 3D. What is nature of the bonds in ethane in this image?

With ethane still selected, choose Structure...Clean 3D...Clean in 3D from the dropdown menu. Then view it in 3D again (View...Transform...Rotate in 3D). How did the structure change and why did it change?

From the dropdown menu, select File...Save As.. and call the file ethane_3D.mrv.

Now explore the molecule. Rotate it in 3D (View...Transform...Rotate in 3D) to see the relative positions of the hydrogen atoms. Think back to your organic chemistry course to explain why the hydrogens on the two carbons are not superimposed when you look down the carbon-carbon single bond.

To learn more about how MarvinSketch optimizes the 3D conformation of a molecule, look at the Structure...Clean 3D...Cleaning Method menu. What is the significance of

Page 11: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

each of those options? You can find answers in the MarvinSketch help files by searching for Cleaning Methods.

Lysine: 2D and 3D

Now let’s repeat the ethane exercise on something a bit more complex - the amino acid, lysine. Here are the steps to take.

1. Draw the structure of lysine.

2. From the dropdown menu, select Structure...Add...Add Explicit Hydrogens to get this image.

3. From the dropdown menu select View...Transform...Rotate in 3D. Once again, it is just a flat structure.

4. From the dropdown menu select Structure...Clean 3D...Clean in 3D. Then use View...Transform...Rotate in 3D again. Notice the changes in the structure to take advantage of the 3D space.

5. For further interest, you are encouraged to create 2D drawings, then perform 3D optimizations on some common small molecules found in biochemistry: citric acid, acetyl CoA, ATP, NAD+.

2. Alignment of Peptides

The purpose of the simple exercises with ethane and lysine was to remind you of the difference between the 2D representations we often use for molecules and the 3D conformations that those same molecules assume in solution. You may wonder, “How does this relate to protein folding?” Protein folding is a multi-dimensional problem that

Page 12: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

must include thermodynamics (stability), kinetics (formation in a useful time frame) and function (preparing a protein that can do its job). As proteins fold, they are also constrained by the steric and electronic requirements of the amino acids that make up their chains. The bulky tryptophan side chain needs enough room to maneuver into position. The negatively charged glutamate side chain will be repelled by another negatively charged side chain and attracted by a positively charged lysine or arginine side chain.

In this exercise, you will construct a peptide consisting of 12 amino acids, optimize that peptide in 3D space, and then compare the conformation of the same exact peptide sequence that is found in dihydrofolate reductase from Candida albicans (PDB entry 3QLW): Lysine - Glutamine - Proline - Lysine - Serine - Glutamate - Leucine - Glutamine - Lysine - Phenylalanine - Valine - Glycine, which are residues 158 - 169 in the primary sequence of the enzyme.

1. You can draw this structure if you’d like the challenge. Otherwise you can download this file (link to KQPKSELQKFVG_peptide.mrv), which contains the 2D structure of the peptide that has not been optimized. It should look something like this:

2. From the dropdown menu, select Structure...Add...Add Explicit Hydrogens to get this image.

Page 13: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

3. From the dropdown menu select View...Transform...Rotate in 3D so you can see it as a flat structure.

4. From the dropdown menu select Structure...Clean 3D...Clean in 3D. Save the file as KQPKSELQKFVG_3D_optimized_in_MarvinSketch.mrv. Then use View...Transform...Rotate in 3D again. Notice the changes in the structure to take advantage of the 3D space. How long did it take to clean the structure in 3D compared to ethane or lysine?

At this point, you have a 12 amino acid peptide that is optimized in 3D space to account for steric and electronic effects. Did this take more time than the lysine? Please note that this is a simple optimization; a commercial program such as Spartan (Wavefunction, Inc.; http://www.wavefun.com/products/spartan.html) would do a much more rigorous optimization and would probably take a great deal more time.

Explore the structure to see if the peptide bonds are planar as they are known to be in proteins. Do you see other features of the peptide where parts of the molecule rotated out of position to minimize steric or electronic conflicts?

5. Now we will move on to compare this optimized structure with the peptide that is found as a turn-helix portion of the structure of dihydrofolate reductase (DHFR) from Candida albicans using the tools in MarvinSketch and MarvinSpace.

a. Download the DHFR peptide in the file 3QLW_158_169.pdb, which contains residues 158-169 from PDB entry 3QLW.

b. Open three new windows in MarvinSketch. In the first window open the file 3QLW_158_169.pdb. In the second window open the file KQPKSELQKFVG_3D_optimized_in_MarvinSketch.mrv. Go to the third window and save it as KQPKSELQKFVG_3D_aligned_to_3QLW_158_169.mrv.

Page 14: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

c. Copy the peptide from 3QLW_158_169.pdb and paste it into KQPKSELQKFVG_3D_aligned_to_3QLW_158_169.mrv.

d. In the window containing KQPKSELQKFVG_3D_optimized_in_MarvinSketch, use Structure...Remove...Remove Explicit Hydrogens to simplify the structure. Then use View...Transform...Rotate in 3D to rotate the peptide so that you can clearly see the phenylalanine side chain.

Page 15: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

e. Copy the peptide from KQPKSELQKFVG_3D_optimized_in_MarvinSketch.mrv and paste it into KQPKSELQKFVG_3D_aligned_to_3QLW_158_169.mrv below the previous peptide. Just by looking at the image, you can see that the lower peptide (the one you made) is more extended than the upper peptide (the segment from PDB entry 3QLW).

Page 16: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

f. The next step is to use the reaction arrow (6th icon down on the left side of

the MarvinSketch window) to connect two of the phenylalanine ring carbons from the top structure to the identical ring carbons in the lower structure. You can tell that it works if you see the numbers 1 and 2 next to those carbons in the MarvinSketch window.

Page 17: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous
Page 18: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

g. Select both peptides entirely (to preserve their original 3D conformations throughout the alignment process), then choose Tools...Conformation...3D Alignment from the dropdown menus. This will bring up a dialog box. Make sure that the “Align by extended atom types” is selected from the Alignment Options menu and click on the OK button.

h. The output of the alignment will appear in a MarvinSketch window that allows you to rotate the aligned structures. You can also selectively make them disappear to better explore the alignment using the buttons on the right hand side. The alignment in the left window uses CPK coloring and the alignment in the right window is colored by peptide. Notice how different the conformation of the two peptides is. In the right window, the blue peptide (the native conformation of 3QLW fragment 158-169) is more compact (and actually contains an alpha helix), while the same peptide sequence optimized by MarvinSketch (in white) is much more elongated.

Page 19: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous
Page 20: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

3. CASP and Protein Structure Prediction

The main goal of the exercises with the small molecules and the peptide is to help you understand how many possible ways that molecules can bend and the forces that drive their 3D shape. Imagine scaling up from a single amino acid, such as lysine, to a short peptide, such as KQPKSELQKFVG, to a protein with 250 amino acids in its primary structure. That protein must fold in a way that meets all the steric and electronic requirements of the individual side chains; at the same time it has to be sufficiently stable to maintain its 3D structure for periods of time ranging from minutes to years. Even this is not sufficient - it must also have the correct 3D structure to perform its function. The remainder of this exercise will focus on a bienniel competition called CASP (http://www.predictioncenter.org/casp9/index.cgi), which relates to protein folding.

Here are some questions about CASP to get you started.

1. What is CASP? Please note - CASP is an acronym for many things, including the Connecticut Association of School Psychologists, but you need to focus on a group interested in predicting protein structure.

2. What are the goals of CASP?3. Where was CASP held most recently?4. What structures are candidates for CASP?5. Several different approaches are used in the CASP competition. Two of these

approaches are called “template-based modeling” and “termplate-free modeling”. Explain the basis for each approach.

CASP in the literature

Find a literature article written since January, 2009, that relates to the CASP competition. Collect the citation information (Authors, Title, Journal, Volume, Pages, Year of publication)

Read the abstract to find the relationship of this journal article to the CASP competition and describe that relationship. What aspects of protein folding are the focus of the article? Do the authors refer to any structures in the Protein Data Bank (http://www.rcsb.org/pdb/home/home.do). If so, find the structures and attempt to identify/visualize any unique folds that are described in the journal article.

3D Alignment of Protein Structures in the Protein Data Bank

The Protein Data Bank web site offers you the ability to compare structures by 3D similarity. So to finish this exercise, we’ll look at one structure that has many structural homologs in the Protein Data Bank and one of the structures for the CASP competition that is more unique.

Page 21: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

On the PDB web site (http://www.rcsb.org/pdb/home/home.do) search for the structure 1ML4. Collect the descriptive information on 1ML4: Authors, Title, Journal Citation, Year). On the main page for 1ML4 (http://www.rcsb.org/pdb/explore/structureCluster.do?structureId=1ML4), there are many tabs you can select: Summary, Sequence, Annotations, Seq. Similarity, 3D Similarity, Literature, Biol. & Chem., Methods, Geometry, and Links. It is a worthwhile exercise for any student of biochemistry to explore all of these tabs for one or more proteins, but we will focus here on the 3D Similarity tab, so click on that tab will open a window that contains the table shown here.

Now answer these questions about 3D Similarity comparisons to 1ML4.

1. How many chains are considered structural homologs of 1ML4, chain A?2. Do all the chains listed catalyze the same reactions, based on their enzyme

names? If not, how many different reactions are catalyzed by the first 30 structures in the list?

3. The relationships between the chains are quantitatively evaluated based on 8 different criteria (P-value, Score, Rmsd, Len1, Len2, %ID, %Cov1, %Cov2). Use the legend on the right hand side of this page to find the meaning of each of these terms. Note that the query protein (1ML4 in this case) is considered Chain 1 in the comparisons listed here, while chain 2 refers to the other protein in this alignment. In this specific case, the #1 ranking chain 2 was chain I of PDB entry 1DUV (as of 30 July 2011 - the results will change with time).

4. In the table on the 3D Similarity page, click on the View link for the protein chain that ranked first in the table. Scroll down the page and look for the alignment in the Jmol applet window, which should look like this. The query chain (1ML4, chain A) is shown in orange, while the target chain (1DUV, chain I) is shown in light blue.

Page 22: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

Compare the two chains to look for similarities and differences. Enter the following command in the Jmol Script box: select all; backbone off; cartoon. What features of secondary structure to you see? Are they present in both structures? Are there any differences between the structures?

Now let’s consider a protein structure that was determined from the 2008 CASP competition. The instructions are simple - follow the four steps listed above for your evaluation of 1ML4 with one protein from each of these two classes from the 2008 CASP competition.

Template-based competition (Česlovas Venclovas, Mindaugas Margelevičius, The use of automatic tools and human expertise in template-based modeling of CASP8 target proteins. Proteins: Structure, Function, and Bioinformatics, 77, Supplement S9, 81–88, 2009; Daniel A. Keedy, Christopher J. Williams, Jeffrey J. Headd, W. Bryan Arendall III, Vincent B. Chen, Gary J. Kapral, Robert A. Gillespie, Jeremy N. Block, Adam Zemla, David C. Richardson, Jane S. Richardson. The other 90% of the protein: Assessment beyond the Cαs for CASP8 template-based and high-accuracy models,Proteins: Structure, Function, and Bioinformatics, 77, Supplement S9, 29–49, 2009): 3DSM, 3DLB, 3D3U, 2K49, 2VX3.

Page 23: online.universita.zanichelli.it - In primo piano - Zanichellionline.universita.zanichelli.it/voet3e/files/2013/05/... · Web viewFundamentals of Biochemistry (4th ed). The famous

Template-free competition (Moshe Ben-David, Orly Noivirt-Brik, Aviv Paz, Jaime Prilusky, Joel L. Sussman, Yaakov Levy. Assessment of CASP8 structure predictions for template free targets, Proteins: Structure, Function, and Bioinformatics, 77, Supplement S9, 50–65, 2009): 3D4R, 3D3Q, 3DEE, 2K5C, 3DFD, 2K4N, 2K4V, 3DOA, 3DO9.Finally, was there a difference in the number of homologs for these structures compared to those for 1ML4 or another PDB entry that is described in your textbook? Can you venture a guess why this is the case?