Carbohydrates Lehninger Principles of Biochemistry PDF
Document Details
Uploaded by Deleted User
Tags
Related
- Lecture 7: Cell surface Oligosaccharides I (March 13) - Membrane Biochemistry 2024 PDF
- Carbohydrates AG - Part I Biochemistry of Sugars (PDF)
- Carbohydrates - The Biochemistry of Sugars (Part II) PDF
- Carbohydrates - The Biochemistry of Sugars (Part III) PDF
- Introduction To Biochemistry And Carbohydrates PDF
- Biochemistry of Carbohydrates PDF - 2024/2025
Summary
This chapter from a biochemistry textbook details the structure, function, and classification of carbohydrates. It covers topics such as monosaccharides, disaccharides, polysaccharides, and glycoconjugates. The importance of carbohydrates in biological systems is highlighted throughout the text.
Full Transcript
CHAPTER 7 CARBOHYDRATES AND GLYCOBIOLOGY 7.1 Monosaccharides and Disaccharides 7.2 Polysaccharides 7.3 Glycoconjugates: Proteoglycans, Glycoproteins, and Glycolipids 7.4 Carbohydrates as Informational Molecules: The Sugar Code 7.5 Working with Carbohydrates Carbohydrates are the most abundan...
CHAPTER 7 CARBOHYDRATES AND GLYCOBIOLOGY 7.1 Monosaccharides and Disaccharides 7.2 Polysaccharides 7.3 Glycoconjugates: Proteoglycans, Glycoproteins, and Glycolipids 7.4 Carbohydrates as Informational Molecules: The Sugar Code 7.5 Working with Carbohydrates Carbohydrates are the most abundant biomolecules on Earth. Each year, photosynthesis converts more than 100 billion metric tons of CO and H 2 2 O into cellulose and other plant products. The carbohydrates in these plant products are a dietary staple in most parts of the world, and the oxidation of carbohydrates is the central energy-yielding pathway in most nonphotosynthetic cells. Carbohydrate polymers called glycans serve as structural and protective elements in the cell walls of bacteria, fungi, and plants, and in the connective tissues of animals. Other carbohydrate polymers lubricate skeletal joints and participate in cell-cell recognition and adhesion. In addition, some complex 908 carbohydrate polymers covalently attached to proteins or lipids act as signals that determine the intracellular destination or metabolic fate of these hybrid molecules, called glycoconjugates. Carbohydrates are aldehydes or ketones with at least two hydroxyl groups, or substances that yield such compounds on hydrolysis. Many, but not all, carbohydrates have the empirical formula (CH 2 O)n ; some also contain nitrogen, phosphorus, or sulfur. There are three major size classes of carbohydrates: monosaccharides, oligosaccharides, and polysaccharides (the word “saccharide” is derived from the Greek sakcharon, meaning “sugar”). Monosaccharides, or simple sugars, consist of a single polyhydroxy aldehyde or ketone unit. The most abundant monosaccharide in nature is the six-carbon sugar D -glucose, sometimes referred to as dextrose. Oligosaccharides consist of short chains of monosaccharide units, or residues, joined by characteristic linkages called glycosidic bonds. The most abundant are the disaccharides, with two monosaccharide units. Sucrose (table sugar), for example, consists of the six-carbon sugars D -glucose and D -fructose. All common monosaccharides and disaccharides have names ending with the suffix “-ose.” In cells, most oligosaccharides consisting of three or more units do not occur as free entities but are joined to nonsugar molecules (lipids or proteins) in glycoconjugates. The polysaccharides are sugar polymers containing more than 10 monosaccharide units; some have hundreds or thousands of units. Some polysaccharides, such as cellulose, are linear chains; 909 others, such as glycogen, are branched. Both cellulose and glycogen consist of recurring units of D -glucose, but they differ in the type of glycosidic linkage and consequently have strikingly different properties and biological roles. This chapter introduces the major classes of carbohydrates and glycoconjugates and provides examples of their many structural and functional roles. Learning the structures and chemical properties of biomolecules is essential, because they are the vocabulary and the grammar of biochemistry. As you read about carbohydrates, note how specific cases illustrate these general principles that underlie all of biochemistry. Carbohydrates can have multiple chiral carbons; the configuration of groups around each carbon atom determines how the compound interacts with other biomolecules. As we saw for L -amino acids in proteins, with rare exceptions, biological evolution selected one stereochemical series (D -series) for sugars. Monomeric subunits, monosaccharides, serve as the building blocks of large carbohydrate polymers. The specific sugar, the way the units are linked, and whether the polymer is branched determine its properties and thus its function. Storage of low molecular weight metabolites in polymeric form avoids the very high osmolarity that would result from storing them as individual monomers. If the glucose in liver glycogen were monomeric, the glucose 910 concentration in liver would be so high that cells would swell and lyse from the entry of water by osmosis. The sequences of complex polysaccharides are determined by the intrinsic properties of the biosynthetic enzymes that add each monomeric unit to the growing polymer. This is in contrast with DNA, RNA, and proteins, which are synthesized on templates that direct their sequence. Polysaccharides assume three-dimensional structures with the lowest-energy conformations, determined by covalent bonds, hydrogen bonds, charge interactions, and steric factors. Starch folds into a helical structure stabilized by internal hydrogen bonds; cellulose assumes an extended structure in which intermolecular hydrogen bonds are more important. Molecular complementarity is central to function. The recognition of oligosaccharides by sugar-binding proteins (lectins) results from a perfect fit between lectin and ligand. An almost infinite variety of discrete structures can be built from a small number of monomeric subunits. Even short polymers, when arranged in different sequences, joined through different linkages, and branched to specific degrees, present unique faces recognized by their molecular partners. 911 7.1 Monosaccharides and Disaccharides The simplest of the carbohydrates, the monosaccharides, are either aldehydes or ketones with two or more hydroxyl groups; the six-carbon monosaccharides glucose and fructose have five hydroxyl groups. Many of the carbon atoms to which the hydroxyl groups are attached are chiral centers, which give rise to the many sugar stereoisomers found in nature. Stereoisomerism in sugars is biologically significant because the enzymes that act on sugars are strictly stereospecific, typically preferring one stereoisomer to another by three or more orders of magnitude, as reflected in K values or binding constants. It is as difficult to fit m the wrong sugar stereoisomer into an enzyme’s binding site as it is to put your le glove on your right hand. We begin by describing the families of monosaccharides with backbones of three to seven carbons — their structure, their stereoisomeric forms, and the means of representing their three- dimensional structures on paper. We then discuss several chemical reactions of the carbonyl groups of monosaccharides. One such reaction, the addition of a hydroxyl group from within the same molecule, generates cyclic forms with four or more backbone carbons (the forms that predominate in aqueous solution). This ring closure creates a new chiral center, adding further stereochemical complexity to this class of compounds. The nomenclature for unambiguously specifying the configuration about each carbon atom in a cyclic form and the 912 means of representing these structures on paper are described in some detail; this information will be useful as we discuss the metabolism of monosaccharides in Part II. We also introduce here some important monosaccharide derivatives that will be examined closely in later chapters. The Two Families of Monosaccharides Are Aldoses and Ketoses Monosaccharides are colorless, crystalline solids that are freely soluble in water but insoluble in nonpolar solvents. Most have a sweet taste (Box 7-1). The backbones of common monosaccharides are unbranched carbon chains in which all the carbon atoms are linked by single bonds. In this open-chain form, one of the carbon atoms is double-bonded to an oxygen atom to form a carbonyl group; each of the other carbon atoms has a hydroxyl group. If the carbonyl group is at an end of the carbon chain (that is, in an aldehyde group), the monosaccharide is an aldose; if the carbonyl group is at any other position (in a ketone group), the monosaccharide is a ketose. The simplest monosaccharides are the two three-carbon trioses: glyceraldehyde, an aldotriose, and dihydroxyacetone, a ketotriose (Fig. 7-1a). BOX 7-1 MEDICINE 913 What Makes Sugar Sweet? Sweetness is one of the five basic flavors that humans can taste; the others are sour, bitter, salty, and umami. Sweetness is detected by protein receptors in the plasma membranes of cells in the taste buds on the surface of the tongue. In humans, two closely related genes (TAS1R2 and TAS1R3) encode sweet-taste receptors (Fig. 1). When a molecule with a compatible structure binds a sweet- taste receptor’s extracellular domain, it triggers a series of events in the cell (including activation of a GTP-binding protein; see Fig. 12-20) that generates an electrical signal to the brain that is interpreted as “sweet.” FIGURE 1 The receptor for sweet-tasting substances, showing its regions of interaction (short arrows) with various sweet-tasting compounds. Each receptor subunit has seven transmembrane helices, a common feature of signaling receptors. Artificial sweeteners bind to only one of the two receptor subunits; natural sugars bind to both. TAS1R2 and TAS1R3 are the proteins encoded by the 914 genes TAS1R2 and TAS1R3. [Information from F. M. Assadi-Porter et al., J. Mol. Biol. 398:584, 2010, Fig. 1.] During evolution, there has probably been selection for the ability to taste compounds found in foods containing important nutrients, such as the carbohydrates that are major fuels for most organisms. Most sugars, including sucrose, glucose, lactose, and fructose, taste sweet, but other classes of compounds also bind sweet-taste receptors. The amino acids glycine, alanine, and serine are mildly sweet and harmless; nitrobenzene and ethylene glycol have a strong sweet taste but are toxic. (See Box 18-2 for a remarkable medical mystery involving ethylene glycol poisoning.) Several natural products are extraordinarily sweet. Stevioside, a sugar derivative isolated from the leaves of the stevia plant (Stevia rebaudiana Bertoni), is several hundred times sweeter than an equivalent amount of sucrose. The small protein brazzein (54 amino acids), isolated from berries of the Oubli vine (Pentadiplandra brazzeana Baillon) in Gabon and Cameroon, is 17,000 times sweeter than sucrose on a molar basis. Presumably, the sweetness of the berries encourages their consumption by animals that then disperse the seeds so that new plants are established. In societies where obesity is a major health problem, compounds that give foods a sweet taste without adding the calories found in sugars are common food additives. The artificial sweetener aspartame demonstrates the importance of stereochemistry in biology (Fig. 2). One simple model of binding to the sweet-taste receptor involves three sites: AH , B , and X. Site AH has + − + a group (an alcohol or an amine) that binds the partially negative oxygen of the carboxylic acid of (S,S)-aspartame. Site B has a partially negative oxygen − available to hydrogen-bond with the amine nitrogen of (S,S)-aspartame. Site X is oriented perpendicular to the other two groups and can accommodate the hydrophobic benzene ring of (S,S)-aspartame. 915 FIGURE 2 Stereochemical basis for the taste of two isomers of aspartame. [Information from http://chemistry.elmhurst.edu/vchembook/549receptor.html, © Charles E. Ophardt, Elmhurst College.] When the steric match is correct, as for (S,S)-aspartame, on the le in Figure 2, the sweet receptor is stimulated and the signal “sweet” is conducted to the brain. When the match is not correct, as for (R,S)-aspartame, the sweet receptor is not stimulated; in fact, (R,S)-aspartame stimulates a separate receptor specifically for bitterness. Stereoisomerism really matters! FIGURE 7-1 Representative monosaccharides. (a) Two trioses: an aldose and a ketose. The carbonyl group in each is shaded. (b) Two common hexoses. (c) The pentose components of nucleic acids. -Ribose is a component of ribonucleic acid (RNA), and 2- deoxy- -ribose is a component of deoxyribonucleic acid (DNA). Monosaccharides with four, five, six, and seven carbon atoms in their backbones are called, respectively, tetroses, pentoses, 916 hexoses, and heptoses. There are aldoses and ketoses of each of these chain lengths: aldotetroses and ketotetroses, aldopentoses and ketopentoses, and so on. The hexoses, which include the aldohexose D -glucose and the ketohexose D -fructose (Fig. 7-1b), are the most common monosaccharides in nature — the products of photosynthesis and key intermediates in the central energy- yielding reaction sequence in most organisms. The aldopentoses D -ribose and 2-deoxy-D -ribose (Fig. 7-1c) are components of nucleotides and nucleic acids (Chapter 8). Monosaccharides Have Asymmetric Centers All the monosaccharides except dihydroxyacetone contain one or more asymmetric (chiral) carbon atoms and thus occur in optically active isomeric forms (pp. 16–17). The simplest aldose, glyceraldehyde, contains one chiral center (the middle carbon atom) and therefore has two different optical isomers, or enantiomers (Fig. 7-2). 917 FIGURE 7-2 Three ways to represent the two enantiomers of glyceraldehyde. The enantiomers are mirror images of each other. Ball- and-stick models show the actual configuration of molecules. In Fischer projections, vertical lines point behind the plane of the page, and horizontal lines project above the page. Recall (see Fig. 1-17) that in perspective 918 formulas, the wide end of a solid wedge projects out of the plane of the paper, toward the reader; a dashed wedge extends behind. KEY CONVENTION One of the two enantiomers of glyceraldehyde is, by convention, designated the D isomer; the other is the L isomer. As for other biomolecules with chiral centers, the absolute configurations of sugars are known from x-ray crystallography. To represent three- dimensional sugar structures on paper, we o en use Fischer projection formulas (Fig. 7-2). In these projections, bonds drawn horizontally indicate bonds that project out of the plane of the paper, toward the reader; bonds drawn vertically project behind the plane of the paper, away from the reader. In general, a molecule with n chiral centers can have 2 n stereoisomers. Glyceraldehyde has 2 1 = 2 ; the aldohexoses, with four chiral centers, have 2 4 = 16. The stereoisomers of monosaccharides of each carbon-chain length can be divided into two groups that differ in the configuration about the chiral center most distant from the carbonyl carbon. Those in which the configuration at this reference carbon is the same as that of D - glyceraldehyde are designated D isomers, and those with the same configuration as L -glyceraldehyde are L isomers. In other words, when the hydroxyl group on the reference carbon is on the right (dextro) in a projection formula that has the carbonyl carbon at the top, the sugar is the D isomer; when the hydroxyl group is on 919 the le (levo), it is the L isomer. Of the 16 possible aldohexoses, eight are D forms and eight are L. Most of the hexoses of living organisms are D isomers. Why D isomers? An interesting and unanswered question. Recall that all of the amino acids found in proteins are exclusively one of two possible stereoisomers, L (p. 73). The basis for this initial preference for one isomer during evolution is unknown; however, once one isomer became prevalent, evolving enzymes able to use that isomer efficiently would have a selective advantage. Figure 7-3 shows the structures of the D stereoisomers of all the aldoses and ketoses having three to six carbon atoms. The carbons of a sugar are numbered beginning at the end of the chain nearest the carbonyl group. Each of the eight D - aldohexoses, which differ in the stereochemistry at C-2, C-3, or C- 4, has its own name: D -glucose, D -galactose, D -mannose, and so forth (Fig. 7-3a). The four- and five-carbon ketoses are designated by inserting “ul” into the name of a corresponding aldose; for example, D -ribulose is the ketopen-tose corresponding to the aldopentose D -ribose. (The importance of ribulose will become clear when we discuss the fixation of atmospheric CO by green 2 plants, in Chapter 20.) The ketohexoses are named otherwise: for example, fructose (from the Latin fructus, “fruit”; fruits are one source of this sugar) and sorbose (from Sorbus, the genus of mountain ash, which has berries rich in the related sugar alcohol sorbitol). Two sugars that differ only in the configuration around one carbon atom are called epimers; D -glucose and D -mannose, 920 which differ only in the stereochemistry at C-2, are epimers, as are D -glucose and D -galactose (which differ at C-4) (Fig. 7-4). 921 FIGURE 7-3 Aldoses and ketoses. The series of (a) -aldoses and (b) -ketoses having from three to six carbon atoms, shown as projection formulas. The carbon atoms in red are chiral centers. In all of these isomers, the chiral carbon most distant from the carbonyl carbon has the same configuration as the chiral carbon in -glyceraldehyde. Shown are the most common sugars in nature; you will encounter these again in this and later chapters. FIGURE 7-4 Epimers. -Glucose and two of its epimers are shown as projection formulas. Each epimer differs from -glucose in the configuration at one chiral center (shaded light red or blue). Some sugars occur naturally in their L form; examples are L - arabinose and the L isomers of some sugar derivatives that are common components of glycoconjugates (Section 7.3). 922 The Common Monosaccharides Have Cyclic Structures For simplicity, we have thus far represented the structures of aldoses and ketoses as straight-chain molecules (Figs 7-3, 7-4). In fact, in aqueous solution, aldotetroses and all monosaccharides with five or more carbon atoms in the backbone occur predominantly as cyclic (ring) structures in which the carbonyl group has formed a covalent bond with the oxygen of a hydroxyl group in the same sugar molecule. The formation of these ring structures is the result of a general reaction between alcohols and aldehydes or ketones to form derivatives called hemiacetals or hemiketals. Two molecules of an alcohol can add to a carbonyl carbon; the product of the first addition is a hemiacetal (for addition to an aldose) or a hemiketal (for addition to a ketose). If the —OH and carbonyl groups are on the same molecule, a five- or six-membered ring results. Addition of the second molecule of alcohol produces the full acetal or ketal (Fig. 7-5), and the linkage formed is a glycosidic bond. When the two molecules that react are monosaccharides, the acetal or ketal formed is a disaccharide. 923 FIGURE 7-5 Formation of hemiacetals and hemiketals. An aldehyde or a ketone can react with an alcohol in a 1:1 ratio to yield a hemiacetal or a hemiketal, respectively, creating a new chiral center at the carbonyl carbon. Substitution of a second alcohol molecule produces an acetal or a ketal. When the second alcohol is part of another sugar molecule, the bond produced is a glycosidic bond. The reaction with the first molecule of alcohol creates an additional chiral center (the carbonyl carbon). Because the alcohol can add in either of two ways, attacking either the “front” or the “back” of the carbonyl carbon, the reaction can produce either of two stereoisomeric configurations, denoted α and β. For example, D -glucose exists in solution as an intramolecular hemiacetal in which the free hydroxyl group at C-5 has reacted with the aldehydic C-1, rendering the latter carbon asymmetric and producing two possible stereoisomers, designated α and β (Fig. 7-6). Isomeric forms of monosaccharides that differ only in their configuration about the hemiacetal or hemiketal carbon atom are called anomers, and the carbonyl carbon atom is called the anomeric carbon. 924 FIGURE 7-6 Formation of the two cyclic forms of -glucose. Reaction between the aldehyde group at C-1 and the hydroxyl group at C-5 forms a hemiacetal linkage, producing either of two stereoisomers, the α and β anomers, which differ only in the stereochemistry around the hemiacetal 925 carbon. This reaction is reversible. The interconversion of α and β anomers is called mutarotation. Six-membered ring compounds are called pyranoses because they resemble the six-membered ring compound pyran (shown in Fig. 7-7). The systematic names for the two ring forms of D - glucose are therefore α- -glucopyranose and β- - D D glucopyranose. Ketohexoses (such as fructose) also occur as cyclic compounds with α and β anomeric forms. In these compounds, the hydroxyl group at C-5 (or C-6) reacts with the keto group at C-2 to form a furanose (or pyranose) ring containing a hemiketal linkage (Fig. 7-5). D -Fructose readily forms the furanose ring (Fig. 7-7); the more common anomer of this sugar in combined forms or in derivatives is β- -fructofuranose. D 926 FIGURE 7-7 Pyranoses and furanoses. The pyranose forms of -glucose and the furanose forms of -fructose are shown here as Haworth perspective formulas. The edges of the ring nearest the reader are represented by bold lines. Hydroxyl groups below the plane of the ring in these Haworth perspectives would appear at the right side of a Fischer projection (compare with Fig. 7-6). Pyran and furan are shown for comparison. 927 Cyclic sugar structures are more accurately represented in Haworth perspective formulas than in the Fischer projections commonly used for linear sugar structures. In Haworth perspectives, the six-membered ring is tilted to make its plane almost perpendicular to that of the paper, with the bonds closest to the reader drawn thicker than those farther away, as in Figure 7-7. KEY CONVENTION To convert the Fischer projection formula of any linear D -hexose to a Haworth perspective formula showing the molecule’s cyclic structure, draw the six-membered ring (five carbons, and one oxygen at the upper right), number the carbons in a clockwise direction beginning with the anomeric carbon, then place the hydroxyl groups. If a hydroxyl group is to the right in the Fischer projection, it is placed pointing down (i.e., below the plane of the ring) in the Haworth perspective; if it is to the le in the Fischer projection, it is placed pointing up (i.e., above the plane) in the Haworth perspective. The terminal — CH 2 OH group projects upward for the D enantiomer, downward for the L enantiomer. The hydroxyl on the anomeric carbon can point up or down. When the anomeric hydroxyl of a D -hexose is on the same side of the ring as C-6, the structure is by definition β; when it is on the opposite side from C-6, the structure is α. 928 WORKED EXAMPLE 7-1 Conversion of Fischer Projection to Haworth Perspective Formulas Draw the Haworth perspective formulas for D -mannose and D - galactose. 929 SOLUTION: Pyranoses are six-membered rings, so start with six-membered Haworth structures with the oxygen atom at the top right. Number the carbon atoms clockwise, starting with the aldose carbon. For mannose, place the hydroxyls on C-2, C-3, and C-4 above, above, and below the ring, respectively (because in the Fischer projection they are on the le , le , and right sides of the mannose backbone). For D -galactose, the hydroxyls are oriented below, above, and above the ring for C-2, C-3, and C-4, respectively. The hydroxyl at C-1 can point either up or down; there are two possible configurations, α and β, at this carbon. 930 WORKED EXAMPLE 7-2 Drawing Haworth Perspective Formulas of Sugar Isomers Draw the Haworth perspective formulas for α- -mannose and β- D L -galactose. SOLUTION: The Haworth perspective formula of D -mannose from Worked Example 7-1 can have the hydroxyl group at C-1 pointing either up or down. According to the Key Convention, for the α form, the C- 1 hydroxyl is pointing down when C-6 is up, as it is in D -mannose. For β- -galactose, use the Fischer representation of L D -galactose (see Worked Example 7-1) to draw the correct Fischer representation of L -galactose, which is its mirror image: the hydroxyls at C-2, C-3, C-4, and C-5 are on the le , right, right, and le sides, respectively. Now draw the Haworth perspective, a six- membered ring in which the —OH groups on C-2, C-3, and C-4 are oriented up, down, and down, respectively, because in the Fischer representation they are on the le , right, and right sides. Because it is the β form, the —OH on the anomeric carbon points down (same side as C-5). The α and β anomers of D -glucose interconvert in aqueous solution by a process called mutarotation, in which one ring 931 form (say, the α anomer) opens briefly into the linear form, then closes again to produce the β anomer (Fig. 7-6). Thus, a solution of β- -glucose and a solution of α- -glucose eventually form D D identical equilibrium mixtures having identical optical properties. This mixture consists of about one-third α- -glucose, D two-thirds β- -glucose, and very small amounts of the linear D form and the five-membered ring (glucofuranose) form. Haworth perspective formulas like those in Figure 7-7 are commonly used to show the stereochemistry of ring forms of monosaccharides. However, the six-membered pyranose ring is not planar, as Haworth perspectives suggest, but tends to assume either of two “chair” conformations (Fig. 7-8). Recall from Chapter 1 that two conformations of a molecule are interconvertible without the breakage of covalent bonds, whereas two configurations can be interconverted only by breaking a covalent bond. To interconvert α and β configurations, the bond involving the ring oxygen atom has to be broken, but interconversion of the two chair forms (which are conformers) does not require bond breakage and does not change configurations at any of the ring carbons. The specific three- dimensional structures of the monosaccharide units are important in determining the biological properties and functions of some polysaccharides, as we shall see. 932 FIGURE 7-8 Conformational formulas of pyranoses. (a) Two chair forms of the pyranose ring of β- -glucopyranose. Two conformers such as these are not readily interconvertible; an input of about 46 kJ of energy per mole of sugar is required to force the interconversion of chair forms. Another conformation, the “boat” (not shown), is seen only in derivatives with very bulky substituents. (b) The preferred chair conformation of α- - glucopyranose. Organisms Contain a Variety of Hexose Derivatives In addition to simple hexoses such as glucose, galactose, and mannose, there are many sugar derivatives in which a hydroxyl group in the parent compound is replaced with another 933 substituent, or a carbon atom is oxidized to a carboxyl group (Fig. 7-9). In glucosamine, galactosamine, and mannosamine, the hydroxyl at C-2 of the parent compound is replaced with an amino group. The amino group is commonly condensed with acetic acid, as in N-acetylglucosamine. This glucosamine derivative is part of many structural polymers, including those of the bacterial cell wall. Substitution of a hydrogen for the hydroxyl group at C-6 of L - galactose or L -mannose produces L -fucose or L -rhamnose, respectively. L -Fucose is found in the complex oligosaccharide components of glycoproteins and glycolipids; L -rhamnose is found in plant polysaccharides. 934 FIGURE 7-9 Some hexose derivatives important in biology. In amino sugars, an —NH 2 group replaces one of the —OH groups in the parent hexose. Substitution of OH for —OH produces a deoxy sugar; note that the deoxy sugars shown here occur in nature as the isomers. The acidic sugars contain a carboxylate group, which confers a negative charge δ at neutral pH. -Glucono- -lactone results from formation of an ester linkage between the C-1 carboxylate group and the C-5 (also known as the δ carbon) hydroxyl group of -gluconate. Oxidation of the carbonyl (aldehyde) carbon of glucose to the carboxyl level produces gluconic acid, used in medicine as an innocuous counterion when administering positively charged drugs (such as quinine) or ions (such as Ca ). Other aldoses yield 2+ other aldonic acids. Oxidation of the carbon at the other end of the carbon chain — C-6 of glucose, galactose, or mannose — forms the corresponding uronic acid: glucuronic, galacturonic, or mannuronic acid. Both aldonic acids and uronic acids form stable intramolecular esters called lactones (Fig. 7-9, lower le ). The sialic acids are a family of sugars with the same nine-carbon backbone. One of them, N-acetylneuraminic acid (o en referred to simply as “sialic acid”), is a derivative of N-acetylmannosamine that occurs in many glycoproteins and glycolipids on animal cell surfaces, providing sites of recognition by other cells or extracellular carbohydrate-binding proteins. The carboxylic acid groups of the acidic sugar derivatives are ionized at pH 7, and the compounds are therefore correctly named as the carboxylates — glucuronate, galacturonate, and so forth. In the synthesis and metabolism of carbohydrates, the intermediates are very o en not the sugars themselves but their 935 phosphorylated derivatives. Condensation of phosphoric acid with one of the hydroxyl groups of a sugar forms a phosphate ester, as in glucose 6-phosphate (Fig. 7-9), the first metabolite in the pathway by which most organisms oxidize glucose for energy. Sugar phosphates are relatively stable at neutral pH and bear a negative charge. One effect of sugar phosphorylation within cells is to trap the sugar inside the cell; most cells do not have plasma membrane transporters for phosphorylated sugars. Phosphorylation also activates sugars for subsequent chemical transformation. Several important phosphorylated derivatives of sugars are components of nucleotides (discussed in the next chapter). Sugars That Are, or Can Form, Aldehydes Are Reducing Sugars Free aldehyde groups in sugars undergo a characteristic redox reaction with Cu 2+ under alkaline conditions. As the sugar is oxidized from aldehyde to carboxylic acid, Cu 2+ is reduced to Cu + , which forms a brick-red precipitate. This reaction defines reducing sugars, which include, for example, glucose, galactose, mannose, ribose, and glyceraldehyde. The reaction occurs only with a free aldose; but because cyclic aldoses are in equilibrium with their linear forms, which do have free aldehyde groups (Fig. 7-6), all aldose monosaccharides are reducing sugars. Ketoses that can rearrange (tautomerize) to form aldehydes are also reducing sugars; fructose and ribulose are reducing sugars, for example. 936 The metabolism of glucose in people with diabetes can be monitored by measuring urinary glucose with a simple qualitative assay for reducing sugar, or by measuring quantitatively the nonenzymatic reaction between glucose and the hemoglobin in blood (Box 7-2). BOX 7-2 MEDICINE Blood Glucose Measurements in the Diagnosis and Treatment of Diabetes Glucose is the principal fuel for the brain. When the amount of glucose reaching the brain is too low, the consequences can be dire: lethargy, coma, permanent brain damage, and death. Complex hormonal mechanisms have evolved to ensure that the concentration of glucose in the blood remains high enough (about 5 m ) to satisfy the brain’s needs — but not too high, because elevated blood glucose can also have serious physiological consequences. Individuals with insulin-dependent diabetes mellitus do not produce sufficient insulin, the hormone that normally serves to reduce blood glucose concentration. If the diabetes is untreated, blood glucose levels may rise to several-fold higher than normal. These high glucose levels are believed to be at least one cause of the serious long-term consequences of untreated diabetes — kidney failure, cardiovascular disease, blindness, and impaired wound healing — so one goal of therapy is to provide just enough insulin (by injection) to keep blood glucose levels near normal. To maintain the correct balance of exercise, diet, and insulin, individuals with diabetes must measure their blood glucose concentration several times a day, and adjust the amount of insulin injected appropriately. The concentrations of glucose in blood can be determined by a simple assay for reducing sugar. A single drop of blood is added to a test strip containing the enzyme glucose oxidase, which catalyzes the reaction: 937 glucose oxidase D-Glucose + O2 −−−−−−−−→ D-glucono-δ-lactone + H2 O2 A second enzyme, a peroxidase, catalyzes reaction of the H 2 O2 with a colorless compound to create a colored product, which is quantified with a simple photometer that reads out the blood glucose concentration. Because blood glucose levels change with the timing of meals and exercise, single-time measurements do not reflect the average blood glucose over hours and days, so dangerous increases may go undetected. The average glucose concentration can be assessed by looking at its effect on hemoglobin, the oxygen-carrying protein in erythrocytes (p. 153). Transporters in the erythrocyte membrane equilibrate intracellular and plasma glucose concentrations, so hemoglobin is constantly exposed to glucose at whatever concentration is present in the blood. A series of relatively slow nonenzymatic reactions (Fig. 1) occurs between glucose and primary amino groups in hemoglobin (either the amino-terminal Val or the ε-amino groups of Lys residues) (Fig. 2). The rate of this process is proportional to the concentration of glucose, so the reaction can be used to estimate the average blood glucose level over weeks. The amount of glycated hemoglobin (GHB) present at any time reflects the average blood glucose concentration over the circulating lifetime of the erythrocyte (about 120 days), although the concentration in the two weeks before the test is the most important in setting the level of GHB. 938 FIGURE 1 The nonenzymatic reaction of glucose with a primary amino group in hemoglobin begins with formation of a Schiff base, which undergoes a rearrangement to generate a stable product; this ketoamine can further cyclize to yield GHB. 939 FIGURE 2 Glycated hemoglobin, showing the hemes (red) and the glycated Lys 99 residue (blue). [Data from PDB ID 3B75, N. T. Saraswathi et al., 2008.] The extent of hemoglobin glycation (so named to distinguish it from glycosylation, the enzymatic transfer of glucose to a protein) is measured clinically by extracting hemoglobin from a small sample of blood and separating GHB from unmodified hemoglobin electrophoretically (Fig. 3), taking advantage of the charge difference resulting from modification of the amino group(s). Normal values of the monoglycated hemoglobin referred to as HbA1c are about 5% of total hemoglobin (corresponding to an average blood glucose level of 120 mg/100 mL). In people with untreated diabetes, however, this value may be as high as 13%, indicating an average blood glucose level of about 300 mg/100 mL — dangerously high. One criterion for success in an individual program of insulin therapy (the timing, frequency, and amount of insulin injected) is maintaining HbA1c values at about 7%. 940 FIGURE 3 Pattern of hemoglobin (detected by its absorption at 415 nm) a er electrophoretic separation of nonglycated (A0) and monoglycated (A1c) forms in a thin glass capillary. Integration of the area under the peaks allows calculation of the amount of GHB (HbA1c) as a percentage of total hemoglobin. Shown here is the profile of an individual with a normal level of HbA1c (5.9%). Glycated hemoglobin undergoes a series of rearrangements, oxidations, and dehydrations of the carbohydrate moiety to produce a heterogeneous mixture of advanced glycation end products (AGE), such as ε-N-carboxymethyllysine and methylglyoxal. AGE are believed to be responsible for at least some of the pathology of diabetes. AGE can leave the erythrocyte and form covalent cross- links between proteins, interfering with normal protein function (Fig. 4). The accumulation of relatively high concentrations of AGE in people with diabetes may, by cross-linking critical proteins, cause the damage to the kidneys, 941 retinas, and cardiovascular system that characterizes the disease. Some AGE also can act through membrane receptors for AGE (RAGE), triggering intracellular responses that include activation of a transcription factor (NF-κB) and consequent changes in gene expression. This pathogenic process is a potential target for drug action. FIGURE 4 Pathways from glycated hemoglobin to the tissue damage associated with diabetes. Disaccharides (such as maltose, lactose, and sucrose) consist of two monosaccharides joined covalently by an O-glycosidic bond, which is formed when a hydroxyl group of one sugar molecule, typically in its cyclic form, reacts with the anomeric carbon of the other (Fig. 7-10). This reaction represents the formation of an acetal from a hemiacetal (such as glucopyranose) and an alcohol (a hydroxyl group of the second sugar molecule) (Fig. 7-5), and the 942 resulting compound is called a glycoside. Glycosidic bonds are readily hydrolyzed by acid but resist cleavage by base. Thus disaccharides can be hydrolyzed to yield their free monosaccharide components by boiling with dilute acid. FIGURE 7-10 Formation of maltose. A disaccharide is formed from two monosaccharides (here, two molecules of -glucose) when an —OH (alcohol) of one monosaccharide molecule (right) condenses with the intramolecular hemiacetal of the other (le ), with elimination of H 2O and formation of a glycosidic bond. The reversal of this reaction is hydrolysis — attack by H 2O on the glycosidic bond. The maltose molecule, shown here, retains a reducing hemiacetal at the C-1 not involved in the glycosidic bond. Because mutarotation interconverts the α and β forms of the hemiacetal, the bonds at this position are sometimes depicted with wavy lines to indicate that the structure may be either α or β. 943 When the anomeric carbon is involved in a glycosidic bond, the easy interconversion of linear and cyclic forms shown in Figure 7- 6 is prevented. Formation of a glycosidic bond therefore renders a sugar nonreducing. In describing disaccharides or polysaccharides, the end of a chain with a free anomeric carbon (one not involved in a glycosidic bond) is called the reducing end. The disaccharide maltose (Fig. 7-10) contains two D -glucose residues joined by a glycosidic linkage between C-1 (the anomeric carbon) of one glucose residue and C-4 of the other. Because the disaccharide retains a free anomeric carbon (C-1 of the glucose residue on the right in Fig. 7-10), maltose is a reducing sugar. The configuration of the anomeric carbon atom in the glycosidic linkage isα. The glucose residue with the free anomeric carbon is capable of existing in α- and β-pyranose forms. KEY CONVENTION To name reducing disaccharides such as maltose unambiguously, and especially to name more complex oligosaccharides, several rules are followed. By convention, the name describes the compound written with its nonreducing end to the le , and we can “build up” the name in the following order. (1) Give the configuration ( α or β) at the anomeric carbon joining the first monosaccharide unit (on the le ) to the second. (2) Name the nonreducing residue; to distinguish five- and six-membered ring structures, insert “furano” or “pyrano” into the name. (3) Indicate in parentheses the two carbon atoms joined by the glycosidic 944 bond, with an arrow connecting the two numbers; for example, (1→4) shows that C-1 of the first-named sugar residue is joined to C-4 of the second. (4) Name the second residue. If there is a third residue, describe the second glycosidic bond by the same conventions. (To shorten the description of complex polysaccharides, three-letter abbreviations or colored symbols for the monosaccharides are o en used, as given in Table 7-1.) Following this convention for naming oligosaccharides, maltose is α- -glucopyranosyl- D (1→4) -D -glucopyranose. Because most sugars encountered in this book are the D enantiomers and the pyranose form of hexoses predominates, we generally use a shortened version of the formal name of such compounds, giving the configuration of the anomeric carbon and naming the carbons joined by the glycosidic bond. In this abbreviated nomenclature, maltose is Glc (α1→4)Glc. TABLE 7-1 Symbols and Abbreviations for Common Monosaccharides and Some of Their Derivatives Abequose Abe Glucuronic acid GlcA Arabinose Ara Galactosamine GalN Fructose Fru Glucosamine GlcN Fucose Fuc N-Acetylgalactosamine GalNAc Galactose Gal N-Acetylglucosamine GlcNAc Glucose Glc Iduronic acid IdoA Mannose Muramic acid Mur 945 Man Rhamnose Rha N-Acetylmuramic acid Mur2Ac Ribose Rib N-Acetylneuraminic acid (a sialic Neu5Ac acid) Xylose Xyl Note: In a commonly used convention, hexoses are represented as circles, N- acetylhexosamines as squares, and hexosamines as squares divided diagonally. All sugars with the “gluco” configuration are blue, those with the “galacto” configuration are yellow, and “manno” sugars are green. Other substituents can be added as needed: sulfate (S), phosphate (P), O-acetyl (OAc), or O-methyl (OMe). The disaccharide lactose (Fig. 7-11), which yields D -galactose and D -glucose on hydrolysis, occurs naturally in milk and gives milk its sweetness. The anomeric carbon of the glucose residue is available for oxidation, and thus lactose is a reducing disaccharide. The reducing end of this disaccharide, which by convention is drawn on the right, is glucose, and the disaccharide is named as a derivative of glucose. Its abbreviated name is Gal (β1→4)Glc , with the linkage shown in parentheses; the anomeric carbon of galactose is β, and C-1 of Gal is linked to C-4 of Glc. 946 FIGURE 7-11 Three common disaccharides. Like maltose in Figure 7-10, these disaccharides are shown as Haworth perspectives. The common name, full systematic name, and abbreviation are given. Formal nomenclature for sucrose names glucose as the parent glycoside, although it is typically depicted as shown, with glucose on the le. The two abbreviated symbols shown for sucrose are equivalent (≡). 947 The enzyme lactase — absent in lactose-intolerant individuals — begins the digestive process in the small intestine by splitting the (β1→4) bond of lactose into monosaccharides, which can be absorbed from the small intestine. Lactose, like other disaccharides, is not absorbed from the small intestine, and in lactose-intolerant individuals the undigested lactose passes into the large intestine. Here, the increased osmolarity due to dissolved lactose opposes the absorption of water from the intestine into the bloodstream, causing watery, loose stools. In addition, fermentation of the lactose by intestinal bacteria produces large volumes of CO , which leads to the bloating, 2 cramps, and gas associated with lactose intolerance. Sucrose (table sugar) is a disaccharide of glucose and fructose. It is formed by plants but not by animals. In contrast to maltose and lactose, sucrose contains no free anomeric carbon atom; the anomeric carbons of both monosaccharide units are involved in the glycosidic bond (Fig. 7-11). Sucrose is therefore a nonreducing sugar, and its stability — its resistance to oxidation — makes it a suitable molecule for the storage and transport of energy in plants. In the abbreviated nomenclature, a double-headed arrow connects the symbols specifying the anomeric carbons and their configurations. Thus the abbreviated name of sucrose is either Glc(α1↔2β) Fru or Fru(β2↔1α)Glc. Sucrose is a major intermediate product of photosynthesis; in many plants it is the principal form in which sugar is transported from the leaves to other parts of the plant body. 948 Trehalose, Glc(α1↔1α)Glc (Fig. 7-11) — a disaccharide of D - glucose that, like sucrose, is a nonreducing sugar — is a major constituent of the circulating fluid (hemolymph) of insects. It serves as an energy-storage compound, and it serves as antifreeze in some invertebrate organisms. SUMMARY 7.1 Monosaccharides and Disaccharides Sugars (saccharides) contain an aldehyde or ketone group and two or more hydroxyl groups. Monosaccharides generally contain several chiral carbons and therefore exist in a variety of stereochemical forms, which may be represented on paper as Fischer projections. Epimers are sugars that differ in configuration at only one carbon atom. Monosaccharides with at least six carbons commonly form cyclic structures, in which the aldehyde or ketone group is joined to a hydroxyl group of the same molecule. The cyclic structure can be represented as a Haworth perspective formula. The carbon atom originally found in the aldehyde or ketone group (the anomeric carbon) can assume either of two configurations, α and β, which are interconvertible by mutarotation. In the linear form of the monosaccharide, which is in equilibrium with the cyclic forms, the anomeric carbon is easily oxidized, making the compound a reducing sugar. Naturally occurring hexoses include some that have —NH at 2 C-2 (amino sugars), o en with the amino group acetylated. Oxidation of the carbonyl carbon of glucose and other aldoses yields aldonic acids (gluconic acid); oxidation at C-6 produces 949 uronic acids (glucuronate). Some sugar intermediates are phosphate esters (for example, glucose 6-phosphate). A hydroxyl group of one monosaccharide can add to the anomeric carbon of a second monosaccharide to form an acetal called a glycoside. Oligosaccharides are short polymers of several different monosaccharides joined by glycosidic bonds. At one end of the chain, the reducing end, is a monosaccharide residue with its anomeric carbon not involved in a glycosidic bond, and therefore available to be oxidized. Disaccharides and oligosaccharides are named as derivatives of the sugar at the reducing end. Their names provide the order of monosaccharide units, the configuration at each anomeric carbon, and the carbon atoms involved in the glycosidic linkage(s). 950 7.2 Polysaccharides Most carbohydrates found in nature occur as polysaccharides, polymers of medium to high molecular weight (M r > 20,000). Polysaccharides, also called glycans, differ from each other in the identity of their recurring monosaccharide units, in the length of their chains, in the types of bonds linking the units, and in the degree of branching. Homopolysaccharides contain only a single monomeric sugar species; heteropolysaccharides contain two or more kinds of monomers (Fig. 7-12). Some homopolysaccharides serve as storage forms of monosaccharides that are used as fuels; starch and glycogen are homopolysaccharides of this type. Other homopolysaccharides (cellulose and chitin, for example) serve as structural elements in plant cell walls and animal exoskeletons. Heteropolysaccharides provide extracellular support for organisms of all kingdoms. For example, the rigid layer of the bacterial cell envelope (the peptidoglycan) is composed in part of a heteropolysaccharide built from two alternating monosaccharide units (see Fig. 6-32). In animal tissues, the extracellular space is occupied by several types of heteropolysaccharides, which form a matrix that holds individual cells together and provides protection, shape, and support to cells, tissues, and organs. 951 FIGURE 7-12 Homopolysaccharides and heteropolysaccharides. Polysaccharides may be composed of one, two, or several different monosaccharides, in straight or branched chains of varying length. Unlike proteins, polysaccharides generally do not have defined lengths or molecular weights. This difference is a consequence of the mechanisms of assembly of the two types of polymer. As we shall see in Chapter 27, proteins are synthesized on a template (messenger RNA) of defined sequence and length, by enzymes that follow the template exactly. For polysaccharide synthesis there is no template; rather, the program for polysaccharide synthesis is intrinsic to the enzymes that catalyze the polymerization of the monomeric units, and there is no specific stopping point in the synthetic process; the products thus vary in length. 952 Some Homopolysaccharides Are Storage Forms of Fuel The most important storage polysaccharides are starch in plant cells and glycogen in animal cells. Both polysaccharides occur intracellularly as large clusters or granules. Starch and glycogen molecules are heavily hydrated, because they have many exposed hydroxyl groups available to hydrogen- bond with water. Most plant cells have the ability to form starch, and starch storage is especially abundant in tubers (underground stems), such as potatoes, and in seeds. Starch contains two types of glucose polymer, amylose and amylopectin (Fig. 7-13). Amylose consists of long, unbranched chains of D -glucose residues connected by (α1→4) linkages (as in maltose). Such chains vary in molecular weight from a few thousand to more than a million. Amylopectin is even larger (M up to 200 million) but unlike amylose is highly branched. The r glycosidic linkages joining successive glucose residues in amylopectin chains are (α1→4); the branch points (occurring every 24 to 30 residues) are (α1→6) linkages. 953 FIGURE 7-13 Starch and glycogen. (a) A short segment of a linear polymer of -glucose residues in (α1→4) linkage, shown here as Haworth perspectives. This is the basic structure of amylose of starch (in plants) and glycogen (in animals). A single chain of amylose can contain several thousand glucose residues. (b) An (α1→6) branch point of amylopectin or glycogen. (c) A cluster of amylose and amylopectin like that believed to occur in starch granules. Strands of amylopectin (black) form double- helical structures with each other or with amylose strands (blue). Amylopectin has frequent (α1→6) branch points (red). Glucose residues at the nonreducing ends of the outer branches are removed enzymatically during the mobilization of starch for energy production. Glycogen has a similar structure but is more highly branched and more compact. Glycogen is the main storage polysaccharide of animal cells. Like amylopectin, glycogen is a polymer of (α1→4)-linked glucose subunits, with (α1→6) -linked branches, but glycogen is more extensively branched (on average, a branch every 8 to 12 residues) and more compact than starch. Glycogen is especially abundant in the liver, where it may constitute as much as 7% of the wet weight; it is also present in skeletal muscle. In hepatocytes, glycogen is found in large α granules (see Fig. 15-1), which are clusters of smaller β granules, each of which is a single, highly branched glycogen molecule with an average molecular weight of several million. The large α glycogen granules also contain, in tightly bound form, the enzymes responsible for the synthesis and degradation of glycogen (see Fig. 15-16). Because each branch of glycogen ends with a nonreducing sugar unit, a glycogen molecule with n branches has n + 1 nonreducing ends, but only one reducing end. When glycogen is used as an energy source, glucose units are removed one at a time from the nonreducing ends. Degradative enzymes that act only at nonreducing ends can work simultaneously on the many branches, speeding the conversion of the polymer to monosaccharides. Why not store glucose in its monomeric form? Hepatocytes in the fed state store glycogen equivalent to a glucose concentration of 0.4 M. The actual concentration of glycogen, which contributes little to the osmolarity of the cytosol, is about 0.01 μ M. If the cytosol contained 0.4 M glucose, the 954 osmolarity would be threateningly elevated, leading to osmotic entry of water that might rupture the cell (see Fig. 2-12). Some Homopolysaccharides Serve Structural Roles Cellulose, a tough, fibrous, water-insoluble substance, is found in the cell walls of plants, particularly in stalks, stems, trunks, and all the woody portions of the plant body. Cellulose constitutes much of the mass of wood, and cotton is almost pure cellulose. Like amylose, the cellulose molecule is a linear, unbranched homopolysaccharide, consisting of 10,000 to 15,000 D - glucose units. But there is a very important difference: in cellulose the glucose residues have the β configuration (Fig. 7-14), whereas in amylose the glucose is in the α configuration. The glucose residues in cellulose are linked by (β1→4) glycosidic bonds, in contrast to the (α1→4) bonds of amylose. This difference causes individual molecules of cellulose and amylose to fold differently in space, giving them very different macroscopic structures and physical properties. FIGURE 7-14 Cellulose. Two units of a cellulose chain; the -glucose residues are in (β1→4) linkage. The rigid chair structures can rotate relative to one another. The tough, fibrous nature of cellulose makes it useful in such commercial products as cardboard and insulation material, and it is a major constituent 955 of cotton and linen fabrics. Cellulose is also the starting material for the commercial production of cellophane, rayon, and lyocell. Chitin is a linear homopolysaccharide composed of N-acetylglucosamine residues in (β1→4) linkage (Fig. 7-15). The only chemical difference from cellulose is the replacement of the hydroxyl group at C-2 with an acetylated amino group, which makes chitin more hydrophobic and water-resistant than cellulose. Chitin forms extended fibers similar to those of cellulose, and like cellulose cannot be digested by vertebrates. Chitin is the principal component of the hard exoskeletons of nearly a million species of arthropods — insects, lobsters, and crabs, for example — and is probably the second most abundant polysaccharide, next to cellulose, in nature; an estimated 1 billion tons of chitin are produced in the biosphere each year. FIGURE 7-15 Chitin. (a) A short segment of chitin, a homopolymer of N-acetyl- -glucosamine units in (β1→4) linkage. (b) A spotted June beetle (Pelidnota punctata), showing its surface armor (exoskeleton) of chitin. Steric Factors and Hydrogen Bonding Influence Homopolysaccharide Folding The folding of polysaccharides in three dimensions follows the same principles as those governing polypeptide structure: subunits with a more- or-less rigid structure dictated by covalent bonds form three-dimensional macromolecular structures that are stabilized by weak interactions within or between molecules, such as hydrogen bonds, interactions due to the 956 hydrophobic effect, van der Waals interactions, and, for polymers with charged subunits, electrostatic interactions. Because polysaccharides have so many hydroxyl groups, hydrogen bonding has an especially important influence on their structure. Glycogen, starch, cellulose, and chitin are composed of pyranoside (six-membered ring) subunits, as are the oligosaccharides of glycoproteins and glycolipids, to be discussed later. Such molecules can be represented as a series of rigid pyranose rings connected by an oxygen atom bridging two carbon atoms (the glycosidic bond). There is, in principle, free rotation about both COO bonds linking the residues, but as in polypeptides (see Figs 4-2, 4-8), rotation about each bond is limited by steric hindrance by substituents. The three-dimensional structures of these molecules can be described in terms of the dihedral angles, ϕ and ψ, about the glycosidic bond (Fig. 7-16). The bulkiness of the pyranose ring and its substituents, along with electronic effects at the anomeric carbon, place constraints on the angles ϕ and ψ; thus, certain conformations are much more stable than others. 957 FIGURE 7-16 Different energetic conformations of a disaccharide. The torsion angles ϕ (phi) and ψ (psi), which define the spatial relationship between adjacent rings, can in principle have any value from 0 to 360. In fact, some torsion angles give conformations ° ° that maximize hydrogen bonding and minimize energy, whereas others give conformations that are sterically hindered, as we can see in two conformers of the disaccharide Gal(β1→3)Gal at opposite energetic extremes. The most stable three-dimensional structure for the (α1→4)-linked chains of starch and glycogen is a tightly coiled helix (Fig. 7-17), stabilized by interchain hydrogen bonds. The average plane of each residue along the amylose chain forms a 60 angle with the average plane of the preceding ° residue, so the helical structure has six residues per turn. For amylose, the core of the helix is of precisely the right dimensions to accommodate iodine as complex ions (I − 3 and I − 5 ). This interaction gives an intensely blue product, making it a common qualitative test for amylose. 958 FIGURE 7-17 Helical structure of starch (amylose). (a) Four glucose units of starch (amylose), showing both the free rotation possible around the ψ and ϕ angles of the (α1→4) glycosidic bonds and the hydrogen bonding possible between adjacent rings. In this most stable conformation, with adjacent rigid chairs at 60 to one another, the ° polysaccharide chain is curved. (b) A model of a segment of amylopectin (branched amylose). The conformation of (α1→4) linkages in amylose, amylopectin, and glycogen causes these polymers to assume tightly coiled helical structures. [(b) Data from www.biotopics.co.uk/jsmol/amylopectin.html.] 959 For cellulose, the most stable conformation is that in which each chair is turned 180 relative to its neighbors, yielding a straight, extended ° chain. All —OH groups are available for hydrogen bonding with neighboring chains. With several chains lying side by side, a stabilizing network of interchain and intrachain hydrogen bonds produces straight, stable supramolecular fibers of great tensile strength (Fig. 7-18). This property of cellulose has made it useful to civilizations for millennia. Many manufactured products, including papyrus, paper, cardboard, rayon, insulating tiles, and a variety of other useful materials, are derived from cellulose. The water content of these materials is low because extensive interchain hydrogen bonding between cellulose molecules satisfies their capacity for hydrogen-bond formation. FIGURE 7-18 The linear structure of cellulose chains. (a) Four glucose units of cellulose. In the most stable conformation of cellulose, the (β1→4) glycosidic linkages put the adjacent rigid chairs at 180 relative to each other. (b) A model of straight, unbranched cellulose ° segments, held together in layers stabilized by interchain hydrogen bonds. [(b) Data from Cornell, B. 2016. Sugar Polymers. Available at: http://ib.bioninja.com.au. (Accessed 11 March 2020).] 960 Peptidoglycan Reinforces the Bacterial Cell Wall The rigid component of bacterial cell walls (peptidoglycan) is a heteropolymer of alternating (β1→4)-linked N-acetylglucosamine and N- acetylmuramic acid residues (see Fig. 6-32). The linear polymers lie side by side in the cell wall, cross-linked by short peptides, the exact structure of which depends on the bacterial species. The peptide cross-links weld the polysaccharide chains into a strong sheath (peptidoglycan) that envelops the entire cell and prevents cellular swelling and lysis due to the osmotic entry of water. The enzyme lysozyme kills bacteria by hydrolyzing the (β1→4) glycosidic bond between N-acetylglucosamine and N-acetylmuramic acid. The enzyme is found in human tears, where it is presumably a defense against bacterial infections of the eye, and in hen’s eggs, where it prevents bacterial infection of the developing embryo. Penicillin and related antibiotics kill bacteria by preventing synthesis of the peptidoglycan cross-links, leaving the cell wall too weak to resist osmotic lysis (p. 211). Glycosaminoglycans Are Heteropolysaccharides of the Extracellular Matrix The extracellular space in the tissues of multicellular animals is filled with a gel-like material, the extracellular matrix (ECM), which holds the cells together and provides a porous pathway for the diffusion of nutrients and oxygen to individual cells. The ECM is composed of an interlocking meshwork of heteropolysaccharides (also called ground substance) and fibrous proteins such as fibrillar collagens, elastins, and fibronectins. The 961 basement membrane is a specialized ECM that underlies epithelial cells; it consists of specialized collagens, laminins, and heteropolysaccharides. These heteropolysaccharides, the glycosaminoglycans, are a family of linear polymers composed of repeating disaccharide units (Fig. 7-19). They are unique to animals and bacteria and are not found in plants. One of the two monosaccharides is always either N-acetylglucosamine or N- acetylgalactosamine; the other is in most cases a uronic acid, usually D - glucuronic or L -iduronic acid. Some glycosaminoglycans contain esterified sulfate groups. The combination of sulfate groups and the carboxylate groups of the uronic acid residues gives glycosaminoglycans a very high density of negative charge. To minimize the repulsive forces among neighboring charged groups, these molecules assume an extended conformation in solution, forming a rodlike helix in which the negatively charged carboxylate groups occur on alternate sides of the helix (as shown for heparin in Fig. 7-19). The extended rod form also provides maximum separation between the negatively charged sulfate groups. The specific patterns of sulfated and nonsulfated sugar residues in glycosaminoglycans allow specific recognition by a variety of protein ligands that bind electrostatically to these molecules. The sulfated glycosaminoglycans are attached to extracellular proteins to form proteoglycans (Section 7.3). 962 FIGURE 7-19 Repeating units of some common glycosaminoglycans of extracellular matrix. The molecules are copolymers of alternating uronic acid and amino sugar residues (keratan sulfate is the exception), with sulfate esters in any of several positions, except in hyaluronan. The ionized carboxylate and sulfate groups give these polymers their characteristic high negative charge. Therapeutic heparin contains primarily iduronic acid 963 (IdoA) and a smaller proportion of glucuronic acid (not shown) and is generally highly sulfated and heterogeneous in length. The space-filling model shows a heparin segment as its structure in solution, as determined by NMR spectroscopy. Carbons in iduronic acid sulfate are colored blue; those in glucosamine sulfate are green. Oxygen and sulfur are shown in red and yellow, respectively. Hydrogen atoms are not shown (for clarity). [Data for molecular model from PDB ID 1HPN, B. Mulloy et al., Biochem. J. 293:849, 1993.] The glycosaminoglycan hyaluronan (hyaluronic acid) contains alternating residues of D -glucuronic acid and N-acetylglucosamine (Fig. 7-19). With up to 50,000 repeats of the basic disaccharide unit, hyaluronan has a molecular weight of several million; it forms clear, highly viscous, noncompressible solutions that serve as lubricants in the synovial fluid of joints and give the vitreous humor of the vertebrate eye its jellylike consistency (the Greek hyalos means “glass”; hyaluronan can have a glassy or translucent appearance). Hyaluronan is also a component of the ECM of cartilage and tendons, to which it contributes tensile strength and elasticity as a result of its strong noncovalent interactions with other components of the matrix. Hyaluronidase, an enzyme secreted by some pathogenic bacteria, can hydrolyze the glycosidic linkages of hyaluronan, rendering tissues more susceptible to bacterial invasion. In many animal species, a similar enzyme in sperm hydrolyzes the outer glycosaminoglycan coat around an ovum, allowing sperm penetration. Other glycosaminoglycans differ from hyaluronan in three respects: they are generally much shorter polymers, they are covalently linked to specific proteins (proteoglycans), and one or both monomeric units differ from those of hyaluronan. Chondroitin sulfate (Greek chondros, “cartilage”) contributes to the tensile strength of cartilage, tendons, ligaments, heart valves, and the walls of the aorta. Dermatan sulfate (Greek derma, “skin”) contributes to the pliability of skin and is also present in blood vessels and heart valves. In this polymer, many of the glucuronate residues present in chondroitin sulfate are replaced by their C-5 epimer, L -iduronate (IdoA). 964 Keratan sulfates (Greek keras, “horn”) have no uronic acid, and their sulfate content is variable. They are present in cornea, cartilage, bone, and a variety of horny structures formed from dead cells: horn, hair, hoofs, nails, and claws. Heparan sulfate (Greek hēpar, “liver”; it was originally isolated from dog liver) contains variable, nonrandom arrangements of sulfated and nonsulfated sugars. The exact sequence of sulfated residues gives the molecule the ability to interact specifically with a large number of proteins, including growth factors and ECM components, as well as various enzymes and factors present in plasma. Heparin is a highly sulfated, intracellular form of heparan sulfate produced primarily by mast cells (a type of leukocyte, or immune cell). Its physiological role is not yet clear, but purified heparin is used as a therapeutic agent to inhibit coagulation of blood through its capacity to bind the protease inhibitor antithrombin (see Fig. 7-24). Table 7-2 summarizes the composition, properties, roles, and occurrence of the polysaccharides described in Section 7.2. TABLE 7-2 Structures and Roles of Some Polysaccharides Polymer Typea Repeating Size (number of Roles/significance unitb monosaccharide units) Starch Energy storage: in plants 965 Amylose Homo- (α1→4)Glc , 50–5,000 linear Amylopectin Homo- (α1→4)Glc , Up to 10 6 with (α1→6)Glc branches every 24–30 residues Glycogen Homo- (α1→4)Glc , Up to 50,000 Energy storage: in with bacteria and (α1→6)Glc animal cells branches every 8–12 residues Cellulose Homo- (β1→4)Glc Up to 15,000 Structural: in plants, gives rigidity and strength to cell walls Chitin Homo- (β1→4)Glc NAc Very large Structural: in insects, spiders, crustaceans, gives rigidity and strength to exoskeletons Dextran Homo- (α1→6)Glc , Wide range Structural: in with (α1→3) bacteria, branches extracellular adhesive Peptidoglycan Hetero-; 4)Mur2Ac Very large Structural: in peptides (β1→4) GlcNAc bacteria, gives attached (β1 rigidity and strength to cell envelope Hyaluronan (a Hetero-; 4)GlcA (β1→3) Up to 100,000 Structural: in glycosaminoglycan) acidic GlcNAc (β1 vertebrates, extracellular matrix of skin and connective tissue; viscosity and lubrication in joints 966 aEach polymer is classified as a homopolysaccharide (homo-) or heteropolysaccharide (hetero-). bThe abbreviated names for the peptidoglycan, agarose, and hyaluronan repeating units indicate that the polymer contains repeats of this disaccharide unit. For example, in peptidoglycan, the GlcNAc of one disaccharide unit is (β1→4)-linked to the first residue of the next disaccharide unit. SUMMARY 7.2 Polysaccharides Homopolysaccharides contain only a single monomeric sugar species; heteropolysaccharides contain two or more kinds of monomers. The homopolysaccharides starch and glycogen are storage fuels in plant, animal, and bacterial cells. They consist of D -glucose units with (α1→4) linkages, and both contain some branches. The homopolysaccharides cellulose, chitin, and dextran serve structural roles. Cellulose, composed of (β1→4)-linked D -glucose residues, lends strength and rigidity to plant cell walls. Chitin, a polymer of (β1→4)-linked N-acetylglucosamine, strengthens the exoskeletons of arthropods. Homopolysaccharides assume stable conformations dictated by weak interactions. The chair form of the pyranose ring is essentially rigid, so the conformation of the polymers is determined by rotation about the C—O bonds of the glycosidic linkage. Starch and glycogen form helical structures with intrachain hydrogen bonding; cellulose and chitin form long, straight strands that interact with neighboring strands. Bacterial cell walls are strengthened by peptidoglycan in which the repeating disaccharide is GlcNAc(β1→4) Mur2Ac. Glycosaminoglycans are extracellular heteropolysaccharides in which one of the two monosaccharide units is a uronic acid (keratan sulfate is an exception) and the other is an N-acetylated amino sugar. The high density of negative charge on these molecules forces them to assume extended conformations. These polymers (hyaluronan, chondroitin sulfate, dermatan sulfate, and keratan sulfate) provide viscosity, adhesiveness, and tensile strength to the extracellular matrix. 967 7.3 Glycoconjugates: Proteoglycans, Glycoproteins, and Glycolipids In addition to their important roles as fuel stores (starch, glycogen, dextran) and as structural materials (cellulose, chitin, peptidoglycans), polysaccharides and oligosaccharides are information carriers. Some provide communication between cells and their extracellular surroundings; others label proteins for transport to and localization in specific organelles, or for destruction when the protein is malformed or superfluous; and others serve as recognition sites for extracellular signal molecules (growth factors, for example) or extracellular parasites (bacteria or viruses). On almost every eukaryotic cell, specific oligosaccharide chains attached to components of the plasma membrane form a carbohydrate layer (the glycocalyx), several nanometers thick, that serves as an information-rich surface that the cell shows to its surroundings. These oligosaccharides are central players in cell-cell recognition and adhesion, cell migration during development, blood clotting, the immune response, wound healing, and other cellular processes. In most of these cases, the informational carbohydrate is covalently joined to a protein or a lipid to form a glycoconjugate, which is the biologically active molecule (Fig. 7-20). 968 FIGURE 7-20 Glycoconjugates. The structures of some typical proteoglycans, glycoproteins, and glycosphingolipids described in the text. Proteoglycans are macromolecules of the cell surface or ECM in which one or more sulfated glycosaminoglycan chains are joined covalently to a membrane protein or a secreted protein. The glycosaminoglycan chain can bind to extracellular proteins through electrostatic interactions between the protein and the negatively charged sugar moieties on the proteoglycan. Proteoglycans are major components of all extracellular matrices. 969 Glycoproteins have one or several oligosaccharides of varying complexity joined covalently to a protein. They are usually found on the outer face of the plasma membrane (as part of the glycocalyx), in the ECM, and in the blood. Inside cells, they are found in specific organelles such as Golgi complexes (where the oligosaccharide moieties are added to the proteins), secretory granules, and lysosomes. The oligosaccharide portions of glycoproteins are very heterogeneous and, like glycosaminoglycans, are rich in information, forming highly specific sites for recognition and high-affinity binding by carbohydrate-binding proteins called lectins. Some cytosolic and nuclear proteins can be glycosylated as well. Glycolipids are plasma membrane components in which the hydrophilic head groups are oligosaccharides. Glycosphingolipids are a class of glycolipids with a specific backbone structure that will be considered in more detail in Chapter 10. They have complex oligosaccharide components that act as specific sites for recognition by lectins, as they do in glycoproteins. Neurons are rich in glycosphingolipids, which act in nerve conduction and myelin formation. Glycosphingolipids also play a role in signal transduction in cells. Proteoglycans Are Glycosaminoglycan-Containing 970 Macromolecules of the Cell Surface and Extracellular Matrix Mammalian cells can produce dozens of types of proteoglycans. Many are secreted into the ECM, where they act as tissue organizers and influence cellular activities, such as growth factor activation and adhesion. The basic proteoglycan unit consists of a “core protein” with covalently attached glycosaminoglycan(s). The point of attachment is a Ser residue, to which the glycosaminoglycan is joined through a tetrasaccharide linker (Fig. 7-21). FIGURE 7-21 Proteoglycan structure, showing the tetrasaccharide bridge. A typical tetrasaccharide linker (blue) connects a glycosaminoglycan — in this case, chondroitin sulfate (orange) — to a Ser residue in the core protein. The xylose residue at the reducing end of the linker is joined by its anomeric carbon to the hydroxyl of the Ser residue. 971 Some proteoglycans are integral membrane proteins (see Fig. 11- 9). For example, the basal lamina, a specialized layer of the ECM that separates two organized groups of cells, contains a family of core proteins (M 20,000 to 40,000), each with several covalently r attached heparan sulfate chains. There are two major families of membrane heparan sulfate proteoglycans. Syndecans have a single transmembrane domain and an extracellular domain bearing three to five chains of heparan sulfate and, in some cases, chondroitin sulfate (Fig. 7-22a). Glypicans are attached to the membrane by a GPI anchor, a glycosylated derivative of the membrane lipid phosphatidylinositol (see Fig. 11-16). Both syndecans and glypicans can be shed into the extracellular space. A protease in the ECM that cuts proteins close to the membrane surface releases syndecan ectodomains (domains outside the plasma membrane), and a phospholipase that breaks the connection to the GPI anchor releases glypicans. These mechanisms provide a way for a cell to change its surface features quickly. Shedding is highly regulated and is activated in proliferating cells, such as cancer cells. Proteoglycan shedding is involved in cell-cell recognition and adhesion, and in the proliferation and differentiation of cells. Numerous chondroitin sulfate and dermatan sulfate proteoglycans also exist, some as membrane-bound entities, others as secreted products in the ECM. 972 FIGURE 7-22 Two families of membrane proteoglycans. (a) Syndecans and glypicans can be shed from the membrane by enzymatic cleavage near the outer membrane surface. (b) Along a heparan sulfate chain, regions rich in sulfated sugars, the NS domains (green), alternate with regions with chiefly unmodified residues of GlcNAc and GlcA, the NA domains (gray). One of the NS domains is shown in more detail, revealing a high density of modified residues: GlcNS (N-sulfoglucosamine) and both GlcA and IdoA, with a sulfate ester at C-2. [(a) Information from U. Häcker et al., Nature Rev. Mol. Cell Biol. 6:530, 2005. (b) Information from J. Turnbull et al., Trends Cell Biol. 11:75, 2001.] The glycosaminoglycan chains in proteoglycans can bind to a variety of extracellular ligands and thereby modulate the ligands’ 973 interaction with specific receptors of the cell surface. Detailed studies of heparan sulfate demonstrate a domain structure that is not random; some domains (typically three to eight disaccharide units long) differ from neighboring domains in sequence and in ability to bind to specific proteins. Highly sulfated domains (called NS domains) alternate with domains having unmodified GlcNAc and GlcA residues (N-acetylated, or NA, domains) (Fig. 7- 22b). The exact pattern of sulfation in the NS domain depends on the particular proteoglycan; given the number of possible modifications of the GlcNAc–IdoA (iduronic acid) dimer, at least 32 different disaccharide units are possible. Furthermore, the same core protein can display different heparan sulfate structures when synthesized in different cell types. Heparan sulfate molecules with precisely organized NS domains bind specifically to extracellular proteins and signaling molecules to alter their activities. The change in activity may result from a conformational change in the protein that is induced by the binding (Fig. 7-23a), or it may be due to the ability of adjacent domains of heparan sulfate to bind to two different proteins, bringing them into close proximity and enhancing protein- protein interactions (Fig. 7-23b). A third general mechanism of action is the binding of extracellular signal molecules (growth factors, for example) to heparan sulfate, which increases their local concentrations and enhances their interaction with growth factor receptors on the cell surface; in this case, the heparan sulfate acts as a coreceptor (Fig. 7-23c). For example, fibroblast growth factor (FGF), an extracellular protein signal that stimulates cell division, first binds to heparan sulfate moieties of 974 syndecan molecules in the target cell’s plasma membrane. Syndecan presents FGF to the FGF plasma membrane receptor, and only then can FGF interact productively with its receptor to trigger cell division. Finally, in another type of mechanism, the NS domains interact — electrostatically and otherwise — with a variety of soluble molecules outside the cell, maintaining high local concentrations at the cell surface (Fig. 7-23d). FIGURE 7-23 Four types of protein interactions with NS domains of heparan sulfate. [Information from J. Turnbull et al., Trends Cell Biol. 11:75, 2001.] The protease thrombin, essential to blood coagulation (see Fig. 6- 44), is inhibited by another blood protein, antithrombin, which prevents premature blood clotting. Antithrombin does not bind to or inhibit thrombin in the absence of heparan sulfate. In the presence of heparan sulfate or heparin, the binding affinity of thrombin for antithrombin increases 2,000-fold, and thrombin is 975 strongly inhibited. When thrombin and antithrombin are crystallized in the presence of a short (16 residue) segment of heparan sulfate, the negatively charged heparan sulfate mimic is seen to bridge positively charged regions of the two proteins, causing an allosteric change that inhibits thrombin’s protease activity (Fig. 7-24). The binding sites for heparan sulfate and heparin in both proteins are rich in Arg and Lys residues; the amino acids’ positive charges interact electrostatically with the sulfates of the glycosaminoglycans. Antithrombin also inhibits two other blood coagulation proteins (factors IXa and Xa) in a heparan sulfate–dependent process. 976 FIGURE 7-24 Molecular basis for heparan sulfate enhancement of the binding of thrombin to antithrombin. In this crystal structure of thrombin, antithrombin, and a 16 residue heparan sulfate–like polymer, all crystallized together, the binding sites for heparan sulfate in both proteins are rich in Arg and Lys residues. These positively charged regions, shown in blue in this electrostatic representation of the proteins, allow strong electrostatic interaction with multiple negatively charged sulfates and carboxylates of the heparan sulfate. Consequently, the affinity of antithrombin for thrombin is three orders of magnitude greater in the presence of heparan sulfate than in its absence. [Data from PDB ID 1TB6, W. Li et al., Nat. Struct. Mol. Biol. 11:857, 2004.] 977 The importance of correctly synthesizing sulfated domains in heparan sulfate is demonstrated in mutant (“knockout”) mice lacking the enzyme that sulfates the C-2 hydroxyl of iduronate (IdoA). These animals are born without kidneys and with severe developmental abnormalities of the skeleton and eyes. Other studies demonstrate that membrane proteoglycans are important in the liver for clearing lipoproteins from the blood. Finally, there is growing evidence that proteoglycans containing heparan sulfate and chondroitin sulfate provide directional cues for axon outgrowth, influencing the path taken by developing axons in the nervous system. The functional importance of proteoglycans and the glycosaminoglycans associated with them can also be seen in the effects of mutations that block the synthesis or degradation of these polymers in humans (Box 7-3). BOX 7-3 MEDICINE Defects in the Synthesis or Degradation of Sulfated Glycosaminoglycans Can Lead to Serious Human Disease Glycosaminoglycan synthesis requires enzymes that activate monomeric sugars, transport them across membranes, condense the activated sugars into polysaccharides, and add sulfates. Mutations in any of these enzymes in humans can lead to structural defects in the glycosaminoglycan (or in the proteoglycans formed from them). The result can be any of a wide variety of defects in cell signaling, cell proliferation, tissue morphogenesis, or interactions with growth factors (Fig. 1). For example, failure to extend the 978 disaccharide unit GlcNAc-GlcA leads to a bone abnormality in which multiple, large bone spurs develop (Fig. 2). FIGURE 1 A segment of proteoglycan showing the normal structure of the glycosaminoglycans (GAGs) chondroitin sulfate or dermatan sulfate (CS/DS) (top) and heparan sulfate or heparin (HS/Hep) (bottom), attached through the linkage region to a Ser residue in the core protein. When a specific biosynthetic enzyme is absent because of a mutation, the numbered elements cannot be added to the growing oligosaccharide, and the product is truncated. The dysfunctional GAGs result in several types of human disease, depending on the site of truncation: progeroid-type Ehlers-Danlos syndrome — with hyperextensible joints, fragile skin, and premature aging; short stature or frequent joint dislocations; neuropathy (nerve damage); skeletal defects; bipolar disorder or diaphragmatic hernia; and bone deformations in the form of large bone spurs. 979 FIGURE 2 Bone deformation characteristic of multiple hereditary exostoses, a disease resulting from a genetic inability to add the GlcNAc-GlcA disaccharide to the growing heparan sulfate or heparin chain (see in Fig. 1). The extra bone growth is artificially colored red in this x-ray of the humerus (upper arm bone). When the defect occurs in degradative enzymes, the accumulation of incompletely degraded glycosaminoglycans can produce diseases ranging from moderate, as in Scheie syndrome, with joint stiffening but normal intelligence 980 and life span, to severe, as in Hurler syndrome, with enlarged internal organs, heart disease, dwarfism, intellectual disability, and early death. Glycosaminoglycans were formerly called mucopolysaccharides, and diseases caused by genetic defects in their breakdown are o en still called mucopolysaccharidoses. Some proteoglycans can form proteoglycan aggregates, enormous supramolecular assemblies of many core proteins all bound to a single molecule of hyaluronan (also called hyaluronate). Aggrecan core protein (M r ∼250,000) has multiple chains of chondroitin sulfate and keratan sulfate, joined to Ser residues in the core protein through trisaccharide linkers, to give an aggrecan monomer of M r ∼2 × 10 6. When a hundred or more of these “decorated” core proteins bind a single, extended molecule of hyaluronan (Fig. 7-25), the resulting proteoglycan aggregate (M r > 2 × 10 ) 8 and its associated water of hydration occupy a volume about equal to that of a bacterial cell! Aggrecan interacts strongly with collagen in the ECM of cartilage, contributing to the development, tensile strength, and resilience of this connective tissue. 981 FIGURE 7-25 Proteoglycan aggregate of the extracellular matrix. Schematic drawing of a proteoglycan with many aggrecan molecules. One very long molecule of hyaluronan is associated noncovalently with about 100 molecules of the core protein aggrecan. Each aggrecan molecule contains many covalently bound chondroitin sulfate and keratan sulfate chains. Link proteins at the junction between each core protein and the hyaluronan backbone mediate the core protein–hyaluronan interaction. Interwoven with these enormous extracellular proteoglycans are fibrous matrix proteins such as collagen, elastin, and fibronectin, forming a cross-linked meshwork that gives the whole ECM strength and resilience. Some of these proteins are multiadhesive, a single protein having binding sites for several different matrix molecules. Fibronectin, for example, has separate domains that bind fibrin, heparan sulfate, and collagen. Fibronectin and a number of other proteins in the extracellular matrix contain the conserved RGD sequence (Arg–Gly–Asp), through which they bind to a family of proteins called integrins. 982 Integrins mediate signaling between the cell interior and the network of molecules in the ECM. The overall picture of cell- matrix interactions that emerges (Fig. 7-26) shows an array of interactions between cellular and extracellular molecules. These interactions serve not merely to anchor cells to the ECM, providing the strength and elasticity of skin and joints. They also provide paths that direct the migration of cells in developing tissue and serve to convey information in both directions across the plasma membrane. FIGURE 7-26 Interactions between cells and the extracellular matrix. The association between cells and the proteoglycan of the extracellular matrix is mediated by a membrane protein (integrin) and by an extracellular protein (fibronectin, with its RGD motif) with binding sites for both integrin and the proteoglycan. Note the close association of collagen fibers with the fibronectin and proteoglycan. 983 Glycoproteins Have Covalently Attached Oligosaccharides Glycoproteins are carbohydrate-protein conjugates in which the glycans are branched and are much smaller and more structurally diverse than the huge glycosaminoglycans of proteoglycans. The carbohydrate is attached at its anomeric carbon through a glycosidic link to the —OH of a Ser or Thr residue (O-linked), or through an N-glycosyl link to the amide nitrogen of an Asn residue (N-linked) (Fig. 7-27). N-glycosyl bonds join the anomeric carbon of a sugar to a nitrogen atom in glycoproteins and nucleotides (see Fig. 8-1). Some glycoproteins have a single oligosaccharide chain, but many have more than one; the carbohydrate may constitute from 1% to 70% of the glycoprotein by mass. About half of all proteins of mammals are glycosylated. 984 FIGURE 7-27 Oligosaccharide linkages in glycoproteins. (a) O-linked oligosaccharides have a glycosidic bond to the hydroxyl group of Ser or Thr residues (light red), illustrated here with GalNAc as the sugar at the reducing end of the oligosaccharide. One simple chain and one complex chain are shown. (b) N-linked oligosaccharides have an N-glycosyl bond to the amide nitrogen of an Asn residue (green), illustrated here with GlcNAc as the terminal sugar. Three common types of oligosaccharide chains that are N-linked in glycoproteins are shown. A complete description of oligosaccharide structure requires specification of the position and stereochemistry ( α or β) of each glycosidic linkage. 985 As we shall see in Chapter 11, the external surface of the plasma membrane has many membrane glycoproteins with arrays of covalently attached oligosaccharides of varying complexity. Mucins are secreted or membrane glycoproteins that can contain large numbers of O-linked oligosaccharide chains. Mucins are present in most secretions; they are what gives mucus its characteristic slipperiness. Many of the proteins secreted by eukaryotic cells are glycoproteins, including most of the proteins of blood. For example, immunoglobulins (antibodies) and certain hormones, such as follicle-stimulating hormone, luteinizing hormone, and thyroid-stimulating hormone, are glycoproteins. Some of the proteins secreted by the pancreas are glycosylated, as are most of the proteins contained in lysosomes. Many milk proteins, including the major whey protein α-lactalbumin, are glycosylated. Glycomics is the systematic characterization of all carbohydrate components of a given cell or tissue, including those attached to proteins and to lipids. For glycoproteins, this also means determining which proteins are glycosylated and where in the amino acid sequence each oligosaccharide is attached. This is a challenging undertaking, but worthwhile because of the potential insights it offers into normal patterns of glycosylation and the ways in which they are altered during development or in genetic diseases or cancer. Current methods of characterizing the entire carbohydrate complement of cells depend heavily on sophisticated application of nuclear magnetic resonance and mass spectrometry. 986 The structures of a large number of O- and N-linked oligosaccharides from a variety of glycoproteins are known; Figures 7-20 and 7-27 show a few typical examples. In Chapter 27, we consider the mechanisms by which specific proteins acquire specific oligosaccharide moieties. What are the biological advantages of adding oligosaccharides to proteins? The very hydrophilic carbohydrate clusters alter the polarity and solubility of the proteins with which they are conjugated. Oligosaccharide chains that are attached to newly synthesized proteins in the endoplasmic reticulum (ER) and elaborated in the Golgi complex serve as destination labels and also act in protein quality control, targeting misfolded proteins for degradation (see Figs. 27-38, 27-39). When numerous negatively charged oligosaccharide chains are clustered in a single region of a protein, the charge repulsion among them favors the formation of an extended, rodlike structure in that region. The bulkiness and negative charge of oligosaccharide chains also protect some proteins from attack by proteolytic enzymes. Beyond these global physical effects on protein structure, there are also more specific biological effects of oligosaccharide chains in glycoproteins (Section 7.4). The importance of normal protein glycosylation is clear from at least 40 different genetic disorders of glycosylation having been found in humans, all causing severely defective physical or mental development; some of these disorders are fatal. 987 Glycolipids and Lipopolysaccharides Are Membrane Components Glycoproteins are not the only cellular components that bear complex oligosaccharide chains; some lipids, too, have covalently bound oligosaccharides. Gangliosides are membrane lipids of eukaryotic cells in which the polar head group, the part of the lipid that forms the outer surface of the membrane, is a complex oligosaccharide containing a sialic acid (Fig. 7-9) and other monosaccharide residues. Some of the oligosaccharide moieties of gangliosides, such as those that determine human blood groups (see Fig. 10-13), are identical with those found in certain glycoproteins, which therefore also contribute to blood group type. Like the oligosaccharide moieties of glycoproteins, those of membrane lipids are generally, perhaps always, found on the outer face of the plasma membrane. Lipopolysaccharides are the dominant surface feature of the outer membrane of gram-negative bacteria su