22 Feb 2012: (2012) Correction: The Murine Coronavirus Hemagglutinin-esterase Receptor-binding Site: A Major Shift in Ligand Specificity through Modest Changes in Architecture. PLoS Pathog 8(2): 10.1371/annotation/a1e2a2e4-df03-40db-b10b-fd0cfcf78d3c. doi: 10.1371/annotation/a1e2a2e4-df03-40db-b10b-fd0cfcf78d3c | View correction
The hemagglutinin-esterases (HEs), envelope glycoproteins of corona-, toro- and orthomyxoviruses, mediate reversible virion attachment to O-acetylated sialic acids (O-Ac-Sias). They do so through concerted action of distinct receptor-binding (“lectin”) and receptor-destroying sialate O-acetylesterase (”esterase”) domains. Most HEs target 9-O-acetylated Sias. In one lineage of murine coronaviruses, however, HE esterase substrate and lectin ligand specificity changed dramatically as these viruses evolved to use 4-O-acetylated Sias instead. Here we present the crystal structure of the lectin domain of mouse hepatitis virus (MHV) strain S HE, resolved both in its native state and in complex with a receptor analogue. The data show that the shift from 9-O- to 4-O-Ac-Sia receptor usage primarily entailed a change in ligand binding topology and, surprisingly, only modest changes in receptor-binding site architecture. Our findings illustrate the ease with which viruses can change receptor-binding specificity with potential consequences for host-, organ and/or cell tropism, and for pathogenesis.
Glycans cover the surface of every living cell. In vertebrates, these sugar trees commonly terminate with sialic acid (Sia) and, in consequence, Sias have become the attachment factors of choice for a multitude of pathogens: protozoa, bacteria and viruses alike. To ensure selectivity, viruses evolved to target distinct Sia species. Whether a particular type of Sia serves as receptor may depend -amongst others- on the absence or presence of specific Sia modifications. For example, most group A betacoronaviruses attach to 9-O-acetylated Sias. However, some murine coronaviruses have switched to using 4-O-acetylated Sias instead. In chemical/molecular terms this represents a momentous shift in receptor usage. We now have crystallized the hemagglutinin-esterase protein (HE) of a murine coronavirus and have solved the structure of its sugar-binding domain. Our findings reveal in exquisite detail the interactions between Sia binding site and cognate receptor. The data allow a reconstruction of how, during coronavirus evolution, the switch in receptor usage may have come about.
Citation: Langereis MA, Zeng Q, Heesters B, Huizinga EG, de Groot RJ (2012) The Murine Coronavirus Hemagglutinin-esterase Receptor-binding Site: A Major Shift in Ligand Specificity through Modest Changes in Architecture. PLoS Pathog 8(1): e1002492. doi:10.1371/journal.ppat.1002492
Editor: Ron A. M. Fouchier, Erasmus Medical Center, The Netherlands
Received: September 12, 2011; Accepted: December 5, 2011; Published: January 26, 2012
Copyright: © 2012 Langereis et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by “Pionier” program grant 99.402 and ECHO grant 700.55.007 from the Council for Chemical Sciences of the Netherlands Organization for Scientific Research (http://www.nwo.nl/) and by a grant (2006) from the Mizutani Foundation for Glycoscience (http://www.mizutanifdn.or.jp/).The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
To initiate infection viruses must bind to an appropriate host cell. Selectivity of binding is ensured by attachment proteins on the virion, tailored to recognize one -or at the most- a limited number of cell surface molecules. Remarkably, a large number of viruses, representative of at least 11 distinct families several of which of clinical and/or veterinary importance, use sialic acid (Sia) as receptor determinant. Owing to differential modification, Sia structural diversity exceeds that of any other monosaccharide . The most common type of Sia substitution, O-acetylation at carbon atoms C4, C7, C8 and/or C9, occurs in a host-, organ- and even cell-specific fashion such that even individual cells of the same type and tissue may differ in their Sia expression profile –. Viruses have evolved to selectively use particular Sia variants and their attachment proteins are high-specificity sialolectins, the binding of which might depend on the identity of the penultimate residue in the sugar chain, the type of glycosidic linkage and/or the presence or absence of substitutions –. Ultimately, this preference in Sia receptor usage affects host-, organ-, and cell-tropism –, the course and outcome of infection – as well as the efficacy of intra- and cross-species transmission , , all to extents not yet fully appreciated.
The hemagglutinin-esterases (HEs) are a class of Sia-binding envelope glycoproteins found in some negative-stranded RNA viruses, namely in influenza C and infectious salmon anemia virus (family Orthomyxoviridae; , ), but also in toro- and coronaviruses, positive-stranded RNA viruses in the order Nidovirales , . From phylogenetic and comparative structural analyses it appears that toro- and coronaviruses acquired their HE proteins separately via horizontal gene transfer, with an (hemagglutinin-esterase-fusion) HEF-like protein as progenitor –. Like influenza C virus HEF, most nidovirus HEs bind to 9-O-acetylated (9-O-Ac) Sias and, correspondingly, display sialate-9-O-acetylesterase receptor-destroying enzyme activity . Murine coronaviruses, however, occur in two closely related biotypes that differ in HE ligand/substrate preference. One of these -represented by mouse hepatitis virus (MHV) strain DVIM- displays the presumptive ancestral specificity and targets 9-O-Ac-Sias, while the other -represented by MHV strain S- appears to have evolved to use 4-O-Ac-Sias instead , – (for supplementary introduction see Text S1 and Figure S1). Given the stereochemical differences between these Sia variants (Figure 1) and the essentially different requirements for ligand and substrate recognition by the respective HEs, the question arises how this major shift in receptor usage was achieved and what changes must have occurred in the receptor-binding and O-acetylesterase domains to make this transition possible.
Figure 1. Stereochemical differences between 9-O- and 4-O-acetylated sialic acid.
Stick representation of (left) αNeu5,9Ac22Me and (right) αNeu4,5Ac22Me. Backbone αNeu5Ac2Me is colored in gray (carbon), red (oxygen) and blue (nitrogen). The 9-O-Ac group of αNeu5,9Ac22Me and 4-O-Ac group of αNeu4,5Ac22Me are highlighted in cyan (carbon).doi:10.1371/journal.ppat.1002492.g001
The crystal structures of a number of 9-O-Ac-Sia-specific nidovirus HEs have been solved , . Unlike the receptor-binding site (RBS) of influenza C virus HEF , the RBSs of the corona- and torovirus HEs seem to be exceptionally plastic as they appear to have undergone significant changes and adaptations that altered their overall architecture in a relatively short evolutionary time span. Based on these observations, we anticipated and speculated  that this plasticity might have allowed for even more substantial adjustments in the RBS of the murine coronavirus HE as to produce an entirely novel binding site specific for 4-O-acetylated Sias.
We now present the crystal structure of the MHV-S HE receptor-binding domain, both in its native state and in complex with a receptor analogue. The data reveal in exquisite detail how the RBS changed to accommodate 4-O- instead of 9-O-acetylated Sias. Surprisingly, however, this shift in receptor usage seems to have involved primarily a change in ligand binding topology and relatively modest changes in RBS architecture.
Expression, purification, and biochemical characterization of MHV-S HE
We produced the ectodomain of MHV-S HE (residues 25-403) as an Fc-fusion protein, either in enzymatically active (HE-Fc) or inactive form (HE0-Fc), by transient transfection of HEK293 cells. MHV-S HE0-Fc bound to horse serum glycoproteins (HSG), which are decorated with 4-O-acetylated sialic acids (4-O-Ac-Sia), but carry little to no 9-O-Ac-Sias (Figure 2A; ). The receptor determinants in HSG could be destroyed by treatment with MHV-S HE-Fc, but not by treatment with BCoV-Mebus HE-Fc (a sialate-9-O-acetylesterase; Figure 2B). No binding of MHV-S HE0-Fc was observed to bovine submaxillary mucin (BSM), a glycoconjugate devoid of 4-O-Ac-Sias (Figure 2A; ). The MHV-S HE ectodomain, released from HE-Fc by thrombin-cleavage, retained proper sialate-4-O-acetylesterase activity when assayed for substrate specificity with a synthetic di-O-acetylated Sia (5-N-acetyl-4,9-di-O-acetylneuraminic acid α-methylglycoside, αNeu4,5,9Ac32Me; Figure 2C). In hemagglutination assays, MHV-S HE0 specifically bound to 4-O-acetylated Sias (Figure 2D). The combined findings show that the recombinant MHV-S HE proteins are biologically active, both as Fc fusion proteins (Figures 2A and B) and after the removal of the Fc tail by thrombin-cleavage (Figures 2C and D), which we take as an indication for proper folding and protein stability.
Figure 2. HE-Fc fusion protein displays proper receptor-binding and enzymatic activities.
(A) Binding of two-fold serial dilutions (starting at 100 µg/ml) of esterase-deficient Fc-fusion proteins (HE0-Fc) of BCoV-Mebus and MHV-S in a solid-phase lectin-binding assay towards horse serum glycoproteins (HSG) and bovine submaxillary mucins (BSM). Relative binding in percentages is calculated with the binding of the highest concentration lectin set at a 100%. Wells incubated without lectin (“mock”) were included as negative control. (B) Receptor destroying enzyme activity towards HSG. Coated HSG was treated with two-fold serial dilutions (starting at 100 ng/ml) of enzymatically-active BCoV-Mebus and MHV-S HE Fc-fusion proteins and 4-O-Ac-Sia content was detected by solid phase lectin binding assay with MHV-S HE0-Fc. Decrease in signal as compared to untreated HSG is plotted in percentages. (C) MHV-S HE ectodomain displays sialate-4-O-acetylesterase activity towards the synthetic di-O-acetylated sialic acid analogue αNeu4,5,9Ac32Me. Graphs show total ion current gas-chromatograms and Sia subtypes were identified by mass spectrometry: Sia (αNeu5Ac2Me [peak 1]), 4-O-Ac-Sia (αNeu4,5Ac22Me [peak 2]), 9-O-Ac-Sia (αNeu5,9Ac22Me [peak 4], 4,9-di-O-Ac-Sia (αNeu4,5,9Ac32Me [peak 5]). Peak 3 represents a non-sialic acid compound. (D) Receptor binding activity of MHV-S HE ectodomain was assessed by hemagglutination assay with rat erythrocytes and twofold serial dilutions of the HE proteins (10,000 to 5 ng per well, arrow).doi:10.1371/journal.ppat.1002492.g002
Structure determination and overall structure
Crystals of free MHV-S HE and of a complex of HE0 with αNeu4,5Ac22Me diffracted to 2.1 and 2.5 Å resolution, respectively. The structures were solved by molecular replacement by using BCoV-Mebus HE (PDB ID 3CL5; ) as template (BCoV-Mebus and MHV-S HE share 59% sequence identity; for crystallographic details, see Table 1).
Table 1. Data collection and refinement statistics.doi:10.1371/journal.ppat.1002492.t001
In overall structure, the HE of MHV-S closely resembles that of BCoV-Mebus. It assembles into homodimers and the monomers are composed of three modules: a small membrane-proximal (MP), a receptor-binding (R), and a central esterase (E) domain (Figures 3A–C; ). The MP domain is virtually identical to that of BCoV-Mebus HE with a root mean square difference (rmsd) on main chain Cα atoms of only 0.48 Å. Unfortunately, residues in the E domain, that form the catalytic site were disordered in both crystals. Hence, the molecular basis for the unusual substrate specificity of MHV-S HE remains unknown. The structure of the R domain, however, was resolved, and in the complex the ligand molecule is well-defined (Figure S2). The R domains of MHV-S and BCoV-Mebus HE are highly similar with an rmsd on main chain Cα atoms of 0.79 Å.
Figure 3. Overall structure and comparison to BCoV-Mebus HE.
(A) Ribbon representation of the dimeric MHV-S (residues 25–395) and BCoV-Mebus HE (residues 19–376) structures. One monomer is colored grey, the other by domain: lectin domain (R, blue) with bound αNeu4,5Ac22Me (MHV-S HE) or αNeu4,5,9Ac32Me (BCoV-Mebus HE; cyan sticks) and potassium ion (magenta sphere); esterase domain (E, green); membrane-proximal domain (MP, red). (B) Linear representation of MHV HE with domains color-coded as in panel A. Grey segments indicate the signal-peptide (SP) and transmembrane (TM) domain. The bracket indicates the part of the protein for which the structure has been solved. (C) Structure- (MHV-S and BCoV-Mebus) and sequence-based (MHV-DVIM) alignment of HE sequences. Colored boxes above the sequences indicate domain organization as in panel A and B and black lines indicate loops involved in receptor binding. Note that in MHV-S HE two insertions increase the length of loops R3 and R4. Asterisks indicate the highly conserved residues of the potassium binding site and boxes indicate the critical serine, histidine and aspartic acid residues of the catalytic site. Residues that interact with the ligand are indicated in bold; those conserved among all three HEs are highlighted by grey shading. Other residues also conserved in all three HEs are highlighted in yellow. The residues in disordered loops of the esterase domain are indicated in light gray lettering.doi:10.1371/journal.ppat.1002492.g003
MHV-S HE has a unique receptor binding-site
The receptor-binding sites of BCoV-Mebus and MHV-S HE are very much alike in architecture. This is particularly surprising given the considerable differences in ligand preference and in their requirements for binding (i.e. binding of 9-O-Ac-Sia in a 9-O-Ac-dependent fashion versus binding of 4-O-Ac-Sia in 4-O-Ac-dependent fashion, respectively; Figure 1). The MHV-S HE receptor-binding site (RBS), like that of BCoV-Mebus HE, is composed of 5 surface exposed loops, four of which extend from the conserved 8-stranded “Swiss role” core-structure (loops R1 through R4; Figures 3C and 4A) and one originating from the E-domain (E-loop). Whereas the R1-, R2- and E-loops of the BCoV-Mebus and MHV-S HE sites are almost identical, the R3- and R4-loops adopt different conformations in the two proteins as result of amino acid insertions in MHV HE (Figures 3C and 4A). Two other conspicuous elements of the MHV-S HE RBS are the RBS-hairpin and a conserved metal-binding site with a potassium ion that stabilizes the R3-loop and the RBS-hairpin exactly as in BCoV-Mebus HE (Figure 4B). The potassium ion is coordinated by main chain oxygen atoms of Ser231, Glu280 and Leu282 and side chain oxygen atoms of Asp230, Gln232 and Ser278. These residues are conserved in BCoV-Mebus HE and in all other coronavirus HEs with the exception of HCoV-HKU1 HE .
Figure 4. Comparison of the MHV-S and BCoV-Mebus HE receptor binding sites.
(A) Ribbon superposition of the MHV-S and BCoV-Mebus HE receptor binding sites. BCoV-Mebus HE is colored gray, coloring of MHV-S HE as in panel A. Bound receptor analogues are shown as cyan sticks and potassium ions as magenta spheres. The five surface exposed loops and the RBS-hairpin that interact with the receptor are indicated. Note that only the R3- and R4-loops differ in conformation. (B) Close-up of the HE-potassium binding-site of MHV-S HE and BCoV-Mebus HE. Shown in ribbon representation are the R3-loop (salmon) and RBS-hairpin (purple) that interacts with the potassium ion (magenta sphere).doi:10.1371/journal.ppat.1002492.g004
While the overall organization of the MHV-S RBS is similar to that of BCoV-Mebus HE, the orientation of the receptor analogue with respect to the RBS is strikingly different (Figures 4B, 5A and B). As compared to the ligand in the BCoV-Mebus HE binding site (Figures 5C and D), the αNeu4,5Ac22Me receptor analogue is rotated by about 90° and shifted by about 2.5 Å. Figures 5A and B show how residues from the four R-loops, the E-loop and the RBS-hairpin interact with the Sia receptor molecule. Two hydrogen bonds are formed between the nitrogen and oxygen main-chain atoms of Lys217 and the oxygen of the C4 acetyl group and the nitrogen of the 5-N-acetyl group, respectively. The Ser220 main chain nitrogen accepts an additional, weak hydrogen bond from the C8 hydroxyl group of the ligand (Figure 5B).
Figure 5. MHV-S HE has a unique receptor-binding site that binds specifically 4-O-acetylated sialic acid.
(A) Surface and (B) stick representation of the MHV-S HE receptor-binding site in complex with a receptor analogue. The ligand bound to the HE receptor-binding site is shown in stick representation and the potassium ion as a magenta sphere, indicated by a black arrow in panel A. Hydrogen bonds between HE and the receptor are shown as black dashed lines. Surface representation of the MHV-S HE receptor-binding site reveals two pockets accommodating the 4-O- and 5-N-acetyl groups of the receptor, respectively. Note that crystals were soaked with αNeu4,5,9Ac32Me, but most likely as a result of the low pH crystallization conditions, the 9-O-Ac group was lost . (C) Surface and (D) stick representation of the BCoV-Mebus HE receptor-binding site. Note that the topology of the two hydrophobic pockets is conserved, except they bind different substituents of the receptor analogue. (E) The effect of Ala substitutions on receptor binding. Relative binding affinity of wild-type HE0 (wt) and its derivatives was assessed by hemagglutination assay with rat erythrocytes and twofold serial dilutions of each of the HE0-Fc chimeras (5,000 to 10 ng per well, arrow). (F) Binding of twofold serial dilutions of wild-type (wt) HE0-Fc chimera and its derivatives in a solid-phase lectin-binding assay towards horse serum glycoproteins (HSG) as described in Figure 2A.doi:10.1371/journal.ppat.1002492.g005
Most remarkably, the hydrophobic pocket that in BCoV-Mebus HE accommodates the 9-O-acetyl moiety of the receptor (comprised of Leu161, Tyr184 Leu266 and Leu267) -arguably the most crucial element of the BCoV HE RBS- is conserved in MHV-S HE (comprised of Ile166, Tyr189, Tyr281, and Leu282), but it now accepts the Sia 5-N-acetyl group, while the Sia glycerol side-chain is solvent exposed (Figure 5A). Moreover, the hydrophobic patch in the BCoV-Mebus HE RBS that interacts with the Sia 5-N-acetyl group (Figure 5C) apparently changed into a shallow pocket that accommodates the Sia 4-O-acetyl moiety (Figure 5A). The residues orthologous to BCoV-Mebus HE Thr114, Leu161, Phe211, and Leu266 were replaced by Leu119, Ile166, Ser216, and Tyr281, respectively, and Leu260 was recruited from the R4-loop, which in MHV-S HE is reoriented as compared to the one in BCoV-Mebus HE (Figure 4A). These residues, together with conserved Phe212, form the hydrophobic lining of the newly shaped pocket (Figures 5A and B). As the Sia-4-O-acetyl group is crucial for ligand recognition by MHV-S HE, this pocket must be key to receptor-binding. In accordance, single Ala substitutions of Leu119, Ile166, Phe212, Leu260, or Tyr281 all reduced receptor-binding activity (although that of Ile166 to lesser extent) as shown by hemagglutination assay (Figure 5E) and solid-phase lectin binding assay (Figure 5F).
Relatively modest changes in the MHV-S receptor binding-site changes ligand specificity
The data reveal in minute detail not only the mode of interaction between MHV-S HE and its cognate receptor determinant, but also clarify how a CoV HE RBS for 9-O-Ac-Sia might have transformed into one that now specifically binds 4-O-Ac-Sia. The most striking observation is that this major shift in ligand specificity required only minimal changes in the protein and that the binding site architecture was essentially maintained. How this was possible can be explained from the mode of lectin-ligand interaction, based largely on the docking of the methyl groups of the Sia-acetyl moieties into hydrophobic pockets, and from the structures of the two types of ligands. The juxtaposition of the Sia 5-N- and 9-O-acetyl moieties is quasi-similar to that of the Sia 4-O- and 5-N-acetyl groups. The distance between the groups may be different (7.1 versus 5.7 Å as measured between the methyl carbon atoms, respectively), but for each combination the acetyl groups are located in roughly the same plane and at roughly similar angles (Figure S3). Thus, it can be envisaged that a pre-existing site for 9-O-Ac-Sia was converted to accommodate 4-O-Ac-Sia instead by (i) having the ligand rotate (with binding of the ligand in the novel orientation facilitated through hydrogen bonding with residues introduced by substitutions and/or insertions in the R3 loop) and (ii) by bringing the original 9-O-acetyl binding pocket and 5-N-acetyl binding patch more closely together so that they now can accept the 5-N- and 4-O-acetyl moieties, respectively (Figure S3). From attempts to fit αNeu5,9Ac22Me into the MHV-S RBS by in silico modelling, the the 9-O- and 5-N-acetyl groups would seem to be spaced too far apart to conveniently dock into the acetyl-binding pockets. Moreover, were the ligand to bind in this orientation, the Sia carboxylate would clash with the modified R3-loop. These findings thus provide an explanation for exclusion of the original ligand and for the specificity of MHV-S HE for 4-O-Ac-Sias (Figure S3 and Video S1).
The structure of the MHV-S HE-receptor complex allows guarded predictions only of how glycosidic linkage or additional Sia modifications might affect ligand binding. The C2-oxygen through which glycosidically-bound Sia would be linked to the penultimate residue of the glycan chain is exposed to the solvent and we would therefore expect the lectin to bind Sias in a linkage-independent fashion. Still, the R4- and/or E-loops, as they are proximal to Sia C2 (Figure 4A), might affect ligand binding such as to cause a preference for a particular linkage type. The pocket for the Sia 5-N-acetyl group would seem sufficiently wide to also accommodate the slightly larger 5-N-Gc substituent (Figure 5A); whether the lectin does accept 5-N-glycolylated Sias as ligands remains to be shown, however. Finally, from the topology and orientation of αNeu4,5Ac22Me in the RBS of MHV-S HE, ligand binding would seem to be tolerant to modifications at the Sia glycerol side chain (Figure 5A). Yet, as demonstrated by hemagglutination assay with native and sialate-9-O-acetylesterase-treated erythrocytes, MHV-S HE apparently prefers 4-mono-O- over 4,9-di-O-acetylated Sias .
The occurrence of two distinct MHV lineages –exemplified by strains S and DVIM– that through their HE proteins bind to widely different Sia subtypes poses an interesting conundrum. While the structure reported here provides clues to how an HE protein ancestral to that of MHV-S may have changed to bind to 4-O- rather than to 9-O-acetylated Sias, the conditions that selected for this shift in ligand specificity and the biological consequences thereof are unknown. The limited data available on the in vivo role of HE suggests that it promotes viral spread . Entry of murine coronaviruses, however, is mediated not by HE, but by the S protein, a type I fusion protein that binds to the principal receptor CAECAM1a –. We propose that HE may act during the very early stages of the infectious cycle as a molecular timer for temporary virion attachment. Through the concerted actions of its lectin and sialate-O-acetylesterase domains, HE would allow virus particles to bind with high avidity and yet reversibly to sialylated surfaces. The time allowed for virions to remain attached would be a function of HE binding affinity/avidity, esterase activity and local Sia density. Virions by binding to the ubiquitous and highly accessible Sias in the glycocalix would buy time for the S protein to find and bind the main receptor at the cell's surface as an obligatory prelude to penetration. Such a strategy would be advantageous particularly under conditions of low receptor density or poor receptor accessibility. If within the allotted time, HE-mediated virion attachment would not progress to this next stage of entry (for example, because the particle attached not to a susceptible cell, but to decoy receptors on a non-cell-associated glycoconjugate), the default would be for the virus to elute and “take its business elsewhere”. In this model, , MHV HE would appreciably contribute to host cell selection, its ligand preference potentially affecting host-, organ- and cell tropism. Our findings pave the way to study the function of CoV HE and to assess the importance of ligand and substrate specificity through an approach of structure-guided mutagenesis, reverse genetics and animal experimentation in a natural infection model.
Materials and Methods
Protein expression and purification
A synthetic DNA with human codon-optimized sequence for the HE ectodomain of MHV strain S (MHV-S; amino acid residues 25–403) was cloned in pCD5-Ig , , a derivative of expression plasmid S1-Ig . The resulting construct, pCD5-MHV-S-HE-T-Fc, codes for a chimeric HE protein provided with an N-terminal CD5 signal peptide and, at its C-terminus, preceded by a thrombin cleavage site, the Fc domain of human IgG1 (HE-Fc). The QuikChange XL II site-directed mutagenesis kit (Stratagene) was used to construct pCD5-MHV-S-HE-T-Fc derivatives that code for an enzymatically inactive HE-Fc with the esterase catalytic residue Ser45 replaced by Ala (HE0-Fc), and for HE0-Fc mutants with Ala substitutions in the receptor-binding site. For analytical purposes, HE-Fc fusion proteins were produced by transient expression in HEK293T cells and then purified from the cell culture supernatants by protein A-affinity chromatography and low-pH elution (0.1M Citric-acid pH 3.0). The pH of the eluate was neutralized by adding Tris pH 8.0 to a final concentration of 0.2 M and the protein solution was dialyzed against phosphate-buffered saline (PBS). For crystallography, HE-Fc fusion-proteins were transiently expressed in HEK293 GnTI(-) cells  and the MHV-S ectodomain was purified by protein A-affinity chromatography and on-the-beads thrombin cleavage as described , .
Solid-phase lectin binding assay (SLBA)
Maxisorp 96-well plates (NUNC) were coated for 16 hrs at 4°C with horse serum glycoproteins (HSG; 10% v/v horse serum in PBS) or bovine submaxilary mucin (BSM; 10 mg/ml; Sigma) at 100 µl per well. The wells were washed with washing buffer (PBS, 0.05% Tween-20) and treated with blocking buffer (PBS, 0.05% Tween-20, 2% bovine serum albumin, BSA) for 1 hr at RT. Two-fold serial dilutions of HE0-Fc lectins were prepared in blocking buffer (starting concentration 100 µg/ml) and 100 µl samples of these dilutions were added to the glycoconjugate-coated wells. Incubation was continued for 60 min after which unbound lectin was removed by washing three times. Bound lectin was detected using an HRP-conjugated goat anti-human IgG antiserum (1:10,000 in blocking buffer; Southern Biotech) and TMB Super Slow One Component HRP Microwell Substrate (BioFX) according to the instructions. The staining reaction was terminated by addition of 0.3 M phosphoric acid, the optical density was measured at 450 nm, and graphs were constructed using GraphPad software. To assess and compare the enzymatic activities of BCoV-Mebus and MHV-S HE-Fc towards 4-O-acetylated Sias, HSG coated in Maxisorp plates was treated with samples from two-fold serial dilutions of either enzyme (starting at 100 ng/ml in PBS, 100 µl/well) for 2 hrs at 37°C. The destruction of 4-O-Ac-Sia receptor determinants was determined by SLBA with MHV-S HE0-Fc (5 µg/ml in blocking buffer) as described above. Enzymatic de-O-acetylation of αNeu4,5,9Ac32Me was analyzed by gas-chromatography-electron impact mass-spectrometry (GC-MS) as described , , .
Hemagglutination assay was performed in V-shaped 96-well plates (Greiner Bio-One). Two-fold serial dilutions in 50 µl PBS, 0.1% BSA of HE0-Fc or of purified HE0 ectodomains (starting amounts indicated in the text) were mixed with 50 µl of a rat erythrocyte suspension (Rattus norvegicus strain Wistar; 0.5% in PBS) and incubated for 2 hours on ice.
Crystallization conditions were screened by the sitting-drop vapor diffusion method using a Honeybee 961 (Genomic Solutions). Drops were set up with 0.2 µl of HE protein solution in 10 mM Tris-HCl pH 8.0 and 0.2 µl reservoir solution. Crystals with space group P212121 were obtained from 0.2 M KH2PO4, 0.2 M sodium malonate, 15% (w/v) PEG3350 and 0-5% (w/v) glycerol at 18°C. Crystals for diffraction experiments were grown with the hanging drop vapor diffusion method set up by hand with reservoir and protein solution ratio 1:1 (1.6 µl total) at 18°C, and grew to a final size of up to 0.25×0.20×0.20 mm within one week. For data collection, crystals were flash-frozen in liquid nitrogen using reservoir solution containing 20% (w/v) glycerol as the cryoprotectant. To determine the HE structure in complex with its receptor, crystals of HE0 were soaked by adding 2 µl of 10 mM αNeu4,5,9Ac32Me in cryoprotectant solution directly into the margin of the drop, resulting in a final substrate concentration of about 7 mM. Crystals were flash-frozen after 5 to 10 minutes.
Data collection and structure solution
Diffraction data of crystals of MHV-S HE and its complex (Table 1) were collected at ESRF station ID-14-1 and ID-14-3, respectively. Diffraction data of native and ligand-soaked HE crystals were processed using XDS  and scaled using SCALA from the CCP4 suite . Molecular replacement was performed using PHASER with BCoV-Mebus HE as template (PDB ID: 3CL5; ). Models were built manually with Coot  and refinement was carried out using REFMAC . Water molecules were added using ARP/WARP, graphics generated with PYMOL (http://pymol.sourceforge.net).
In the Ramanchandran plot three residues are found in disallowed regions. The electron density of these residues supports the modeled conformation. In both HE monomers present in the asymmetric unit of the crystal structure of free as well as ligand-bound HE, the active site region of the esterase domain is largely disordered. No electron density is observed for esterase domain residues A52-A59, B51-B59, A108-A114, A308-A314, A335-A347 and B338-B346, while residues 44-50, 60-72, 332-334, and 348-358 adopt different conformations in the two monomers. Modeling of chain A residues 397-401 and chain B residues 334-337 and 394-398 should be considered tentative. C-terminal residues 396-403 followed by the 7-residue thrombin recognition sequence of the cleavable Fc-fusion are stabilized by crystal packing interactions suggesting that the observed conformation is not physiologically relevant.
Rooted Neighbor-Joining tree depicting the evolutionary relationships among coronavirus HE proteins with Influenza C virus (IFC) HEF as outgroup. Confidence values calculated by bootstrapping (1000 replicates) are indicated at the major branching points. HEs specific for 9-O-acetylated Sia as determined on the basis of their lectin ligand specificity and/or sialate-O-acetylesterase substrate preference are high-lighted in green, the one specific for 4-O-acetylated Sia in red. HCoV, human coronavirus; PHEV, porcine hemagglutinating encephalomyelitis virus, EqCoV, equine coronavirus.
Electron density of the receptor. (A) Difference electron density map calculated from the final model from which the ligand had been omitted. The contour level is 3.0 σ. (B) 2Fo-Fc map of the final model contoured at the 1.0 σ level.
A large shift in ligand specificity through modest changes in receptor-binding site architecture. (A) Stick representation of αNeu4,5,9Ac32Me in gray with (left) the 5-N-Ac- and 9-O-Ac- or (right) the 4-O-Ac- and 5-N-Ac-groups colored in red (oxygen), blue (nitrogen) and cyan (carbon). Arrows indicate the distances between Ac methyl groups and asterisks the position of the O2 atom through which Sia would be linked to the penultimate residue of the glycan chain. The patch that accommodates the Sia-5-N-Ac group in BCoV-Mebus HE, and the newly formed pocket that accommodates the Sia-4-O-Ac group in MHV-S HE are colored in cyan. The pockets that harbor the Sia-9-O-Ac group in BCoV-Mebus HE and now accommodates the Sia-5-N-Ac group in MHV-S HE are colored in green. (B) Surface representation of the MHV-S HE receptor binding site with Neu5,9Ac22Me modeled in silico in a topology corresponding to that in BCoV HE. The model predicts that 9-O-Ac-Sia will not be accepted as ligand because of (i) the spatial arrangement of the two hydrophobic pockets at too close a distance of each other and (ii) a clash of the Sia carboxylate with residues of the extended R3 loop.
Supplementary introduction; Receptor switching from 9- O - to 4- O -Ac-Sias or the other way around?
A large shift in ligand specificity through modest changes in receptor-binding site architecture. Only a few changes in the receptor binding site architecture of the MHV-S HE protein resulted in the specific binding of 4-O-acetylated sialic acid and the exclusion of 9-O-acetylated sialic acid.
We acknowledge the European Synchrotron Radiation Facility for providing beamline facilities and the beamline scientists at ID-14-1 and ID14-3 for their help with data collection. We thank P. Reeves and M. Farzan for sharing HEK293S GnT-I(-) cells and pCD5 expression plasmid, respectively, and P. Rottier and J. de Groot-Mijnes for critical reading of the manuscript.
Conceived and designed the experiments: MAL QZ EGH RJdG. Performed the experiments: MAL QZ BH. Analyzed the data: MAL QZ EGH RJdG. Wrote the paper: MAL QZ EGH RJdG.
- 1. Angata T, Varki A (2002) Chemical diversity in the sialic acids and related alpha-keto acids: an evolutionary perspective. Chem Rev 102: 439–469.
- 2. Klein A, Roussel P (1998) O-acetylation of sialic acids. Biochimie 80: 49–57.
- 3. Dumermuth E, Beuret N, Spiess M, Crottet P (2002) Ubiquitous 9-O-acetylation of sialoglycoproteins restricted to the Golgi complex. J Biol Chem 277: 18687–18693.
- 4. Klein A, Krishna M, Varki NM, Varki A (1994) 9-O-acetylated sialic acids have widespread but selective expression: analysis using a chimeric dual-function probe derived from influenza C hemagglutinin-esterase. Proc Natl Acad Sci U S A 91: 7782–7786.
- 5. Rogers GN, Herrler G, Paulson JC, Klenk HD (1986) Influenza C virus uses 9-O-acetyl-N-acetylneuraminic acid as a high affinity receptor determinant for attachment to cells. J Biol Chem 261: 5947–5951.
- 6. de Groot RJ (2006) Structure, function and evolution of the hemagglutinin-esterase proteins of corona- and toroviruses. Glycoconj J 23: 59–72.
- 7. Herrler G, Szepanski S, Schultze B (1991) 9-O-acetylated sialic acid, a receptor determinant for influenza C virus and coronaviruses. Behring Inst Mitt. pp. 177–184.
- 8. Haselhorst T, Fleming FE, Dyason JC, Hartnell RD, Yu X, et al. (2009) Sialic acid dependence in rotavirus host cell invasion. Nat Chem Biol 5: 91–93.
- 9. Olofsson S, Bergstrom T (2005) Glycoconjugate glycans as viral receptors. Ann Med 37: 154–172.
- 10. Krempl C, Schultze B, Laude H, Herrler G (1997) Point mutations in the S protein connect the sialic acid binding activity with the enteropathogenicity of transmissible gastroenteritis coronavirus. J Virol 71: 3285–3287.
- 11. Alexander DA, Dimock K (2002) Sialic acid functions in enterovirus 70 binding and infection. J Virol 76: 11265–11272.
- 12. Arnberg N, Kidd AH, Edlund K, Olfat F, Wadell G (2000) Initial interactions of subgenus D adenoviruses with A549 cellular receptors: sialic acid versus alpha(v) integrins. J Virol 74: 7691–7693.
- 13. Lipton HL, Kumar AS, Hertzler S, Reddi HV (2006) Differential usage of carbohydrate co-receptors influences cellular tropism of Theiler's murine encephalomyelitis virus infection of the central nervous system. Glycoconj J 23: 39–49.
- 14. Olofsson S, Kumlin U, Dimock K, Arnberg N (2005) Avian influenza and sialic acid receptors: more than meets the eye? Lancet Infect Dis 5: 184–188.
- 15. Bauer PH, Bronson RT, Fung SC, Freund R, Stehle T, et al. (1995) Genetic and structural analysis of a virulence determinant in polyomavirus VP1. J Virol 69: 7925–7931.
- 16. Bauer PH, Cui C, Liu WR, Stehle T, Harrison SC, et al. (1999) Discrimination between sialic acid-containing receptors and pseudoreceptors regulates polyomavirus spread in the mouse. J Virol 73: 5826–5832.
- 17. Lopez-Bueno A, Rubio MP, Bryant N, McKenna R, Agbandje-McKenna M, et al. (2006) Host-selected amino acid changes at the sialic acid binding pocket of the parvovirus capsid modulate cell binding affinity and determine virulence. J Virol 80: 1563–1573.
- 18. Nam HJ, Gurda-Whitaker B, Gan WY, Ilaria S, McKenna R, et al. (2006) Identification of the sialic acid structures recognized by minute virus of mice and the role of binding affinity in virulence adaptation. J Biol Chem 281: 25670–25677.
- 19. Vines A, Wells K, Matrosovich M, Castrucci MR, Ito T, et al. (1998) The role of influenza A virus hemagglutinin residues 226 and 228 in receptor specificity and host range restriction. J Virol 72: 7626–7631.
- 20. Hellebo A, Vilas U, Falk K, Vlasak R (2004) Infectious salmon anemia virus specifically binds to and hydrolyzes 4-O-acetylated sialic acids. J Virol 78: 3055–3062.
- 21. Cornelissen LA, Wierda CM, van der Meer FJ, Herrewegh AA, Horzinek MC, et al. (1997) Hemagglutinin-esterase, a novel structural protein of torovirus. J Virol 71: 5277–5286.
- 22. Snijder EJ, den Boon JA, Horzinek MC, Spaan WJ (1991) Comparison of the genome organization of toro- and coronaviruses: evidence for two nonhomologous RNA recombination events during Berne virus evolution. Virology 180: 448–452.
- 23. Zeng Q, Langereis MA, van Vliet AL, Huizinga EG, de Groot RJ (2008) Structure of coronavirus hemagglutinin-esterase offers insight into corona and influenza virus evolution. Proc Natl Acad Sci U S A 105: 9065–9069.
- 24. Langereis MA, Zeng Q, Gerwig GJ, Frey B, von Itzstein M, et al. (2009) Structural basis for ligand and substrate recognition by torovirus hemagglutinin esterases. Proc Natl Acad Sci U S A 106: 15897–15902.
- 25. Smits SL, Gerwig GJ, van Vliet AL, Lissenberg A, Briza P, et al. (2005) Nidovirus sialate-O-acetylesterases: evolution and substrate specificity of coronaviral and toroviral receptor-destroying enzymes. J Biol Chem 280: 6933–6941.
- 26. Regl G, Kaser A, Iwersen M, Schmid H, Kohla G, et al. (1999) The hemagglutinin-esterase of mouse hepatitis virus strain S is a sialate-4-O-acetylesterase. J Virol 73: 4721–4727.
- 27. Langereis MA, van Vliet AL, Boot W, de Groot RJ (2010) Attachment of mouse hepatitis virus to O-acetylated sialic acid is mediated by hemagglutinin-esterase and not by the spike protein. J Virol 84: 8970–8974.
- 28. Rosenthal PB, Zhang X, Formanowski F, Fitz W, Wong CH, et al. (1998) Structure of the haemagglutinin-esterase-fusion glycoprotein of influenza C virus. Nature 396: 92–96.
- 29. Hanaoka K, Pritchett TJ, Takasaki S, Kochibe N, Sabesan S, et al. (1989) 4-O-acetyl-N-acetylneuraminic acid in the N-linked carbohydrate structures of equine and guinea pig alpha 2-macroglobulins, potent inhibitors of influenza virus infection. J Biol Chem 264: 9842–9849.
- 30. Reuter G, Pfeil R, Stoll S, Schauer R, Kamerling JP, et al. (1983) Identification of new sialic acids derived from glycoprotein of bovine submandibular gland. Eur J Biochem 134: 139–143.
- 31. Kazi L, Lissenberg A, Watson R, de Groot RJ, Weiss SR (2005) Expression of hemagglutinin esterase protein from recombinant mouse hepatitis virus enhances neurovirulence. J Virol 79: 15064–15073.
- 32. Williams RK, Jiang GS, Holmes KV (1991) Receptor for mouse hepatitis virus is a member of the carcinoembryonic antigen family of glycoproteins. Proc Natl Acad Sci U S A 88: 5533–5536.
- 33. Dveksler GS, Pensiero MN, Cardellichio CB, Williams RK, Jiang GS, et al. (1991) Cloning of the mouse hepatitis virus (MHV) receptor: expression in human and hamster cell lines confers susceptibility to MHV. J Virol 65: 6881–6891.
- 34. Bosch BJ, Rottier PJM (2008) Nidovirus entry into cells. In: Perlman S, Gallagher T, Snijder EJ, editors. Nidoviruses. Washington D.C.: ASM Press. pp. 157–177.
- 35. Li W, Moore MJ, Vasilieva N, Sui J, Wong SK, et al. (2003) Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus. Nature 426: 450–454.
- 36. Reeves PJ, Callewaert N, Contreras R, Khorana HG (2002) Structure and function in rhodopsin: high-level expression of rhodopsin with restricted and homogeneous N-glycosylation by a tetracycline-inducible N-acetylglucosaminyltransferase I-negative HEK293S stable mammalian cell line. Proc Natl Acad Sci U S A 99: 13419–13424.
- 37. Kamerling JP, Vliegenthart JFG (1989) Carbohydrates. In: Lawson AM, editor. Clinical Biochemistry - Principles, Methods, Applications, Vol 1, Mass Spectrometry. Berlin: Walter de Gruyter. pp. 176–263.
- 38. Kabsch W (1993) Automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constants. J Appl Cryst 26: 795–800.
- 39. Collaborative Computational Project, Number 4 (1994) The CCP4 suite: programs for protein crystallography. Acta Crystallogr D Biol Crystallogr 50: 760–763.
- 40. Emsley P, Cowtan K (2004) Coot: model-building tools for molecular graphics. Acta Crystallogr D Biol Crystallogr 60: 2126–2132.
- 41. Winn MD, Isupov MN, Murshudov GN (2001) Use of TLS parameters to model anisotropic displacements in macromolecular refinement. Acta Crystallogr D Biol Crystallogr 57: 122–133.
- 42. Kamerling JP, Schauer R, Shukla AK, Stoll S, Van Halbeek H, et al. (1987) Migration of O-acetyl groups in N,O-acetylneuraminic acids. Eur J Biochem 162: 601–607.