Transcription in all cellular organisms is orchestrated by the multisubunit DNA-dependent RNA polymerase (RNAP), a multifunctional enzyme that synthesizes RNA and is a major target for the regulation of gene expression. The Escherichia coli RNAP, which is the best characterized, comprises an essential catalytic core of two α-subunits (each 36.5 kDa), one β-subunit (150.6 kDa), and one β′-subunit (155.2 kDa) that is responsible for transcript elongation and termination. The holoenzyme contains an additional regulatory subunit, normally σ70 (70.2 kDa), and is capable of promoter-specific recognition and transcription initiation. To identify the location of subunits within the RNAP, we developed a strategy to target insertions of a protein domain into surface-exposed, nonessential regions that can be visualized by electron microscopy (EM) and image processing. This strategy was readily applicable to either of the two largest subunits, β and β′, which have colinearly arranged regions of strong amino acid sequence similarity from bacteria to humans (1–3). The highly conserved regions are separated by relatively nonconserved spacer regions. In some organisms, the nonconserved regions contain large gaps or insertions compared with E. coli. In the β-subunit, nine conserved regions, labeled A through I, have been identified (ref. 3; Fig. Fig.1).1). In addition, the β-subunit of E. coli RNAP contains two regions of poor sequence conservation, centered about residues 300 and 1,000, which are often missing in β-homologs from other organisms (Fig. (Fig.1).1). Figure 1 Staphylococcus aureus protein A (SPA) insertion in the β-subunit of E. coli RNAP. The primary sequence of the β-subunit is represented as a black bar. Evolutionarily conserved regions are shaded grey and labeled A–I (3). Evolutionarily ... As expected, mutations affecting the conserved regions frequently cause severe defects in RNAP function (ref. 4 and references therein). β-Residues 1,065 and 1,237, in conserved regions H and I, respectively, participate in the formation of the initiating site of the enzyme (5). Mutations in the β-subunit render the enzyme resistant to the antibiotic inhibitors rifampicin and streptolydigin (refs. 6–9; Fig. Fig.11). Alternatively, mutations in poorly conserved regions can have little or no apparent effect on RNAP function. In fact, large deletions in the two poorly conserved regions in β do not affect RNAP assembly and basic function in vitro (10, 11). These regions have thus been termed DRI (situated between conserved regions B and C, extending approximately from residues 170 to 430) and DRII (situated between conserved regions G and H, extending approximately from residues 930 to 1,030). Both of these regions are poorly conserved or absent in β-subunit homologs from other organisms; both can tolerate numerous mutations without fatal consequences in vivo; and both must be structurally autonomous to accommodate large deletions and insertions without disturbing the critical functions of the enzyme. Thus, both DRs are likely to comprise separate and relatively isolated domains on the RNAP structure. Although neither DR of the β-subunit plays an important role in RNAP assembly or basic transcription activity, it is assumed that their presence points to a role in regulatory functions specific to E. coli that have not yet been identified. Indeed, DRI is targeted by the bacteriophage T4-Alc protein, which selectively induces premature termination of E. coli RNAP transcription on E. coli DNA (11), indicating that regulatory factors can impact transcription through these DRs. Herein, we report a structural analysis of a mutant E. coli RNAP harboring an insertion of a protein domain in β-DRII. The mutant RNAP was used to locate directly the inserted domain on the three-dimensional structure of RNAP by using cryo-EM and difference analysis with the previously determined native RNAP structure, thereby locating its site of insertion. The approach used for this study is similar to the peptide-based difference mapping reported by Conway et al. (12), except that a larger protein domain was used to ensure visualization at the lower resolution of our analysis (20-Å compared with 11-Å resolution), and an internal rather than a terminal region of the polypeptide was located.