Advanced search
553
Views
3
CrossRef citations to date
0
Altmetric
Review

Stabilize and connect: the role of LARP7 in nuclear non-coding RNA metabolism

ORCID Icon, & ORCID Icon
Pages 290-303
Received 13 Feb 2020
Accepted 07 May 2020
Accepted author version posted online: 13 May 2020
Published online: 03 Jun 2020

ABSTRACT

La and La-related proteins (LARPs) are characterized by a common RNA interaction platform termed the La module. This structural hallmark allows LARPs to pervade various aspects of RNA biology. The metazoan LARP7 protein binds to the 7SK RNA as part of a 7SK small nuclear ribonucleoprotein (7SK snRNP), which inhibits the transcriptional activity of RNA polymerase II (Pol II). Additionally, recent findings revealed unanticipated roles of LARP7 in the assembly of other RNPs, as well as in the modification, processing and cellular transport of RNA molecules. Reduced levels of functional LARP7 have been linked to cancer and Alazami syndrome, two seemingly unrelated human diseases characterized either by hyperproliferation or growth retardation. Here, we review the intricate regulatory networks centered on LARP7 and assess how malfunction of these networks may relate to the etiology of LARP7-linked diseases.

Introduction

The eukaryotic La-related protein (LARP) family comprises members with various functions in RNA metabolism. LARPs typically exploit a combination of a winged helix domain, termed La-motif (LaM), and an adjacent RNA recognition motif (RRM1) to contact their RNA substrates [1–6], whereby the LaM and the RRM1 are collectively referred to as ‘La module’ (reviewed in [7]).

Target interactions are nevertheless not restricted to this characteristic structure. Instead, LARPs possess additional RNA binding motifs, which are also critical for substrate binding [8–11]. The presence of such specialized building blocks together with a versatile La module allow LARPs to target specific subsets of RNAs applying a fascinating mechanistic diversity (reviewed in [12]). While LARP1, the two paralogs LARP4 and LARP4B, as well as LARP6 mainly localize to the cytoplasm where they bind to mRNAs [11,13–16], LARP7 and the genuine La protein/SSB are mostly nuclear and both interact with non-coding RNAs transcribed by RNA polymerase III (Pol III). These transcripts terminate on an oligo-uridine stretch, which is the primary binding site for the La module of the two closely related proteins [4,5]. LARP7 and La also share a similar overall structural organization as they both contain a second RRM, referred to as RRM2 [17,18].

Despite these analogies, LARP7 and La fulfill unrelated and non-overlapping cellular functions [19,20]. Even though both gene products are essential in mouse, the homozygous Larp7 knockouts survive until embryonic day 16.5 [21], whereas disruption of the La gene impairs early embryonic development and is lethal already at the pre-implantation stage [22]. In addition, distinct symptomatic spectra of human pathologies are linked to these two homologous proteins. For example, La, but not LARP7, is a common autoantigen found in autoimmune diseases such as Lupus erythematosus [23].

La is known to bind virtually all primary Pol III transcripts immediately upon transcription termination [24]. However, the interaction with La is usually transient and restricted to the maturation of the respective RNA targets [25] and its release is mandatory for the assembly of the targeted RNAs into functional RNPs [26]. In contrast, LARP7 interacts with and promotes the stability of its main target, the non-coding 7SK RNA [20,27,28]. This highly structured transcript folds into four hairpin domains and serves as a scaffold for the binding of several proteins assembling into 7SK snRNP complexes (reviewed in [29]). More than a decade ago, LARP7 was first observed to co-purify with such particles together with the positive transcription elongation factor b (P-TEFb) [30]. P-TEFb functions as important regulator of RNA polymerase II (Pol II) activity and it has been well established meanwhile that it can be tethered in an inactive state to the 7SK RNA (reviewed in [31]).

This review aims to provide an overview of our current understanding of LARP7-associated pathways and processes. In the first part, we focus on recent structural and biochemical studies providing novel insights into the assembly and function of LARP7-7SK RNA complexes. We will pinpoint common and distinguishing structural properties between LARP7 and the homologous La protein as well as LARP7 orthologs from unicellular eukaryotes lacking the 7SK RNA [32–36]. The central part of this review will focus on recent studies that uncovered thus far unknown 7SK RNA-independent functions of LARP7 in different aspects of RNA metabolism, including RNA modification. In the last sections, we will discuss the implications of 7SK-dependent and -independent LARP7 functions for human malignancies.

A clamp and a clip to grab the 7SK RNA

The LARP7 protein consists of the N-terminal La module and the C-terminal RRM2, which are connected by a central region predicted to be largely unstructured [5] (Fig. 1A). Studies on the RNA binding properties of LARP7 focused on the interaction with its main target, the 7SK RNA and in particular on its 3ʹ terminal hairpin (often referred to as HP4 or SL4). As any other Pol III transcript, the 7SK RNA possesses a 3ʹ terminal single-stranded oligo-uridine tail, which is, together with the preceding HP4, essential for LARP7 binding in vivo [26]. Recent structural studies have resolved the details of the protein-RNA interactions, demonstrating that the HP4 is contacted by both, the La module as well as the RRM2 [5,9,10].

Figure 1. Two RNA-binding platforms within the LARP7 protein mediate contacts to RNA. (A) Schematic overview of the domain organization of human LARP7. The N-terminal La motif (LaM, blue) and the adjacent RNA recognition motif (RRM1, green) form the so-called La module, the common hallmark of LARPs. A flexible linker (dashed line, grey) connects it to the C-terminal RRM2 (salmon). (B) Graphical representation of the binding of LARP7 to terminal hairpin of the 7SK RNA (HP4, yellow) according to the structural studies by Uchikawa et al. (2015) and Eichhorn et al. (2018). The La module is represented as a clamp pinching the single-stranded 3ʹ end of HP4, the RRM2 as a clip inserted on top of the apical loop. The corresponding structures are shown at the sides and were depicted using PyMOL (https://pymol.org/2/) and the data deposited in the Protein Data Bank (PDB): 4WKR for the La module and 6D12 for the RRM2, starting from residue 456. Colours as in (A) with the α3-helix highlighted by darker colouring

The structure of the isolated La module of LARP7 in complex with terminal fragments of the HP4 revealed a synergistic interplay of the LaM and the RRM1 to clamp the single stranded oligo-uridine substrate [5] (Fig. 1B). This arrangement had been previously observed also for the La module of human La in complex with oligo-uridine RNA [4,6]. Remarkable differences between the La modules of both proteins are nevertheless found within the RRM1 [5], suggesting an adaptation to different functional requirements. Indeed, replacement of the RRM1 of LARP7 with the RRM1 of La abolishes the interaction with the 7SK RNA in vivo [20]. Classical RRMs consist of a four-stranded β-sheet packed against two α-helices, whereby canonical RNA contacts are formed at the surface of the β-sheet (reviewed in [37]). In addition, the RRM1 of La contains a C-terminal α-helix, which, together with one β-strand, is apparently lacking in LARP7 [5]. Importantly, the cleft formed by the LaM and the RRM1, tailored to bind the single-stranded oligo-uridine end of LARP7 and La targets, does not involve the canonical RNA binding surface of the RRM1. This suggests that the RRM1 of LARP7 and La is available for additional RNA contacts [4–6,38] and may explain the different target selectivity between the two proteins (reviewed in [12]).

Despite the ability of the La module of LARP7 to bind RNA strongly in vitro [5,10], it is not sufficient for 7SK RNA binding and stabilization in vivo [20], suggesting that other segments are necessary for the efficient LARP7-7SK RNA interaction. Experimental evidences indicate that the long central linker of LARP7 might contribute to such additional contacts, in particular via several stretches of basic residues [5,10].

While further investigations are required to fully understand the rather enigmatic role of the central linker, it is meanwhile clear that, in addition to the La module, the C-terminal RRM2 constitutes a second fundamental RNA binding region on LARP7 [5,9,10,20]. Isothermal titration calorimetry (ITC) measurements, for example, demonstrated that the RRM2 has an additive effect on the binding of the 7SK HP4 in the context of the full-length LARP7 protein [10]. The same study also solved the crystal structure of the isolated RRM2 in complex with an artificial RNA encompassing parts of the HP4 and revealed that the RRM2 is placed like a cap on top of HP4 [10] (Fig. 1B). These data also uncover the sequence-specific binding of the RRM2 to the apical loop of HP4, confirming previous biochemical observations [5,9,26]. Of note, a two-nucleotide bulge within the stem of HP4 has also been shown to impact on the interaction with LARP7 [26,39]. Since the RNA substrate co-crystallized in complex with RRM2 did not encompass the region containing the bulge [10], LARP7 binding to this region cannot be fully excluded. Such interaction could occur, for example, in solution during the assembly or remodelling of LARP7-7SK RNA complexes.

Despite this ongoing debate, the structures of the RNA-free [9] and RNA-bound RRM2 [10] of the human LARP7 protein clearly uncovered several peculiarities shared with the RRM2 of genuine La [35,40], as well as p65, an ortholog of LARP7 in the ciliate Tetrahymena [35]. Canonical RRMs possess two conserved consensus motifs, referred to as RNP1 and RNP2, and both participate in RNA interactions (reviewed in [37]). The RRM2 of LARP7 family members, however, lacks these motifs. Instead, the RRM2 has acquired a novel consensus motif (RNP3), which relocated to a different region of the β-sheet surface but still contributes to RNA binding. An even more distinguishing feature of this atypical RRM is the presence of an α-helical extension (α3) at its C-terminus, which is positioned in a perpendicular orientation, across the β-sheet surface [9,10,35,40,41]. In the case of human LARP7, the RRM2 resembles a clip and inserts on top of the HP4 of the 7SK RNA providing additional contacts to the double-stranded stem of HP4 [10] (Fig. 1B). Due to these characteristic structural features, the RRM2 of La and LARP7 family members have been grouped into a distinct subclass of the RRM domain which was named ‘xRRM’ [9,10,35,41].

The comparison of the RNA-free NMR structure [9] and the RNA-bound X-ray crystal structure [10] of the LARP7 RRM2 further revealed that the C-terminus of the domain becomes more structured in the presence of RNA. This results in the extension of half a turn of the α3-helix (referred to as α3x), a peculiarity which is found also in the xRRM of p65 [35]. Finally, the very C-terminus of LARP7 appears to be rather unstructured, although an additional α-helix (α4) might be present [9]. A recent study conducted by our labs indicates that this region of LARP7 might play a critical role for the interaction with RNA [42], but also a function unrelated to RNA binding has been reported and will be described in one of the following sections [39].

Taken together, LARP7 uses two distinct RNA binding modules to grab the 3ʹ terminal hairpin of the 7SK RNA. While the La module pinches the 3ʹ end, the RRM2 clips onto the apical loop of HP4 (Fig. 1B). However, these studies were restricted either to the isolated La module or RRM2 and they applied short or artificial RNA constructs. It is possible that the interaction of LARP7 to the 7SK RNA in the context of full-length protein and RNA still bears additional surprises.

LARP7 and MePCE form the core of the 7SK snRNP

LARP7 binding stabilizes the 7SK RNA in vivo and hence ensures 7SK RNP formation and function. However, 7SK RNA stabilization is not accomplished solely by LARP7, which indeed acts in concert with the methylphosphate capping enzyme (MePCE), also known as BCDIN3 [20,28,30,43–45] and both are core components of the 7SK snRNP [44] (Fig. 2). An interdependency of these two proteins is also supported by the fact that decreased MePCE levels are accompanied by downregulation of LARP7 [44,46]. Inversely, however, LARP7 knockdown does not affect MePCE expression [43].

Figure 2. Maturation of the 7SK snRNP. The upper panels depict the maturation events according to the current model. The 7SK RNA is immediately bound by La (blue) upon transcription by Pol III. While the La module of La binds the 3ʹ end of the 7SK RNA, other regions of La might contact the 5ʹ end of bound substrates as well [147,148]. Binding of MePCE (green) to the 7SK RNA and the following methylation of the cap were proposed to allow replacement of La with LARP7 [49]. The resulting core 7SK snRNP is composed of the 7SK RNA, MePCE and LARP7 and is boxed in yellow. The C-terminus of LARP7 was shown to inhibit the catalytic activity of MePCE (represented by the red dashed line) and was thus named MePCE inactivating domain (MID). As an alternative to this La-dependent 7SK snRNP maturation pathway, chromatin bound MePCE might directly sequester and methylate the 5ʹ end of nascent 7SK RNA transcripts precluding the access by La and recruiting LARP7 (La-independent pathway, lower left panel)

Importantly, MePCE contains a methyltransferase domain accounting for the transfer of a methyl group from S-adenosyl methionine (SAM) to the γ phosphate at the 5ʹ end of the 7SK RNA. This catalytic activity is involved in the stabilization of the RNA [30,45] and the acquisition of the cap modification might also govern the maturation of 7SK RNA particles.

A small fraction of cellular 7SK RNA has been found associated with the La protein [26,47,48], which might reflect a pool of newly transcribed 7SK RNA immediately bound by La upon transcription termination. Since it is very likely that the interaction of the 7SK RNA with La or LARP7 is mutually exclusive, La needs to be replaced by LARP7 for maturation of 7SK snRNPs. It has been thus hypothesized that the methylation of the 5ʹ end of the 7SK RNA by MePCE might induce the dissociation of La from the transcript and could prime newly synthetized RNAs for the assembly into 7SK snRNPs [49,50] (Fig. 2, La-dependent pathway, upper panel).

In contrast, this proposed order of maturation events during the biogenesis of 7SK snRNPs is questioned by the fact that capping of the 7SK RNA by MePCE seems to occur co-transcriptionally [43]. Taking this into consideration, nascent 7SK transcripts might be pre-bound by MePCE resulting in the rapid acquisition of the 5ʹ γ-monomethyl phosphate cap. In this La-independent maturation pathway, La binding to the 3ʹ end of the fully transcribed RNA would be impeded by the cap modification [50] and most 7SK RNA transcripts could directly assemble with LARP7 (Fig. 2, lower panel). Concomitantly, the spurious interaction of La with the 7SK RNA might be indicative for an alternative minor La-dependent maturation pathway (Fig. 2).

Of note, MePCE also modifies the 5ʹ end of the spliceosomal U6 snRNA [30,51]. In this case, however, the enzyme dissociates from the RNA upon catalysis while it remains as stable component in the 7SK RNP. Indeed, protein factors assembling on 7SK snRNP complexes might further strengthen the association to MePCE accounting for its selective retention on the 7SK RNA [39]. Structural models suggest that specific regions in LARP7, together with elements on the 3ʹ segment of the 7SK RNA, cooperatively form a binding surface allowing docking of MePCE [10]. This model is further supported by the finding that the presence of LARP7 is sufficient to tether MePCE on 7SK RNA variants lacking RNA segments directly bound by MePCE [26]. Inversely, MePCE alone is not able to recruit LARP7 to 3ʹ terminally truncated 7SK RNA variants highlighting the importance of this region for the formation of a stable trimeric complex [26,43].

In a current model, protein-protein contacts within the core 7SK snRNP were proposed to be formed between the catalytic center of MePCE and the very C-terminal part of LARP7 [39,43]. This region of LARP7, which follows the RRM2 and comprises the aforementioned α4 helix, was shown in in vitro assays to be critically required for the recruitment of MePCE to the 7SK RNA [39]. Intriguingly, it also confers LARP7 the ability to repress the methylation activity of MePCE [39,43] and has been thus referred to as MePCE inactivating domain (MID), albeit the physiological relevance of this property has not yet been entirely understood [39].

The interaction between MePCE and LARP7, while assembled on the 7SK RNA, has important structural implications. Taken into consideration that MePCE primarily associates at the 5ʹ end of the RNA, whereas LARP7 grasps its 3ʹ end, it is appealing to speculate that these protein-protein contacts might promote a non-covalent circularization of the 7SK RNA. Indeed, such a ‘closed’ conformation of the 7SK RNA [52] has been proposed as an alternative to the ‘open’ structure model suggested by Wassarman and Steitz [48]. Native 7SK snRNP complexes are believed to be dynamic and to undergo substantial remodelling in terms of protein composition [28,53] and RNA structure [39,54,55]. The latter involves the formation of alternative intramolecular base-pairing interactions within the 7SK RNA (Fig. 3). The concept that such reshaping occurs and might even mediate the 7SK snRNP function, reconciled the two opposing models suggesting that the ‘open’ and the ‘closed’ conformations of the 7SK RNA cycle between each other and co-exist in a native environment [39].

Figure 3. Models for the conformation of the 7SK RNA within the core 7SK snRNP. (A) In the ‘open’ structure model, the initial segment of the 7SK RNA is involved in basepairing interactions, which extend the stem of the 5ʹ terminal hairpin. In this conformation, the MID of LARP7 is not able to repress the catalytic activity of MePCE (represented by the red dashed lines). (B) In the ‘closed’ structure model, the 5ʹ end of the 7SK RNA basepairs with the basal region of HP4, joining the ends of the RNA. Consequently, LARP7 and MePCE are also in close spatial proximity and the MID of LARP7 inhibits MePCE by direct contacts (red lines)

The models are adapted from Wassarman and Steitz (1991), Marz et al. (2009) and Brogie and Price (2017). The RNA segments forming the alternative interactions are depicted in orange.

Direct interactions between MePCE and LARP7 in the ‘closed’ state of the 7SK RNA can be easily inferred. In contrast, it is more tempting to assume that MePCE and LARP7 are physically separated in the ‘open’ configuration, where the ends of the 7SK RNA are not joined by RNA-RNA contacts [39]. The possible switching between the two conformations of the 7SK RNA consequently implies breaking or, at least, rearranging of MePCE-LARP7 interactions. Further studies are required to provide more details concerning this interesting structural aspect.

The 7SK snRNP controls availability of active P-TEFb

As a core component of the 7SK snRNP, LARP7 participates in all processes regulated by this complex. Although it is not the scope of this review to provide a detailed overview of the latest discoveries related to 7SK snRNP functions (reviewed in [56–58]), some emerging concepts and in particular those implying a direct role of LARP7 will be discussed in the following section.

In general, within the 7SK snRNP, different protein components are tethered to the RNA scaffold. The most extensively studied example is the binding and inactivation of P-TEFb, which consists of the cyclin-dependent kinase 9 (CDK9) and a cyclin T (either T1 or T2) [59,60]. This is achieved by the repressor protein HEXIM1 or its paralog HEXIM2, which are also found within 7SK snRNP complexes [61–65]. Interestingly, the 7SK RNA itself works as an allosteric co-factor inducing conformational changes in the HEXIM proteins [64,65], which in turn allows the formation of an inhibitory interaction between HEXIMs and P-TEFb [66,67].

P-TEFb plays an essential role during gene expression as it phosphorylates Ser2 in the C-terminal domain (CTD) of Pol II as well as subunits of two repressive complexes (DSIF and NELF) that maintain the polymerase in a promoter-proximal paused state (reviewed in [68]). The establishment of the post-translation modification on these targets ultimately allows Pol II to transit into the productive elongation phase. Thus, the current model implies that LARP7 and the 7SK snRNP are important regulators of P-TEFb activity (reviewed in [31]). During the last years of research, the captivating concept arose that the 7SK snRNP does not simply buffer the P-TEFb reservoir broadly attenuating Pol II activity. Instead, it is now thought that inactive P-TEFb is required to be delivered via 7SK snRNPs to the sites of request (reviewed in [36,56,58,69]). In support of this notion, cumulative reports argue that 7SK snRNPs associate with chromatin and in particular with promoter-proximal regions [70–77], albeit a broader occupancy across the entire length of transcribed genes has been also reported for the 7SK RNA as determined by sequencing of chromatin isolation by RNA purification (ChIRP-seq) experiments [55]. The triggered release of P-TEFb from the chromatin-associated inhibitory complexes would subsequently relieve Pol II pausing directly ‘on site’ (reviewed in [36,56,58,69]).

The question then arises of how the 7SK snRNP is tethered on genes with paused Pol II. Several scenarios have been envisioned to address this point, whereby the 7SK RNA was proposed to directly bind to a specific set of histones [72]. 7SK snRNP-associated proteins such as HMGA1 [78] or WDR43 [73] were also shown to link the complex to chromatin and act thereby as adaptor molecules. Intriguingly, LARP7 itself might also act as an anchor point through direct binding of KAP1 (also known as TRIM28), a protein associated with chromatin-bound transcription factors [75].

How does the controlled release of P-TEFb occur? Diverse stress factors, e.g., inhibition of transcription or UV-light [59,60] as well as genotoxic stress induced by DNA-damaging agents [79] were found to liberate P-TEFb and some of the underlying molecular mechanism have been revealed (reviewed in [57,58]). These involve, for example, the post-translational modification of selected 7SK snRNP-associated proteins [57,80]. Importantly, a specific 7SK snRNP complex has been identified where HEXIM and P-TEFb are replaced by a set of heterogeneous nuclear ribonucleoproteins (hnRNPs) [28,81] and binding of these proteins appears to be a prerequisite for efficient P-TEFb release [53,82]. Thus, the 7SK snRNP was proposed to exert its regulatory function by cycling between the alternative P-TEFb-bound or hnRNP-bound states [39,82].

In contrast to this scenario, which would allow re-cycling of the 7SK snRNP components, the release of P-TEFb can be also obtained by more drastic strategies triggering the irreversible disassembly of the whole 7SK snRNP. This might for example occur upon removal of the monomethyl cap at the 5ʹ end of the 7SK RNA by the JMJD6 demethylase, which is expected to cause the destabilization of the RNA [72]. Furthermore, during the differentiation of megakaryocytes, the blood cells accounting for platelets production, the protease calpain 2 was found to cleave MePCE leading to a pronounced reduction of its steady-state levels. LARP7 expression was also downregulated in megakaryocytes compared to the progenitor cells, but this effect occurred in a calpain 2-independent manner as it was not reversed by its inhibition. Together, the severe and global depletion of the core 7SK snRNP components causes the activation of P-TEFb and guarantees the correct megakaryocytic morphogenesis and perturbation of this pathways are associated with a subset of human megakaryocytic leukemias [83].

In the mechanisms unravelled so far, the 7SK snRNP is manipulated in a way that weakens its interaction with P-TEFb, which apparently simply falls off the 7SK snRNP. According to other studies, however, specific factors extract P-TEFb more actively from the repressive complex [54,70,72–74,79].

One of these factors was shown to be the WDR43 protein, which is not only implicated in the recruitment of the 7SK snRNP to nascent RNA transcripts but also stimulates the release of P-TEFb from the complex [73]. Although the underlying molecular mechanism is yet unknown, the authors of this work hypothesize that it might occur in phase-separated ‘transcription bodies’. They further provide first evidences suggesting that the formation of such structures is promoted by low-complexity sequences within the WDR43 protein, as well as DDX21, another known 7SK snRNP interactor [73]. It will be interesting to follow, whether such a dependency on a phase-transited environment for the function of the 7SK snRNP will be further corroborated. Of note, the RNA helicase DDX21 itself also promotes the release of P-TEFb and this turned out to strictly require its ATP-dependent catalytic activity [74].

The involvement of an RNA helicase [53,74,84], as well as the structural changes observed upon hnRNP binding to the 7SK RNA [39,54], suggest that the remodelling of the RNA scaffold might be a prerequisite for the dissociation of P-TEFb from the 7SK snRNP. The contribution of RNA remodellers to this process might thus be more significant than so far appreciated. Intriguingly, LARP7 possesses RNA chaperone activity [19] similarly to its paralog La [85–89] and it is thus tempting to speculate, that this intrinsic property might mediate at least some of the dynamic rearrangements within the life-cycle of the 7SK snRNP. Furthermore, the ability to induce conformational changes of the bound RNA substrate has been attributed as well to the LARP7-related p65 protein in Tetrahymena [90–92].

Unfortunately, direct experimental evidences supporting the idea that metazoan LARP7 proteins might actively reshape the structure of the 7SK RNA are still missing. Nevertheless, it is remarkable that RBM7, a factor recently described to promote the release of P-TEFb upon genotoxic stress, directly interacts with LARP7 and MePCE [79]. Still, it is unclear, how this accounts for the observed effects on the composition of the 7SK snRNP. Might any of the identified P-TEFb release factors exert its function by inducing the RNA chaperone activity of LARP7?

LARP7 in P-TEFb-unrelated 7SK RNPs

7SK snRNP complexes with diverse protein composition have been identified over the last years and it became more and more clear that P-TEFb is not a constitutive component of all these particles. Some of them reflect intermediates within the cycle of P-TEFb regulation, whereby P-TEFb alternates with hnRNPs for binding to the 7SK snRNP [53,82].

However, dedicated 7SK snRNPs exist and they act in pathways unrelated to the regulation of P-TEFb activity (reviewed in [55]). Genome-wide analyses using ChIRP-seq and ChIP-seq techniques provided indications for this idea by revealing distinct occupancy profiles for the 7SK RNA and other canonical 7SK snRNP components, as well as for selected transcription factors and chromatin marks [55]. The intersection of these datasets has driven the hypothesis that the 7SK RNA forms different complexes which might fulfill dedicated functions at distinct genomic locations. Biochemical experiments confirmed that the 7SK RNA not only co-localizes but also interacts with the BAF chromatin remodelling complex suggesting an additional function of specific 7SK snRNPs in the re-positioning of nucleosomes [55].

Biochemical investigations demonstrated that the 7SK RNA, together with MePCE and LARP7, are integral components of the little elongation complex (LEC) and that they are required for its structural integrity [76]. This multi-subunit transcription factor is specifically involved in the Pol II-dependent synthesis of snRNAs and of small nucleolar (sno)RNAs transcribed from independent promoters. Indeed, knockdown of LARP7 impaired the recruitment of the LEC and consequently of Pol II to the promoters of such genes. This ultimately lead to the reduced production and accumulation of the snRNAs transcribed by Pol II and of those snoRNAs, which are not processed from the introns of host genes [76].

Finally, a distinct 7SK snRNP has been also recently identified in the cytoplasm of motoneurons and this localization depends on the interaction with hnRNP R, which directly binds the 7SK RNA [93]. Both were detected in close proximity of each other along the axons and in the growth cones of motoneurons. Depletion of hnRNP R or the 7SK RNA caused a reduction in axon growth and both components were required for the axonal translocation or stabilization of a subset of transcripts [93]. Given its vital role in the stabilization of the 7SK RNA, it is likely that also LARP7 might be a component of this cytoplasmic neuronal 7SK RNA-hnRNP R complex but this aspect has not been addressed experimentally.

Moonlighting function of LARP7 proteins

During the process of DNA replication, linear chromosomal ends are progressively shortened, which, however, is prevented by the action of the telomerase complex. This RNP consists of a non-coding telomerase RNA (named TR, TER or TERC), a reverse transcriptase subunit (TERT) and several accessory factors (reviewed in [94]). Among them, members of the LARP7 protein family were found to be integral components of the mature telomerase holoenzyme in ciliates [95–97]. TER is transcribed by Pol III in these organisms, and in Tetrahymena, for example, the LARP7 ortholog p65 [17,18] binds the 3ʹ terminal oligo-uridine stretch and additional elements of the RNA [98–100]. The interaction with p65 stabilizes TER, mediates the hierarchical and efficient assembly of the telomerase complex and promotes its catalytic activity [90,91,97,100,101].

Despite the ancestral function of the telomerase complex, an astonishing diversity in the origin and biogenesis of this RNP is found across eukaryotes (reviewed in [94]). In the yeast Schizosaccharomyces pombe, TER is produced by Pol II and its maturation requires an elaborated processing strategy, including an unconventional single cleavage reaction performed by the spliceosome. This event is stimulated by the binding of the Sm proteins, which are replaced in the mature telomerase complex by the Sm-like (LSm) complex [102]. Sm and LSm proteins are structurally related proteins, which assemble in heteroheptameric ring structures enclosing the bound target RNAs (reviewed in [103,104,105]). Surprisingly, three independent groups reported that the S. pombe LARP7-like protein Pof8 (or Lar7) promotes the loading of the LSm complex on TER. Similarly to p65 in Tetrahymena, Pof8 was identified as a constitutive component of the telomerase holoenzyme, which also depends on this protein to exert its full catalytic activity [32–34]. The involvement of a LARP7 family member in the maturation of telomerase complexes containing Pol II-transcribed TER was rather unexpected and might be interpreted as a hint for a more general function of LARPs in the assembly of this RNP. While no evidence has been found to date for a possible incorporation of LARP7 within the human holoenzyme [106–108], telomerase activity was recently reported to be reduced upon depletion of LARP7 and cell lines carrying LARP7 loss-of-function mutations had shorter telomeres [109] indeed suggesting a role of LARP7 family members in this process. This model, however, needs to be further corroborated.

In addition to the binding to non-coding RNAs, Larp7 overexpressed in mouse embryonic stem cells (mESCs) has been found to co-immunoprecipitate the mRNA of the developmental regulator Lin28. Apparently, this association accounts for the recruitment of the terminal nucleotidyl transferase Tut1/Star-PAP to the Lin28 mRNA and is critical for the stability of this transcript. Albeit the molecular mechanism how Tut1/Star-PAP preserves the expression of Lin28 has not been elucidated, perturbation of this axis was found to prime mESC for precocious differentiation [110].

Intriguingly, TUT1/Star-PAP also extends the primary transcript of the U6 snRNA with non-templated uridines [111,112]. In addition, a number of other LARP7 interactors have been implicated in U6 snRNP biogenesis as well [28,110]. Most prominently, MePCE catalyzes the same 5ʹ γ-monomethyl phosphate cap modification on the U6 and the 7SK snRNA [51]. In contrast to the 7SK snRNP, however, the interaction with the U6 snRNA is restricted to its maturation process and MePCE is not found in mature U6 particles. More recently, LARP7 was co-immunoprecipitated from cell lysates together with METTL16 [113], which was identified as the methyltransferase responsible for the m6A modification of an internal base within the U6 snRNA [113,114]. In order to join the splicing reaction, U6 snRNA-containing particles are assembled step-wise into the U4/U6 di-snRNP and finally into the U4/U6.U5 tri-snRNPs, whereby the former process is assisted by SART3 [115–117]. SART3 was among the first factors identified within 7SK snRNP complexes [30] and it was repeatedly detected in LARP7 affinity purifications [28,75], again linking LARP7 with U6 biogenesis. The U6 snRNA itself was observed to interact with LARP7, but was initially considered as an unspecific binding event [26,28]. Two studies have recently challenged this interpretation and demonstrated that LARP7 fulfils a yet unappreciated role for the biogenesis of the U6 snRNP. Indeed, the depletion of LARP7 resulted in the specific loss of 2ʹ-O-methylated nucleosides within the U6 snRNA [42,118]. These post-transcriptional modifications are introduced with the help of C/D box snoRNAs, which undergo base-pairing interactions with target RNAs and, by that, direct snoRNPs containing the methyltransferase fibrillarin to selected sites (reviewed in [119]). In this context, LARP7 was shown to use the La module to bind to the U6 snRNA, while contacts to the U6-specific snoRNAs occur via the RRM2 [42,118]. This model describes LARP7 as the first auxiliary factor supporting efficient snoRNA-mediated snRNA modification. Furthermore, the two RNA binding platforms are proposed to act as distinct modules bridging the interaction between two different RNAs, which is, according to the current knowledge, a unique characteristic among LARPs.

Evidences for the requirement of LARP7 during bacterial infection

LARP7 has been linked to several human pathologies, most recently to infectious diseases caused by intracellular bacterial pathogens [120]. Members of the Legionella genus, which are responsible for the Legionnaires’ disease, secrete so-called effector proteins into the infected cells to target and to manipulate molecular pathways of the host (reviewed in [121]). For example, in L. pneumophila the ankyrin repeats-containing protein Ankyrin H (AnkH, alternatively also known as LegA3 or Lpg2300), supports efficient intracellular replication of the pathogen [122]. Intriguingly, AnkH is among the very restricted set of effector proteins with homologs across all Legionella species, as well as other human and animal pathogenic bacteria, like e.g., the Q fever-causing Coxiella burnetii [123–125]. In order to define the molecular role of AnkH, interaction studies were performed and LARP7 was identified as the top AnkH binding partner. Confirming the relevance of this interaction, efficient infection of human cells with L. pneumophila depend on the presence of LARP7. AnkH was thus proposed to hijack LARP7 and to partially prevent its function in the context of the 7SK snRNP. It was further hypothesized that the assumed perturbations of P-TEFb activity would be responsible for transcriptional reprogramming of infected cells, which ultimately favours bacterial replication [120]. While this model needs to be further solidified, the striking occurrence of AnkH homologs across several intracellular pathogens, hints towards a pivotal and widespread mechanism of LARP7 targeting by bacterial proteins with important implications for infection.

LARP7: a putative tumor suppressor for different cancer types

The involvement of LARP7 within the 7SK-mediated sequestration of P-TEFb and the resulting inhibition of Pol II activity supported the obvious concept that LARP7 might act as a tumor suppressor [20]. Indeed, a link between LARP7 mutations and human diseases was established in gastric cancer by a study analysing the frequency of mutations occurring within microsatellite repeats [126]. Microsatellites are short repetitive sequences across the genome that are particularly prone to acquire length variations, a phenomenon that is termed microsatellite instability [127]. Of note, two distinct microsatellites within the LARP7 genes were reported to be frequently mutated in different cancers [126,128,129]. Such events can have detrimental consequences if the short repeat is located within a protein coding sequence, as insertions or deletions can lead to frameshift mutations. This is the case for one of the microsatellites embedded within the LARP7 gene where a stretch of eight consecutive adenosines was found to be altered in gastric tumors [126]. The location of this microsatellite repeat would likely induce non-sense mediated decay (NMD) of LARP7 mRNAs with a shifted coding frame. Nevertheless, even if these transcript variants would be translated, the resulting LARP7 proteins would lack the RRM2 and, consequently, would be unable to stabilize the 7SK RNA in vivo [20].

Reduced LARP7 expression was later confirmed in gastric cancer [130], in invasive breast cancer [131] as well as in papillary thyroid cancer [132]. While the relevance of LARP7 microsatellite polymorphisms has not been addresses in these studies, it is conceivable that deregulated LARP7 expression could not only be dependent on such genomic mutations but also be caused by other pathways.

Importantly, several studies report that reduced LARP7 levels promote tumorigenesis, cancer progression and metastasis formation [20,130–132] corroborating the assumption that LARP7 is a tumor suppressor gene. While this might hold true for invasive breast cancer [131], data obtained from other systems point towards the existence of a compensatory mechanism counteracting the possible hyperproliferative outcome expected upon LARP7 reduction. Indeed, prolonged knockdown of Larp7 in mESCs [110] or similar experiments performed in human cell lines [28] were accompanied by a reduction in the protein levels of the P-TEFb subunit Cdk9 [110]. This or similar, but yet unidentified reactions, are possible reasons why a general enhancement of Pol II activity was absent despite the expected increase in P-TEFb availability following breakdown of 7SK snRNPs [42,118].

Loss-of-function mutations in the LARP7 gene cause the Alazami syndrome

Mutations in the LARP7 gene have been associated over the last years with a rare developmental disease termed Alazami syndrome [133]. To date, around 20 different patients have been reported carrying homozygous or compound heterozygous LARP7 loss-of-function mutations [42,109,133–140]. This severe disease manifests in developmental defects, most prominently, primordial dwarfism, which is characterized by pre- and postnatal growth retardation. A broad spectrum of additional skeletal and in particular facial dysmorphisms, including microcephaly, contribute to the complex phenotype of the disease. Severe intellectual disability with delayed speech development, as well as delayed motor development are additional characteristic symptoms [134,136]. Most of the Alazami-associated mutations identified so far introduce frameshifts in the coding sequence of the LARP7 mRNA and likely cause NMD. However, this has been experimentally demonstrated only in one study showing a complete loss of LARP7 in patients-derived cells [133]. A possible exception might occur in a recently reported case of Alazami syndrome, whereby a patient has a genotype of biallelic LARP7 variants, one being a frameshift mutation and the other one an intronic point mutation [139]. The latter variant possibly leads to compromised splicing, although residual expression of functional LARP7 protein might occur and could account for the milder clinical phenotype manifested by this specific patient [139]. More recently, a short deletion in the LARP7 gene has been described. This mutation destroys the last intron-exon boundary and, in a homozygous state, causes the Alazami syndrome in two siblings of a consanguineous family. Importantly, this specific mutation does not lead to NMD of the transcript, but results in a LARP7 protein variant with a slightly different C-terminus and impaired functionality [42]. The pleiotropic nature of the Alazami syndrome and the broad expression of LARP7 across tissues [21,44,133] are indicative of its widespread physiological relevance. Nevertheless, the exact function of LARP7, which becomes critical for the pathogenesis of the Alazami syndrome, remains elusive. Concomitantly with the first reports of Alazami patients, it has been critically questioned, whether the etiology of the disease could be explained in a satisfying way by the canonical role of LARP7 in promoting P-TEFb-dependent transcriptional control through the stabilization of the 7SK snRNP [133]. A major concern is the fact that the affected patients do not display any phenotype that might be readily explained by the hyper-activation of Pol II transcription, a consequence that can be envisioned due to LARP7 deficiency. Nevertheless, a single Alazami patient has been reported to date, who has been additionally diagnosed with papillary thyroid carcinoma. Analysis of a tumor biopsy identified the somatic V600E mutation in the BRAF gene, which is the most common mutation found in this type of cancers. Driven by the surprisingly young age of the patient affected by this malignancy, the authors of the study speculated that loss of LARP7 in Alazami patients might cause an increased susceptibility for the acquisition of cancer-related mutations [140].

Differing from this report, the vast majority of Alazami patients do not provide indications for uncontrolled cellular proliferation or tumorigenesis. Instead, the syndrome is characterized by the opposite effect, such as reduced growth [21,44,133]. Similarly, Larp7 depletion caused severe developmental defects and reduced growth of zebrafish [44] and mouse embryos [21]. In contrast to LARP7 deficiency in human, homozygous Larp7 knockout mouse embryos died before birth. The reason for this discrepancy is unclear, but a possible explanation might be found in the presence of the microRNA (miRNA) cluster miR-302/367 within an intronic sequence of the Larp7 gene, which could be affected in these animals. Importantly, these miRNAs were demonstrated to control the differentiation of neural progenitor cells and depletion of the miR-302/367 cluster had detrimental consequences during the murine embryonic development leading to prenatal lethality [141,142]. Of note, miR-302/367 knockout embryos displayed defects in the closure of the neural tube, a phenotype which was also observed in some of the Larp7 knockouts [21]. Since it has not been assayed whether the generation of the knockout mice had an influence on the miRNA cluster embedded within the targeted Larp7 gene, it cannot be excluded that the reported embryonic lethality is due to impaired miR-302/367 expression.

Still, the mutations described in patients affected by the Alazami syndrome are not expected to compromise the correct biogenesis of the miRNAs encoded within the miR-302/367 cluster. Furthermore, the developmental defects, which are consistent between the patients and the phenotypes observed in different model systems, strongly indicate the primary cause for the disease is the loss-of-function of the LARP7 protein. Due to the role of LARP7 orthologs in the assembly of the telomerase complex in ciliates and yeast (see above), it has been hypothesized that a similar function might be conserved in human. In support of this idea, reduced telomere lengths were detected in the DNA isolated from leukocytes of Alazami patients, but the molecular mechanism underlying this finding are yet unclear. Intriguingly, knockdown of LARP7 affected the alternative splice pattern of the mRNA coding for the catalytic subunit of the telomerase complex and less functional transcript was produced [109]. Perturbations of alternative splicing upon knockdown of LARP7 were also reported previously. In this case, the effects were attributed to the P-TEFb-dependent regulation of transcription, which is tightly linked to pre-mRNA processing [44]. Two recent studies further indicate that LARP7 is an essential factor required for the post-transcriptional modification of the spliceosomal U6 snRNA and, by that, LARP7 might affect splicing robustness [42,118]. Taken together, several evidences point towards a possible role of aberrant splicing in the etiology of the Alazami syndrome.

Concluding remarks and perspectives

LARP7 plays key roles in various RNA pathways, including transcription, modification, processing and transport, which are summarized in Fig. 4. In order to fulfill this broad spectrum of functions, LARP7 assembles in distinct RNPs. A future challenge will be to isolate specific particles in order to determine their individual composition and, to acquire structural insights into such LARP7-containing complexes. The advanced studies of the 7SK snRNP in the context of P-TEFb sequestration have already provided clear mechanistic evidences for the dynamic nature of the LARP7 interactome. Even though different complexes might be related to the same pathway, they might undergo substantial remodelling in terms of protein composition and secondary structure of the bound RNA. This notion adds an additional layer of complexity to the attempt of defining specific functions for selected RNP subpopulations.

Figure 4. Versatile functions of LARP7. Summary of the 7SK snRNP-dependent (A) and – independent (B) roles of LARP7

A growing number of studies have revealed that the action of LARP7 has specific implications for certain cell and tissue types [21,110,118]. In particular, Larp7 was recently shown to be essential for male fertility, which was attributed to its role in the 2ʹ-O-methylation of the U6 snRNA [118]. Furthermore, varying subnuclear localization has been observed for Larp7 across embryonic tissues in flies, albeit the implications of this finding are not clear [143]. In human cells, LARP7 was mainly detected in the nucleoplasm [27,75] but it was also identified in nucleoli [144] as well as in Cajal bodies [76]. Indirect evidences also exist for the presence of LARP7 in nuclear speckles [145] and a small portion might even find its way to the cytoplasm [93]. Careful investigations are thus required to determine the mechanisms governing the subcellular trafficking of LARP7 and further address the consequences of this multifarious localization.

The variety of processes affected by LARP7 also complicates the interpretation of knockdown or knockout experiments, which impair all LARP7-related pathways at the same time. Complementation with LARP7 mutants that retain only selected activities will be a very helpful tool to resolve the intricate network of LARP7 functions. Successful examples for such an approach have been reported recently [42,118].

Albeit the role of the 7SK snRNP in the inhibition of P-TEFb has been well established, the exact consequences for the regulation of Pol II transcription are still rather puzzling. According to the current model, destruction of the 7SK snRNP is expected to result in increased P-TEFb activity, enhanced Ser2-phosphorylation of Pol II and, ultimately, increased general transcriptional output. However, this was neither the case in cell lines derived from an Alazami patient or HEK293 LARP7 knockout cells [42], nor in conditional Larp7 knockout mice [118] and also not upon knockdown of MePCE in breast cancer cells [45]. Only subsets of selected transcripts reacted to the loss of physiological amounts of 7SK snRNPs [42,118,146] and, even more surprisingly, reduced amounts of Ser2-phosphorylated Pol II were found at selected genes [45]. In light of the growing evidences indicating that the 7SK snRNP acts as a delivery system for P-TEFb (reviewed in [56,58]) and for other factors [53,55,70,74,76,82], more studies need to clarify how this targeted delivery is achieved and why only specific subsets of transcription units are susceptible to perturbation.

Acknowledgments

We thank Giriram Kumar Mohana and Jean-Yves Roignant for discussions.

Disclosure statement

The authors declare no conflict of interest.

Additional information

Funding

This work was supported by the Deutsche Forschungsgemeinschaft [Me2064/9-1]; Deutsche Forschungsgemeinschaft [Fi573/15-2]; Deutsche Forschungsgemeinschaft [Fi573/20-1]; European Research Council [682291 “moreRNAs”].

References

Alternative formats

 

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.