Complete chloroplast genome sequence of Punica granatum ‘Nana’ (Lythraceae) and phylogenetic analysis

Abstract Pomegranate (Punica granatum L.) is of great significance both as a fruit tree and an ornamental plant. Hereon, we sequenced and characterized the complete chloroplast genome of Punica granatum ‘Nana’ and performed phylogenetic analysis concerning related species. It turned out that the length of chloroplast genome sequence reached 158,639 bp and exhibited a four-conjoined structure, i.e., a large single copy region (LSC, 89,022 bp), a small single copy region (SSC, 18,685 bp) and twain inverted repeat regions (IRa and IRb, 25,466 bp). 112 unique genes were identified, consisting of 78 protein-coding genes, four ribosomal RNA (rRNA) genes and 30 transfer RNA (tRNA) genes. The result of phylogenetic analysis based on Neighbor-joining (NJ) method was consistent with that of Bayesian inference (BI), which strongly supported that Punica granatum ‘Nana’ was close to its original species Punica granatum and they together had a close relationship with Heimia myrtifolia within Lythraceae.

Pomegranate (Punica granatum L.), belonging to Lythraceae, is not only economically important in agricultural production as a welcome fruit tree, but also an excellent ornamental plant in horticulture for its showy flowers (Qin and Shirley 2007). It originates from central Asia and nowadays has been cultivated in tropical and subtropical regions worldwide (Yan et al. 2019). The long history of planting pomegranate has witnessed many varieties and cultivars with good taste and abundant progenies (Al Khayri et al. 2019). Hence, we described the traits of Punica granatum 'Nana' chloroplast genome and performed phylogenetic analysis, which would promote genetic and breeding research within this genus.
Fresh leaves of Punica granatum 'Nana' were collected in the nursery of Zhejiang A&F University, Hangzhou, Zhejiang province, China (30 13 0 48 00 N, 119 43 0 12 00 E). The specimen was stored at Herbarium of Zhejiang A & F University (specimen code ZAFU1912245). Extracting total genomic DNA was done according to the method proposed by Doyle (1987) and Yang et al. (2014). Following the establishment of a sequencing library based on purified DNA, paired-end reads were primarily obtained utilizing the Illumina High-throughput Sequencing technology, and then Trimmomatic v0.3 (Bolger et al. 2014) worked to filter raw reads. The process of de novo assembly was accomplished by CLC v9.11 (Nicolas et al. 2017). The alignment of contigs was under the BLAST algorithm (Johnson et al. 2008) with Punica granatum plastid genome as reference (Gu et al. 2019). The genome was annotated using DOGMA v1.2 (Wyman et al. 2004) and submitted to GenBank (Accession number: MN833212). MsatCommander v0.8.2.0 (Faircloth 2008) was utilized to identify simple sequence repeats (SSRs). Plastid circular map was drawn via OGDRAW v1.3.1 illustrated by Greiner et al. (2019).
The chloroplast genome size of Punica granatum 'Nana' reached 158,639 bp, exhibiting a typical four-conjoined structure, i.e., a large single copy region (LSC, 89,022 bp), a small single copy region (SSC, 18,685 bp) and twain inverted repeat regions (IRa and IRb, 25,466 bp). The overall GC content was 36.92%, and detailedly, corresponding contents to LSC, SSC and IR regions were 34.89%, 30.63% and 42.78%, respectively. 53 simple sequence repeats (SSRs) were found out, among which 50(94.34%) were mono-repeats, 1(1.89%) were direpeats and 2(3.77%) were tri-repeats. A total of 112 unique genes were detected, including 78 protein-coding, four ribosomal RNA (rRNA) genes and 30 transfer RNA (tRNA). There were 83 genes found in LSC, with 12 in SSC and 17 in IR regions, respectively. Eight protein-coding and six tRNA genes contained only one intron whereas three genes (ycf3, clpP and rps12) contained two. Gene rps12, particularly, was trans-spliced, characterized by first exon locating in LSC and the other two in IR regions.

Disclosure statement
No potential conflict of interest was reported by the author(s).

Funding
This work was supported by the Zhejiang Provincial Natural Science Foundation of China [LY17C160003].

Data availability
The data that support the findings of this study are openly available in GenBank at https://www.ncbi.nlm.nih.gov/genbank/, reference number: MN833212.