Complete chloroplast genome of sageretia thea (rhamnaceae), an ornamental fruit and medicinal tree

Abstract Sageretia thea (Osbeck) M.C. Johnst (1968) is an important fruit and medicinal species of Rhamnaceae family. The complete chloroplast genome (cp) of Sageretia genus was sequenced and reported for the first time in this study. The cp genome had a total length of 161,352 bp, consisting of a largesingle-copy (LSC) region of 89,802 bp, a small single-copy (SSC) region of 18,914 bp, and a pair of inverted repeat (IR) regions totaling 26,318 bp. The plastid genome contained 129 genes, including 84 protein-coding genes, 37 tRNA genes, 8 rRNA genes. The overall GC content of the genome was found to be 37.10%. Phylogenetic analysis based on comparison with 27 chloroplast genomes revealed that S. thea is closely related to genera Rhamnus and Berchemia. The findings of this study can provide fundamental insights for the conservation, exploitation, and systematic genomic investigation of Rhamnus plants.


Introduction
Sageretia plants, belonging to Rhamnaceae family and first discovered in 1826, have been documented with over 40 species.It is primarily distributed in southern and eastern Asia, with a few species found in the Americas and Africa.China alone accounts for 22 of these species.The majority of Sageretia plants possess high economic and horticultural value, Sageretia thea (S. thea) was one of the most significant fruit and medicinal plant.The underground part of S. thea possesses properties such as expectorant, carminative, and dehumidifying effects.Previous studies on S. thea have primarily focused on its fruit yield and medicinal ingredients (Sang et al. 2015;Khalil et al. 2021), with limited attention given to its distribution and evolutionary relationships within the Sageretia genus.To date, there is no complete chloroplast genome available for Sageretia species, which could provide valuable information for exploring phylogenetic relationships within the Rhamnaceae family and serve as a foundation for future research.
0295848.The fresh leaves were frozen in liquid nitrogen and stored at −80 C until further use.Total genomic DNA was extracted using Doyle's (1987) method.Paired-end reads of 150 bp were generated using illumina NovaSeq 6000 platform (Illumina, San Diego, CA).SPAdes v3.10.1 software was used to de novo assembly the cp genome (Bankevich et al. 2012;Kongkachana et al. 2022).The fastp v0.20.0 (https:// github.com/opengene/fastp)software were used to filter the raw data, the filtration criteria were as follows: (1) truncate the sequencing linker in the Reads and the primer sequence (2) filter out the Reads whose average quality value is less than Q5, (3) filter out N Reads whose number is more than 5.The de novo assembly was performed using filtered reads.The clean data consisted of a total base count of 5,300,864,700 with a percentage mass value greater than or equal to 20 up to 96.75%.The cp genome of Berchemia flavescens (GenBank accession MK460212.1)was used as the reference genome for quality control in this study (Figure S1).Twenty six plastid genomes of Rhamnaceae family were chosen to draw the phylogenetic tree, Vitis davidii and Vitis amurensis as outgroups.The sequences used in this study were downloaded from NCBI GenBank.The cp genome of S. thea were aligned with 27 species belonging to Rhamnaceae family from the same starting point using MAFFT v7.427 (auto mode).The evolutionary tree was build using RAxML v8.2.10 (https://cme.h-its.org/exelixis/software.html) software, chosen GTRGAMMA model and set bootstraps as 1,000 based on the rapid Bootstrap analysis.

Discussion and conclusion
Due to their high-throughput, time-saving, and cost-effectiveness, Next-and Third-generation sequencing technologies have gradually gained popularity in genomic research (Cronn et al. 2008).Although the Rhamnaceae family comprises approximately 900 species, there is a limited availability of genomic sequences for this taxonomic group (Ma et al. 2017).Phylogenetic analysis of representative species from different genera within Rhamnaceae revealed a close relationship between the cp genome of S. thea and those of Rhamnus, Berchemiella, Rhamnella, and Berchemia; however, it showed distant relatedness to Ziziphus.Ziziphus, as a member of the Rhamnaceae family, is widely distributed in subtropical and tropical regions of Asia and America.On the other hand, S. thea exhibits wide distribution in subtropical regions of Asia.Therefore, we hypothesize that geographical isolation exists between S. thea and Ziziphus species leading to significant genetic diversity differences (Ma et al. 2017).
The chloroplast genome of S. thea was detected and analyzed in this study, providing essential insights for the conservation, utilization, and phylogenomic investigations of the Rhamnaceae family.

Figure 1 .
Figure 1.The reference image of the plant of sageretia thea(taken by misha zhan, photographed in the Mountain slope regions of the Zhejiang province, China; the most characteristic feature of the specimen: Rattan shrub, leaves subopposite or alternate, elliptic, margin serrate).

Figure 2 .
Figure 2. Gene map of the sageretia thea cp genome.The four areas (SSC, LSC, IRA and IRB) had been noted in black coil.Genes reside in the inside and outside of the outer circle are in the forward and reverse directions, respectively.The dark and light gray bars in the inner circle denote G þ C and a þ T contents, respectively.