The complete chloroplast genome of Phellodendron chinense (Rutaceae), an Endangered medicinal plant in southern China

Abstract Phellodendron chinense is an Endangered medicinal plant in southern China. In this study, the complete chloroplast genome sequence of P.chinense was characterized by de novo assembly. The length of the whole chloroplast genome was 158,537 bp, containing a large single copy region (LSC) of 86,250 bp and a small single copy region (SSC) of 18,287 bp, which were separated by a pair of 27,000 bp inverted repeat regions (IRs). The sequence contains 114 unique genes, including 30 tRNA, 4 rRNA, and 80 protein-coding genes. The overall GC content of the chloroplast genome is 38.4% and those in the LSC, SSC, and IR regions are 36.6, 33.2, and 42.9%, respectively. The phylogenetic analysis based on reported chloroplast sequences of Rutaceae showed that P. chinense is sister to P. amurense, consisting a monophyletic group, and that Phellodendron is closely related to Zanthoxylum.

Phellodendron is a small genus of aromatic deciduous trees in family Rutaceae distributed in eastern Asia. It consists of two recognized species, P. chinense Schneid. and P. amurense Rupr. (Ma et al. 2006). Phellodendron chinense is endemic to Anhui, Hubei, Hunan, Sichuan and Yunnan provinces, southern China, scattering in subtropical broad-leaved forests or mixed forests (Zhang et al. 2019). This species is best known for its bark (cortex) which was used as one of the most famous Chinese traditional medicines (Tang et al. 2016). Due to its high medicinal value, it was listed as one of the 237 species of national Endangered and key protected wild plants in China (Yu 1999). In the past decades, wild populations of P. chinense have declined significantly, suggesting that urgent conservation measures need to be taken (Shen et al. 2009;Tang et al. 2016;Zhang et al. 2019). In this study, we assembled the complete chloroplast genome of P. chinense using next-generation sequencing to provide a gene source for further genetic and conservation studies.
Fresh young leaves of P. chinense were collected from Tianquan County, Sichuan Province, China (N 29 59 0 19.61ʺ, E 102 38 0 34.13 00 ). Voucher specimen (collection numbers: BJFUZLC061) was deposited in the Herbarium of Beijing Forestry University (BJFC). The total genome DNA was extracted using a modified protocol (Chen et al. 2014) and sent to Majorbio (http://www.majorbio.com, China) for nextgeneration sequencing using Illumina Hiseq Xten. About 2.5 Gb high quality, 2 Â 150 bp pair-end reads were obtained from High-throughput sequencing. The chloroplast genome of P. amurense (Genbank accession no. KY707335.1) was used as a reference to exclude nuclear and mitochondrial reads by Geneious Prime 2020.2 (Kearse et al. 2012). Filtered chloroplast reads were exploited for de novo assembly with Geneious Prime. The assembled chloroplast sequences of P. chinense were then annotated using Plann (Huang and Cronk 2015) by referring to the relative group.
In order to understand the phylogenetic status of P. chinense within Rutaceae, a phylogenetic analysis was performed based on complete chloroplast genome sequences of 18 other species of the family. These sequences were obtained from NCBI and Canarium album (Burseraceae) was used as the outgroup. All sequences were aligned, and a phylogenetic tree was constructed using the maximum likelihood method by MAGE-X (Kumar et al. 2018;Katoh et al. 2019). Bootstraps were calculated for 1000 replicates to comfirm the bootstrap value of each node, and the Tamura-Ne model operation was selected to obtain the tree (Figure 1). The phylogenetic results showed that P. chinense is sister to P. amurense, consisting a monophyletic group. Then Phellodendron is closely related to Zanthoxylum, which is consistent with the phylogenetic relationship between the two genera based on plastid and nuclear markers (Poon et al. 2007;Appelhans et al. 2018).

Disclosure statement
No potential conflict of interest was reported by the author(s).

Data availability statement
The data that support the findings of this study are openly available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov, reference number MT916287.