The complete chloroplast genome sequence of Angelica laevigata Fisch.

Abstract
Angelica laevigata (Fisch 1812) is an important medicinal plant with a rich chemical composition. Here, we report the complete chloroplast genome sequence of A. laevigata. The genome is 146,161 bp long and comprises a large single-copy region of 93,538 bp and a small single-copy region of 17,779 bp, separated by two inverted repeats of 17,422 bp each. A total of 128 genes were identified, including 87 protein-coding genes, 33 tRNA genes, and 8 rRNA genes. Phylogenetic analysis suggests that A. laevigata is closely related to Angelica laxifoliata, also of the Umbelliferae family.

Angelica laevigata (Fisch 1812) is a perennial plant belonging to the Umbelliferae family. It is native to Northeast China, Mongolia, the Russian Far East, and the Korean Peninsula. It contains a variety of volatile compounds and a small number of coumarins, which make it an important medicinal plant (Suleimen et al. 2014). To date, most studies on this species have concentrated on describing its morphological variation and chemical composition, while molecular and evolutionary research on A. laevigata is lacking (Liao et al. 2013). In the current work, we provide the complete chloroplast genome sequence of A. laevigata, which is needed to understand its phylogenetic relationships with other Umbelliferae species.
Fresh green leaves of A. laevigata were collected from Yichun, Heilongjiang Province, China (N 48°22′55″, E 129°27′95″). The plant material and voucher herbarium specimen were stored at the Pharmacy College of Heilongjiang University of Chinese Medicine (http://yxy.hljucm.net/, Weichao Ren, lzyrenweichao@126.com) under the voucher number YCL20200531T7. Total genomic DNA, comprising the nuclear and organelle genomes, was extracted using the CTAB method (Doyle 1987; Yang et al. 2014) and stored at Heilongjiang University of Chinese Medicine, Harbin, China. DNA that passed quality control was used to construct a 150 bp paired-end library, which was sequenced on the Illumina NovaSeq high-throughput sequencing platform (Benagen, Wuhan, China). The raw data were filtered using SOAPnuke (version 1.3.0) (Chen et al. 2018). Subsequently, 4.5 Gb of clean data were assembled de novo into circular contigs with SPAdes (version 3.13.0) (Bankevich et al. 2012). PGA (version 1) (Qu et al. 2019) was used to annotate the chloroplast genome and to predict protein-coding genes, transfer RNA (tRNA) genes, and ribosomal RNA (rRNA) genes. The complete chloroplast genome sequence, together with its gene annotation, has been submitted to GenBank under the accession number MW696157.
The chloroplast genome of A. laevigata has a total length of 146,161 bp. It shows the typical quadripartite structure: a pair of inverted repeats (17,422 bp each) separated by a small single-copy region of 17,779 bp and a large single-copy region of 93,538 bp. The genome contains 128 genes, including 87 protein-coding genes, 8 rRNA genes, and 33 tRNA genes.
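The reported region lengths and gene counts can be cross-checked with simple arithmetic. The sketch below is only an illustrative consistency check, not part of the authors' pipeline; the function name `quadripartite_length` and the variable names are assumptions introduced here, while the numbers are taken from the text.

```python
# Region lengths in bp, as reported in the text.
LSC = 93_538           # large single-copy region
SSC = 17_779           # small single-copy region
IR = 17_422            # one inverted repeat; the genome carries two copies
REPORTED_TOTAL = 146_161


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
assert total == REPORTED_TOTAL, "region lengths do not sum to the reported total"

# Gene counts by category, as reported in the text.
gene_counts = {"protein_coding": 87, "tRNA": 33, "rRNA": 8}
assert sum(gene_counts.values()) == 128, "gene categories do not sum to 128"

# Share of the genome occupied by each region type.
for name, length in (("LSC", LSC), ("SSC", SSC), ("IR (both copies)", 2 * IR)):
    print(f"{name}: {length:,} bp ({length / total:.1%})")
```

Both assertions pass with the published figures, so the region lengths and the gene tally are internally consistent.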
To clarify the phylogenetic position of A. laevigata, the complete chloroplast genome sequences of A. laevigata and 15 related species were collected, and the chloroplast genome sequence of Arabidopsis thaliana was used as the outgroup. All sequences were aligned using MAFFT (version 7.307) (Katoh and Standley 2013), and a neighbor-joining (NJ) phylogenetic tree was then constructed in MEGA 7.0 (Kumar et al. 2016) with 1000 bootstrap replicates. The NJ tree showed that A. laevigata is closely related to Angelica laxifoliata, another member of the Umbelliferae family (Figure 1). The chloroplast genome reported here provides useful data for phylogenetic and evolutionary studies in Umbelliferae.

MITOCHONDRIAL DNA PART B 2021, VOL. 6, NO. 9, 2496-2497
https://doi.org/10.1080/23802359.2021

Disclosure statement
No potential conflict of interest was reported by the author(s).

Funding
This study was supported by the Key Project at Central Government Level, the Ability Establishment of Sustainable use for Valuable Chinese Medicine Resources [2060302].

Data availability statement
The genome sequence data that support the findings of this study are openly available in NCBI GenBank (https://www.ncbi.nlm.nih.gov/) under the accession no. MW696157. The associated BioProject, SRA, and BioSample numbers are PRJNA733316, SRR14723676, and SAMN19374476, respectively (https://www.ncbi.nlm.nih.gov/sra/SRR14723676).
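For readers who want to retrieve the deposited record programmatically, one possible route is the NCBI E-utilities `efetch` endpoint. The sketch below only builds the request URL and makes no network call; the helper name `efetch_url` is hypothetical, and the choice of the `nuccore` database with the `gb` return type is an assumption about how one might fetch this accession, not a procedure described by the authors.

```python
from urllib.parse import urlencode

# Base URL of the NCBI E-utilities efetch service.
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_url(accession: str, rettype: str = "gb") -> str:
    """Build an efetch URL for a nucleotide record (hypothetical helper).

    rettype="gb" requests a GenBank flat file; rettype="fasta" requests
    the sequence alone.
    """
    params = {
        "db": "nuccore",      # nucleotide database (assumed appropriate here)
        "id": accession,
        "rettype": rettype,
        "retmode": "text",
    }
    return f"{EFETCH}?{urlencode(params)}"


url = efetch_url("MW696157")
print(url)
```

The resulting URL can be opened in a browser or passed to `urllib.request.urlopen`; NCBI asks that automated clients keep their request rate low.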