The complete chloroplast genome of Ananas comosus var. erectifolius (L.B. Smith) Coppens & Leal

Abstract Ananas comosus var. erectifolius (L.B. Smith) Coppens & Leal, a tropical plant from Bromeliaceae family, has immense applications, especially for fiber production of excellent quality. The lack of available chloroplast (cp) genome information limits its breeding and application. Here, we assembled its complete cp genome using Illumina high-throughput sequencing technology. The cp genome size is 159,983 bp, with 37.4% GC content, including a large single copy region (LSC) of 87,787 bp, a small single copy region (SSC) of 18,606 bp, and a pair of inverted repeat regions (IRs) of 26,795 bp. It encodes 89 protein-coding, 38 tRNA and 8 rRNA genes. Phylogenetic analysis showed that A. comosus var. erectifolius was close to Ananas comosus. The complete cp genome sequences could provide valuable information for variety breeding and genetic analysis of agronomic and economic traits in A. comosus var. erectifolius.

Ananas comosus var. erectifolius (L.B. Smith) Coppens & Leal, popularly known as curau a, is a plant species from the Bromeliaceae family in the order of Poales (Baker and Collins 1939). It is native to the Amazon region and distributed in wide parts of the world. Fiber produced from its leaves is one of the most resilient plant fibers and largely used in leather, synthetic and automobile industry (Mishra et al. 2004). This species is an environmentally friendly and biodegradable natural resource for industrial use, and also has significant social roles by using manpower of low-income populations (Barbosa et al. 2015). Its smooth and erect leaves are probably owing to the long term artificial selection for an abundance of long easily-extractable fibers. Apart from being a medicinal plant, the inflorescences of A. comosus var. erectifolius also present values for the production of cut-flowers and landscape-gardening (Leão et al. 2009;Teixeira et al. 2019;Manigandan et al. 2021). As a closely related plant species of pineapple, its special traits that are responsible for the valuable fiber production, like the dense, erect and smooth leaves, have the potential to be transferred into the commercial cultivars of pineapple through inter-specific hybridization, which might be helpful for enhancing the comprehensive utilization value of pineapple (Jawaid et al. 2020). Moreover, owing to the absence of spines and purplish leaves of A. comosus var. erectifolius, it can also be used as parent in the genetic population construction for the location of the spine/ color-related genes. For its extensive application in the industry and other field, large amount of researches were aimed to improving the quality and production of A. comosus var.
erectifolius (Pereira et al. 2008;Moreira et al. 2016). Organelle genomes have been commonly utilized in the researches on taxonomy, phylogeny and evolution (Yang et al. 2019;Wang et al. 2020). However, little is known about the chloroplast genome of A. comosus var. erectifolius, which affect its application and breeding processes. In this study, the complete chloroplast genome of A. comosus var. erectifolius was characterized and the phylogenetic relationships with other related species were analyzed, which were helpful for variety breeding and genetic analysis of agronomic and economic traits in A. comosus var. erectifolius.
The fresh leaves of A. comosus var. erectifolius were collected from Horticultural Germplasm Conservation Center of South China Agricultural University for breeding and research in Guangzhou, Guangdong Province, China (113 22'4 00 N, 23 9'5 00 E). A specimen was deposited at South China Agricultural University (Chaoyang Liu, email: chyliu5@163.com) under the voucher number SCAU20190725001. The total genomic DNA was extracted from about 100 mg fresh leaves using a modified CTAB method (Doyle and Doyle 1987). Paired-end Libraries with an average length of 350 bp were constructed by using the Nextera XT DNA Library Preparation Kit (Illumina, San Diego, CA), and then the libraries were sequenced on Illumina Novaseq 6000 platform (Shenzhen Huitong Biotechnology Co. Ltd). In total, 16,822,772 raw reads were obtained and edited by using NGS QC Tool kit (Patel and Jain 2012), 4.99 Gb clean data was assembled by using the de novo assembler SPAdes v.3.11.0 software, with a k-mer set of 93, 95, 97, 103, 105, 107 and 115 (Bankevich et al. 2012). The chloroplast genome of Ananas comosus (Accession number: AP014632.1) was used as the reference. Finally, the assembled complete cp genome was annotated by via PGA software (Qu et al. 2019). The complete chloroplast genome was submitted to GenBank with Accession no. MZ457322.
Phylogenetic analysis was performed based on complete cp genomes of A. comosus var. erectifolius and other 20 related species reported in the Commelinids clade, Arabidopsis thaliana as outgroup. The chloroplast genome sequences were aligned with MAFFT 7.407 with default parameters (Nakamura et al. 2018), and then the maximumlikelihood tree was constructed by IQ-TREE 1.6.12 with the parameter (-bb 1000) (Nguyen et al. 2015). All the Ananas species formed a monophyly. The A. comosus var. erectifolius was a basal species in the Ananas genus (Figure 1). In conclusion, the characterized cp genome sequence of A. comosus var. erectifolius provides a useful genetic resource for future phylogenetic identification and analysis of important agronomic and economic traits.

Disclosure statement
No potential conflict of interest was reported by the authors.

Ethics statement
Ananas comosus var. erectifolius (L.B. Smith) Coppens & Leal is a cultivated plant species, and is not listed in the appendices I, II and III of the Convention on the Trade in Endangered Species of Wild Fauna and Flora, that has been valid from 22 June 2021 (https://cites.org/eng/app/ appendices.php). No approvals were required for this study, which complied with all relevant regulations.