The complete mitochondrial genome of the stink bug Eocanthecona furcellata (Hemiptera: Pentatomidae)

Abstract The predatory stink bug Eocanthecona furcellata belongs to the subfamily Asopinae of Pentatomidae. In the current study, the complete mitochondrial genome of E. furcellata is determined. This mitogenome is 16,085 bp in size and comprises of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a control region. Gene order is identical to that of the putative ancestral arrangement of insects. Nucleotide composition is biased toward A and T, which together made up 75.5% of the entire genome. All tRNAs have the clover-leaf structure except for the tRNASer(AGN) and the length of them ranges from 61 to 73 bp. The monophyly of Pentatomidae is highly supported by the phylogenetic tree and E. furcellata is very close to other carnivorous species of the remaining Pentatomidae species.

Eocanthecona Bergroth, 1915 belongs to the family Pentatomidae, a small genus with 20 known species in the world and 11 species in China (Zhao 2013). Eocanthecona furcellata (Wolff, 1801), as a common predator in south China, was massively reared in China as a biological agent. Widespread species occur from India and Sri Lanka through China and southeast Asia to Japan, the Philippines, and Indonesia (Rider and Zheng 2002). In this study, the complete mitochondrial genome of E. furcellata was sequenced and described. Adult specimens were collected from Heyuan city (24 6 0 30 00 N; 114 4 0 39 00 E) of Guangdong Province in China in 2021. Specimens were deposited in the Natural Enemy Insects Herbarium (accession number: GDPPRI-NI2021-22) of the Plant Protection Research Institute Guangdong Academy of Agricultural Sciences (GDPPRI) (Yi Guo, guoyi@gdaas.cn, Room 111, Plant Protection Building).
The total genomic DNA was extracted from the whole body of the specimen using the DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany) and stored at À20 C until needed. The mitogenome was sequenced in BerryGenomics company that used NGS. One microgram of genomic DNA was used to generate libraries with an average insert size of 350 bp, which were sequenced using the Illumina HiSeq S6000 (San Diego, CA) with 150 bp paired-end reads on one sample per flowcell lane. A total of 17,687,046 raw paired reads were generated. The quality of all sequences was checked using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc). Clean reads were assembled and annotated using the MitoZ v2.4 pipeline (Meng et al. 2019).
The complete mitogenome of E. furcellata is 16,085 bp in size (GenBank accession number: MZ440302) including 37 typical insect mitochondrial genes (13 protein-coding genes, 22 transfer RNA genes, and two ribosomal RNA genes) and a control region. Gene order is identical to that of the putative ancestral arrangement of insects (Cameron 2014;Xu et al. 2019;Wang et al. 2021). The nucleotide composition of the mitogenome is biased toward A and T, with 75.5% of A þ T content (A ¼ 41.5%, T ¼ 34.0%, C ¼ 14.2%, and G ¼ 10.3%). The AT-skew is positive (0.10) whereas GC-skew is negative (-0.16). Nine PCGs (COII, COIII, ATP6, ND3, ND4, ND5, ND4L, ND6, and CYTB) initiate with ATN codons, three PCGs (COI, ATP8, and ND1) initiate with TTG codons, and one PCG (ND2) initiates with TTA codon. The stop codons TAA and TAG are assigned to 11 and one protein-coding genes, respectively, whereas the TA residue is used by ND5 as incomplete stop codon which is commonly in Heteroptera mitogenomes (Wang et al. 2017(Wang et al. , 2021. There are 22 tRNA genes, ranging from 61 to 73 bp in length, and all of them can be folded into typical clover-leaf secondary structure except for tRNA Ser(AGN) , the dihydrouridine (DHU) arm of which forms a loop, as is the case with most other insects (Jiang et al. 2016;Wu et al. 2020;Wang et al. 2021). The length of IrRNA and srRNA is 1340 bp and 790 bp, respectively. The A þ T content of IrRNA and srRNA is 78.7% and 77.0%, respectively. The control region is located between srRNA and tRNA IIe , which is 1398 bp in length with an A þ T content of 79.9%.
Maximum-likelihood (ML) tree was constructed based on sequences of 13 protein-coding genes and two rRNA genes from 13 species of different families and two outgroups by IQ-TREE 2.0.6 (Bui et al. 2020) under the GTR þ IþG model (Figure 1). Each family showed a monophyletic cluster. The monophyly of the Pentatomidae was highly supported in this phylogenetic analysis, and predatory species were evolved from herbivorous species in the family Pentatomidae, which is also recovered in previous study (Wang et al. 2021). The complete mitogenome of E. furcellata could provide the molecular genetic markers for the further phylogenetic analysis in Pentatomidae.

Disclosure statement
All authors have read and approved the final manuscript. The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper.

Data availability statement
The genome sequence data that support the findings of this study are openly available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov/ under the accession no. MZ440302. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA740352, SRR14901855, and SAMN19842624, respectively.