The Ecuadorian Microbiome Project: a plea to strengthen microbial genomic research

ABSTRACT A technological revolution provides nations with access to unprecedented quantities of molecular information, and this is particularly evident in the vast and yet poorly understood realm of the microbiome. Traditionally, many developing countries in Asia, Africa, and South America remain marginal participants in the global flow of biodata, which will eventually affect their productivity and economies. Here, we present the Ecuadorian Microbiome Project (EcuMP) as an integrative initiative to close the research gap in the microbiome for Ecuador. We discuss the relevance that the study of the microbiome has for our understanding of diversity and new forms of production and biocapital. We also evaluate the state of research in metagenomics and the microbiome for South America, with emphasis on Ecuador as a small but biodiverse country. In the strict sense of access, understanding, and technological innovation based on molecular data, we propose the definition of bioliteracy. As indirect estimates of bioliteracy, we measured the number of indexed publications, BioProjects, monthly global internet traffic to GenBank, and patent applications in Espacenet. South America has a notable unevenness in scientific productivity related to the microbiome and metagenomics. Brazil leads productivity, with most of the measured parameters remaining one order of magnitude higher than other countries in the region. Participation of South American countries in the global flow of genomic information dwarfs when compared to the US. To reduce the effects of technological dependency and the associated lack of economic productivity, Ecuador should address the technological gap in the study of the microbiome. Our assessment reveals the urgency to translate the study of microbiomes into a source of technological prowess and the basis for local biocapitals.


The unseen biodiversity and its relevance to bioliteracy and biocapital
The estimated number of microorganisms on the planet (i.e. 5 × 10 30 -10 × 10 30 ) [1,2] is larger than the number of stars in the known universe (1 × 10 24 ) [3]. Despite representing up to 19% of Earth's biomass or between 10 16 and 10 17 g, our knowledge of this vast component of biological diversity is still wanting [4]. Some have suggested that at least half of the planet's biomass belongs to the microbial realm [1,5]. Access to genomic information has become increasingly easier due to the availability of powerful technologies, thus resulting in the discovery of a new realm of cultureindependent lineages. These poorly known lineages are often known as the "rare biosphere" or the "dark matter" of microbial diversity [6]. Bacterial lineages without culture isolates consist of most of life's diversity and some may even represent a new domain of life [7]. This occult and unexplored diversity, which is present in natural environments, in the soil, and water can be a source of natural products with agricultural and biotechnological applications [8]. Thus, the importance to know the microbial universe has become essential to the development of areas such as medicine, agriculture, industry, chemical sciences, and others [9]. This is what we herein call the unseen biodiversity, which holds special relevance to bioliteracy and biocapital [10,11].
Bioliteracy is a term originally coined by Daniel Janzen to express his concern for the accelerated decline of humanity's knowledge and attention for biodiversity, especially circumscribed to the Barcoding of Life Initiative [11,12]. Janzen's proposal has been used to denounce our ignorance of nature, in that we only have cataloged a small proportion of the species on Earth [13]. However, some have challenged Janzen's proposal that the public is "bioilliterate" [13]; particularly from the point of view of the social sciences that propose non-technological forms in which nature and society establish cultural connections [14]. Within the discussion of technological innovation, Enriquez and Martinez [15] have elaborated on the concept of bioliteracy, to stress the importance of understanding and using the molecular resources of biodiversity for new forms of biocapital. It is on this last use of the concept "bioliteracy" that we develop our discussion.
Metagenomic information is an important source of big data for the biological sciences [16]. The capacity to produce, access, and use molecular information has been associated with the level of social and economic development of nations [17], and their possibility for technological innovation [15,18,19]. In a pioneering study, with a methodological approach that has not been replicated to date, Enriquez and Martínez [15] assessed the amount of biodata flow across countries during a period of six months by measuring the number of bytes that were downloaded from GenBank, EMBL (European Molecular Biology Laboratory) and DDBJ (DNA Data Bank of Japan). They showed a significant gap that separated developed nations from regions such as South America. Nearly 92% of all information downloaded from molecular data banks reached the US and nine countries in Europe; with less than 1% being transferred to countries in Asia, Africa and South America. According to Enriquez and Martínez [15], bioliteracy is a necessary condition for new forms of economical development such as a bio-based economy. More recently, Xia and Liu [17] defined the difference between developing and developed countries in the usage patterns of genomic data as a "divide", with an "overwhelming silence" in the academic community of "third-world countries".
The establishment and continuous improvement of new technologies that allow the production of genomic information, and its direct application to "profit-oriented enterprise", "product-making" and "profit-seeking" has led to new concepts in economic theory, such as "biocapital" [10]. Biocapital refers to all biological resources, especially molecules and genomes, that can be transformed into informatic commodities, and that are used to generate value, profit, wealth, and new markets [20]. The biocapital paradigm has been applied to the realm of the microbiome, where genomics has made important progress in the design of "microbial factories" to produce drug precursors, chemical compounds, and plastic monomers [21][22][23]. However, some believe that the biocapital will have far-reaching effects on current global production systems [24]. It is therefore important for developing countries to consider their role in the global production of information related to what can be considered as bioinformatic commodities [25].

Microbial genomic research in Latin America
South American countries such as Argentina, Brazil, and Venezuela have joined The Earth Microbiome Project (EMP) ( [26][27][28]. The EMP (www.earthmicro biome.org) is a multidisciplinary initiative that proposes to survey microbial communities at a planetary scale. Another important global initiative is The Terragenome Project, which is an International Soil Metagenome Sequencing Consortium that seeks to complete the sequencing of soil metagenome at a planetary scale [29]. Ecuador has no current participation as an active and contributing member in either of the previously mentioned projects. Ecuador and other countries in South America, with their distinctive geographic and biological diversity, must get involved in these global projects. Refraining from participating contributes to the technological gap that affects their social and economic development as nations.
In an assessment of the status and development of microbiome research in Brazil, Pylro and Roesch [30] established that it has been for the most part neglected, and that research in the country is scattered among numerous small research groups [30]. They highlight biodiversity indexes that place Brazil as one of the most biodiverse countries in the world but remark that microbial diversity is consistently neglected. Recognizing microbes as a central resource for Brazil's technological and economic development, Pylro and Roesch made a plead to strengthen and unify efforts across the country [30]. Two such efforts have been the creation of the Brazilian Microbiome Project (BMP) and the Brazilian Institute of Science and Technology for Microbiome Studies (INCT), which strive to organize microbiome research at the national level and to integrate with international initiatives. As Brazil, Ecuador is also a country that leads global biodiversity lists and is considered a hotspot of biodiversity [31]; however, its contributions to the study of microbial diversity are considerably smaller [32]. To reduce the scientific and technological gap in the study of microbiomes in Ecuador, the authors of the present study have established an interinstitutional research group called "The Ecuadorian Microbiome Project" (EcuMP). This is a budding effort to survey the unseen diversity in Ecuador's bacterial ecosystems.
The objective of this study was to analyze the current state of Ecuador's research in metagenomics and the microbiome compared to all countries in South America. Herein, we discuss the research progress in Ecuador, the potential contribution made by the EcuMP, and the implications that Ecuador's current state of development in metagenomics and microbiome research has for technological biocapital and bioliteracy.

Methods
We evaluated the state of research in metagenomics and the microbiome for all countries or territories in the South American continent through the PubMed Advanced Search Builder (ASB). The countries considered for analysis were Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Paraguay, Peru, Surinam, Uruguay, and Venezuela. We also included French Guiana as a territory that is part of the South American continent. PubMed is the search engine for the MEDLINE database and is also part of the Entrez database system [33]. The MEDLINE database includes 200 journals that belong to the regional index Latindex. To include all possible adjectival and demonymic forms for countries, only the root of the country name was used and followed by the wildcard "*". The search keywords "Ecuador*" and "metagenom*" were applied to all fields of the ASB to assess the number of publications including these words. To provide a comparison basis for Ecuador, a similar search was done for all thirteen countries and territories in the South American continent (e.g. "Argentin*" and "metagenom*"). A similar search was made for studies on the microbiome by using the search term "microbiome*".
A subsequent advanced Entrez text search in the BioProjects database in GenBank was used to determine the number of matches for the words "Ecuador*", "Argentin*", "Venezuela*", "Brazil*" and the rest of the countries and territories in South America within the "Description" field. The search results in the Entrez system provided a classification within taxonomical groups and research methods, which included the categories "bacteria" and "metagenomes" correspondingly; the reported frequencies or counts in both categories were used as the focus of our analysis. We present the potential limitations of this search approach in the discussion section.
To obtain a rough and indirect estimate of monthly access rates to the NCBI portal (ncbi.nlm. nih.gov), we used the online intelligence tool SimilarWeb [34]. Estimates were obtained between May and July 2018. SimilarWeb is a commonly used traffic analysis tool to perform research and generate insights on estimated online traffic, and it relies on over 50 million users and 1.1 billion records of website visits to infer statistics [35][36][37]. The rough estimate provided by our traffic measurement approach to the NCBI portal could be used as an indirect metric of bioliteracy (sensu Enriquez & Martínez [15]) or technology transfer (sensu Wang et al. [38]) and an assessment of global participation in the microbiome and metagenomics research. All previously described searches were made on January 16th, 2019.
As an indirect tool to measure internet traffic, caution is required when working with SimilarWeb as it may provide broad estimates only [39,40] and should not be used as a replacement to server monitoring services and log file analyzers when server access is available [41]. However, on servers that receive frequent daily visits by specific countries, such as NCBI and the US, and where large contrasts may be available for other regions of the world, such as Ecuador, results obtained by tools such as SimilarWeb are valuable references that provide a preliminary sense of how different countries may be using resources and data [42].
Metagenomics is a tool for the discovery of new molecules with diverse functions to industry [43] and therefore a potential source of innovation and patenting processes as precursors of biocapital [44]. Patent production does not relate linearly to scientific research [45], but the former is a reliable indicator of the potential that societies have for innovation and the creation of new knowledge [- [46][47][48]. To assess the participation of South America in patent creation, we used the advanced search engine available in Espacenet (European Patent Office EPO, worldwide.espacenet.com) [49]. The search, in both title and abstract, was based on the keywords "metagenomic*", "microbiome", and "bacteria", and were individually combined with the name of the applicant country. Espacenet has worldwide coverage and offers information on inventions and technical development from 1836 to the present, with daily coverage. As applicants often file patent applications for the same invention to multiple patent offices, Espacenet consolidates multiple sources and provides access to reference documentation. For comparison of South America as a whole, we also included metrics for the United States as a leading country in the production and use of information related to the microbiome. Ecuador is part of those jurisdictions around the world where organisms are not patentable; however, our search intended to find those technological products that are derived from the study of organisms, such as new methods and processes or synthetic molecules.
To understand the content of the publications found for Ecuador, research trends in the country, and their significance to environmental microbial biodiversity, we applied language analysis. First, publications found in the ASB search were categorized into Scopus subject areas. Second, a word cloud analysis provided a visualization of the most frequent words and a description of the essential information produced for Ecuador. Abstracts of the publications in the four most frequent Scopus categories (i.e. molecular biology, agricultural and biological sciences, immunology and microbiology, and environmental science) were used as the source data for word clouds. The commonality word cloud depicted the most frequent 30 words in a corpus of publications across Scopus categories. The comparison word cloud contrasted word frequencies across documents and provided a depiction of the first 120 words that contributed the most to differentiate among Scopus categories [50]. The word cloud analysis was implemented through the "tm" [51] and "wordcloud" [50] packages in R [52]. For those cases in which Scopus provided more than one category, the authors selected the most appropriate one according to manuscript content and used it for the comparison cloud. Database searches included all available records with no restrictions to time periods. A glossary of terms, with core concepts used in this manuscript, can be found as supplemental information (SI -Glossary of terms).

Comparative analysis of research productivity and publications
A total of fourteen indexed publications were found for Ecuador in the ASB at PubMed for research in metagenomics. Of these, twelve were research-based studies and two reviews. Eleven belonged to studies based on samples obtained from Ecuador's territory. Three studies were based on research exclusively developed in Ecuadorian territory, while most of the research activities, laboratory procedures, and data analysis for the remaining eleven studies were conducted abroad ( Figure 1). Of these fourteen publications, nine included an author who belongs to an Ecuadorian institution, and five were published exclusively by foreign institutions. Only two had an Ecuadorian researcher as the first author, and three included an Ecuadorian last author. The fourteen publications found through the search included ten with an explicit metagenomic application in the methods and results sections and four that did not include metagenomic approaches or methods ( Figure 1).
A total of fifteen indexed publications concerning the microbiome were found for Ecuador in the ASB at PubMed: however, three were already included in the metagenomics group and were not considered for the following tally. Three of the twelve publications for the microbiome were reviews and the remaining nine were research-based studies. Four publications included samples from Ecuadorian territory and not one included research done in Ecuador. Nine included an author representing an Ecuadorian institution, six had an Ecuadorian first author, and two had an Ecuadorian last author ( Figure 2).
Both metagenomics and microbiome publications were included in eight Scopus categories, the most frequent being molecular biology, agricultural and biological sciences, and immunology and microbiology. Environmental sciences, a category that should include most of the studies related to microbial ecology, was the fourth most frequent. A single publication was found for the categories of chemical engineering, neuroscience, and chemistry ( Figure 3a).
The comparison cloud places heavier weights (i.e. larger words) on those words that contribute the most to differentiate among categories; therefore, the following description is an overview of the central content of the publications on metagenomics and the microbiome for Ecuador (Figure 3b) within each Scopus category. The molecular biology category is characterized by studies related to the effect of antibiotics in the microbiome of rat guts and the microbiome associated with the Leptospira infectious cycle. The agricultural and biological sciences category is distinguished by studies on shrimp microbiota (in which the CKC4 bacterial lineage is studied), and the cassava fermentation process that is involved in the production of the indigenous chicha drink. For the immunology and microbiology category, the differentiating trait of the publications is the characterization of microbial communities through metagenomic applications with interest to human and environmental health, such as those associated with the metabolism of hydrocarbons (e.g. toluene). Finally, for the environmental science category, the distinctive concepts are for antibiotic resistance in poultry (including the presence of antibiotic resistance genes (ARGs), and mobile genetic elements (MGEs)), the virome of urban streams affected by fecal contamination, and the distribution of dangerous bacteria to amphibians (Figure 3b). The commonality cloud shows that the publications for Ecuador are often related to the analysis of the microbiota through the sequencing of the 16S rRNA. The composition and diversity of bacterial communities, in a human or environmental context, is often the focus of these studies (Figure 3c). A detailed list of the publications considered in this analysis, including objectives, organisms studied, and main results are included in the supplemental information (SI - Table 1). In a regional context, Brazil was the most productive with 375 publications related to the microbiome, followed by Chile (62), Argentina (59), Peru (51), Colombia (42), Venezuela (16), and Ecuador (15). The latter was only more productive when compared to Uruguay and Bolivia with three and four publications each. Brazil led the list of countries in South America with the most publications on metagenomics, totaling 409. Chile was second with 80 publications, followed by Argentina (64) and Peru (64), which together occurred above the third quartile. Both Ecuador and Venezuela were close to the median with 14 and 12 publications respectively. The rest of the countries in South America remained below the first quartile with 8 publications or less (Figure 4a).
The results of the Entrez text search in the BioProjects database showed that Brazil leads the production of metagenomic data with a total of 57 records. Colombia, Chile, and Argentina remain on or close to the third quartile. Venezuela, French Guiana, and Peru remain close to the median. The rest of the countries, including Ecuador, have three or fewer records and remain on or below the lower quartile ( Figure 4b). A similar search for the United States (US) revealed a difference by two orders of magnitude with Brazil, with a total of 3,519 BioProjects associated with the metagenome.
The same Entrez search provided how many projects are related to the domain Bacteria as an organism group. Brazil leads the list with 472 BioProjects in Bacteria, followed by Colombia (181) and Chile (82) and all are above the third quartile. The distribution of BioProjects in Bacteria, as the former two cases (Figure 4a and Figure 4b) is highly skewed to the left, where Argentina and Peru, with 70 and 67 records respectively, represent the median and are followed by Uruguay with 16 records. Ecuador, with 12 records, represents the first quartile and the remaining countries have nine or fewer records (Figure 4c). The difference between Brazil and the US is minor in this case, the latter having a total of 339 BioProjects, slightly less than the former.
Access to NCBI is an order of magnitude larger for Brazil when compared with the rest of South American countries, with over 3% of the global traffic share. Ecuador remains close to the median and leads the tail of countries with less than 0.1% of global participation in accessing the information on NCBI ( Figure 5). The United States is an order of magnitude larger than Brazil, with 33.53% of the total share of monthly traffic to NCBI. No country in South America registered patent applications with words related to metagenomics or the microbiome. Only Chile (17), Brazil (7), and Colombia (4) registered patent applications that included the keyword bacteria. There is a stark difference when South America is compared to the US. The latter has 828 patent applications related to bacteria, 7 to metagenomics, and 3 to the microbiome. A worldwide search revealed 159 applications with words related to metagenomics.

Research production and publications
The present study presents an analysis of scientific productivity by South American countries in both the microbiome and metagenomic fields. From the assessment of the evidence available in international public databases, such as PubMed, Entrez, and Espacenet, it is evident that South America suffers from a pronounced unevenness, with Brazil as the single most important center of research in the region. As has been hinted in the present study, when metrics for the US are included in the comparisons, the gap in knowledge production increases to an order of magnitude larger than Brazil, the latter as the leading country in the South American region.
The role of Chile, as the leading nation in the region for patents on bacteria, may respond to processes within the country that are not necessarily related to the measurements of scientific productivity we used in this study (e.g. number of publications); particularly, since there are very small numbers to compare with for other countries in South America, we may see this extreme value for Chile as the result of unique circumstances that provide a remarkable difference to the overall tendencies of countries within the region. This is particularly true if we consider the non-linearity by which patents may be produced and that could be decoupled from the processes and conditions involved in the production of scientific literature and academic knowledge [45,53]. What drives Chile as the leading nation in the region for patents on bacteria requires analyses and answers that remain beyond the scope of the current study. Further exploration is needed to compare Chile's drivers for innovation in the microbiological field with the conditions for other countries in South America; especially as the innovation environments and incentives do not necessarily perform similarly across countries and societies [54]. How countries produce and file patents may depend on numerous and varied factors, from macroeconomic processes by which industry and academy are financed and taxed, including lines of investment and capital risk, to micromanagement factors such as patent filing procedures and bureaucratic requirements [55]. Cultural values, human capital, trust, infrastructure, and laws are also relevant factors involved in the production of patents and other forms of innovation [46,48]. Our measurement of patents is basic and serves only as a very preliminary insight into a more complex system, which requires measurement and explanation.
Of relevance is the extremely limited participation of South America in the data transfer to NCBI, where the region essentially remains secluded from the global flow of genomic information. This agrees with the previous findings by Enriquez and Martínez [15] and Xia and Liu [17]. However, as we did not have access to the NCBI servers for direct measurement of traffic, our indirect approach is only approximate, limited in its scope, and cannot be strictly compared to dedicated software installed at host servers for traffic monitoring [39,40]. The very small number of patent applications for the region may be a symptom of the necessity for new strategies to strengthen bioliteracy and research on the microbiome and its associated genomes.

Research trends in Ecuador
The studies in Ecuador on metagenomics and the microbiome are mostly in aspects related to general molecular biology applications, agricultural sciences, public health, and to a lesser extent in environmental sciences. For a matrix detailing the assessed publications, please refer to SI - Table 1. Studies related to direct applications in conservation, such as the study of water quality or pathogens affecting amphibian populations (SI -Table 1), are infrequent as most have applied interests to public health (gut microbiome, hydrocarbon degradation) and industrial productivity (e.g. poultry or shrimp). Within the reduced corpus of publications for Ecuador, there is a clear lack of efforts in the field of microbial ecology through powerful methods such as metagenomics. No studies on soil metagenomic diversity have been published for Ecuador (SI - Table 1).
The Ecuadorian Microbiome Project started in 2016, with the ambitious goal to catalog the bacterial microbial diversity in Ecuador. The EcuMP follows the methods proposed by the Earth Microbiome Project (www.earthmicrobiome.org/pro tocols-and-standards/) [27] and minor modifications to the 16S Illumina amplicon protocol for samples from extreme environments [56]. Metagenomic sequencing has been done exclusively on the MiSeq Illumina platform. The application of these standardized methods for data acquisition and analysis allows the comparison of Ecuadorian bacterial microbiomes to others from around the world. For all samples, we also record physicochemical variables, which provide a basic ecological framework to understand the causes or effects of microbial diversity (e.g. electrical conductivity, content of organic matter, humidity, cationic exchange, dissolved oxygen, pH, temperature, and presence of P, N, Ca, Mg, SO 4 ). We encourage the standardization of the methods used in the EcuMP to coordinate complementary efforts by other research groups in the country and contribute to a catalog of Ecuadorian microbiomes.
The EcuMP has collected and processed bacterial microbiome samples from over 1,000 sites, including tropical rainforest affected by different intensities of human activity, cloud forest, dry forest, Andean-Choco forest, and extreme environments including hot springs, Andean volcanoes and glaciers, and deep-sea muds. Immediately after collection, the total DNA for each sample was isolated. Metagenomic sequences are available for 500 samples for a total of 24,585,878 giga base pairs, 81,680,662 sequences, and 2.72 Terabytes of data. For taxonomic assignment, biodiversity, and phylogenetic analysis we currently use bioinformatic tools, including dada2 [57], QIIME 1 [58], QIIME 2 [59], and Mothur [60]. Physicochemical data is used for numerical ecology to understand factors driving microbiome biodiversity patterns. Preliminary results are extensive, and a broad snapshot of Ecuador's biodiverse microbiome is being described and will be published during 2021.
Although the EcuMP remains far from what the BMP has accomplished in Brazil, we have gathered hundreds of samples and metagenomic data from a wide range of habitats, elevations, and regions, including tropical forests and extreme environments such as Andean volcanoes. The EcuMP will soon contribute to new insights into understanding the range of variability and structure of bacterial communities in Ecuador. Such publications will enable the EcuMP to establish links of collaboration with regional and international partners. Further stages of the EcuMP will consider the role of climate change and other major abiotic and biotic components associated with biological diversity. The evaluation of functional diversity and the exploration of the microbial transcriptome, proteome, and metabolome are considered long-term goals of the EcuMP and expected within a segment of five years of uninterrupted development of the EcuMP in Ecuador.
Our data will only have relevance if we share it with the global community to contribute to knowledge discovery and innovation. Data management in the EcuMP project pays special attention to the principles of findability, accessibility, interoperability, and reusability (FAIR) [61]. The application of the FAIR principles to the data gathered by the EcuMP will contribute to strengthen further efforts in understanding the microbiome and guarantee that our data becomes reusable research objects, and acquire as much social and scientific value as possible [62].

Potential sources of bias and future improvements
The use of keywords in the previously described searches provided an indirect measurement of how active countries are in the different categories that were considered. However, the results of these searches do not specify if core research and data production was made within the country or occurred elsewhere. In most cases, search results do not inform us if samples were obtained from the source country and taken abroad for analytical processes or if there was at least a minimum of collaboration within the country to produce publications and sequence data. To understand to what degree researchers, research centers, and other institutions within a country actively participate and contribute to microbiome and metagenomic research will require a finer assessment of the details in each recorded publication and registry in the assessed databases.
Many biodiverse countries are part of the international agreement known as the Nagoya Protocol, which proposes a general mechanism for fair and equitable sharing of benefits arising from research and technological innovation on genetic resources. Ecuador has developed a corpus of legislation in concordance to the Nagoya Protocol, a specific component of which is the enforced participation of local researchers in international research initiatives related to genetic resources [63]. It remains to be assessed the extent to which the analyzed publications have been the product of this enforced mechanism of collaboration, and if the expected transfer of knowledge and technology has been achieved.
Another potential bias in our sampling approach is multiple submissions within a project by a single research group, laboratory, or research institute. This will overestimate how much a country invests in metagenomic and microbiome research. An instance of such a case is Venezuela, where eight of the nine metagenome BioProjects have been submitted by a single project from one research institute. Nevertheless, our study provides a general overview of how countries in the South American region participate in the production of microbiome and metagenomic knowledge.

Challenges and prospects
Most developing countries have recognized the importance of microbial metagenomic research, with the establishment of well-funded national programs [64]. Setting aside the production from developed regions around the world, there is a substantial gap in the production of metagenomic and microbiome knowledge within South America. When Ecuador is compared to Brazil, Argentina, Chile, and Colombia, such a gap consists of at least an order of magnitude in the production of metagenomic information. Ecuador consistently lags behind the latter countries, and although it remains in the second or third quartiles, it leads the group of countries with the weakest production. Brazil is remarkable and stands out far beyond what any other country has produced in terms of publications and metagenomic data in South America.
The literature associated with Ecuador is mostly foreign production with only secondary participation by Ecuadorian authors. Independently of the origin of the data and samples, little to no processes, analyses, and technological activities occur within the Ecuadorian territory. The marginal role that Ecuadorian institutions and accompanying authors have in studies related to bacterial metagenomics or the microbiome in Ecuador is both a consequence and a cause of the technological gap that is observed within the South American region.
It may be argued that the differences in scientific productivity presented in this study are well-known facts, which cut across all disciplines and fields of knowledge. However, we argue, such facts are circumscribed to a small community of science specialists and the discussion does not reach into the broader decision-making sector. We believe that a critical assessment of the regional situation of Ecuador in relevant areas of scientific and technological importance is necessary for establishing sound national strategies, especially when this information becomes available to broader audiences.
It is a well-known fact that researchers in developing countries experience a range of social, cultural, and economic deterrents and pitfalls for the publication of their work in international journals [65][66][67]. Language is one of the main barriers involved in the divide between mainstream science and local publications or even the presence of forgotten studies that remain unpublished [68]; especially when we consider that English is the predominant lingua franca among scientific journals. The coverage of the literature quantified in the present study is restricted mostly to MEDLINE as the primary bibliographic database source for PubMed. The extent to which native South American research on the microbiome and metagenomics is not properly represented in the assessed databases remains unknown to us. Factors such as limitations to communicate in written English, access to competitive technology, political instability, and a set of cumbersome laws and regulations to access and work with genetic assets may all interact together for the observed lack of productivity in the region [69][70][71]. The application of the Nagoya Protocol, of which Ecuador is signatory through local legislation has been extensively criticized by the scientific communities of many countries, due mainly to its complexity and excessive requirements [63,[72][73][74]. In this sense, the contrasting differences between Brazil and the rest of South American countries, or the case of Chile for patents on bacteria, should be studied in additional detail to understand those social, cultural, and economic factors that are most favorable to the study of the microbiome and scientific research within the South American region.
Patents are the result of complex social and cultural interactions and are one "core artifact" in a broader system by which a bioeconomy may develop [75]. Thus, the term "bioeconomy", in its broadest sense, consists of the transformation of natural resources to achieve food security, sustainability, avoidance of fossil-fuel dependency, climate change mitigation, strengthening of the job market, and rural development. A range of strategies may be applied to reach many forms of bioeconomies, depending on the nature of a country's geography, culture, and state of development, from diversified high-tech to basicprimary economies [76]. The many ways and strategies a country may transform its production matrix into a new bioeconomy [77] must include not only the promotion of new forms of technology and knowledge (such as patents and scientific publications) but also new forms of political visions and social interactions [75,78,79].
Besides cumbersome laws and language barriers; the development of microbial ecology, as a discipline in Ecuador, shares with all fields of the natural sciences and biotechnology other critical obstacles. These include: 1) excessive costs and taxes for equipment and supplies, most of which have to be imported through intermediaries along extended periods, 2) long-term research usually lacks long-term funding to transpire into internationally relevant initiatives, 3) there is no clear separation between transient political moments and sustained national strategies in science and technology (sensu Bush [80]), and 4), universities in Ecuador have been traditionally disconnected from scientific inquiry and committed to replicating knowledge produced elsewhere [70].
A thorough assessment of obstacles for research in microbial biodiversity goes beyond the scope of this manuscript. We consider that the seminal proposal by Bush [80], for the successful science and technology system implemented in the US, could provide hints for a sound strategy in Ecuador. Particularly important among Bush's proposal is the principle of independence and stability that long-term research requires in both funding and political interests and decisions. Open markets with zero tax liability for scientific equipment and supplies, relaxed controls for public funding spending in science and technology, and a stable longterm national fund for developing extended science and technology programs are all part of what we believe is necessary to remove the discussed obstacles.
A synopsis of the 787 accredited researchers by the Ecuadorian national authority of higher education, science, technology, and innovation (SENESCYT), reveals that only 4 are related to research in microbial diversity, metagenomics, and the microbiome. The Constitution of Ecuador determines that 0.55% of the national GDP must be allocated to national scientific research activities. According to the World Development Indicators database by the World Bank and reported in Roser and Ortiz-Ospina [81], the most recent report for Ecuador is from 2014 and corresponds to 0.44% of the GDP. This is larger when compared to Colombia (0.30%) and Peru (0.11%), but considerably smaller than the investment in Brazil (1.17%). Reported expenditures to the World Bank do not necessarily reflect direct investments into research programs, with some arguing that the effective investment for scientific research and development is close to 1% of that required by the Constitution [82]. Given the small available workforce dedicated to studying microbial diversity in Ecuador, and that our national expenditure in science and technology is reduced, at least in terms of direct financing to research programs [82], a national strategy is required for the continued collaboration of Ecuadorian institutions with scientists from developed countries. Genomic research is relatively expensive and international collaboration is essential for strengthening national competitiveness.
To reduce the breach within the South American region in the production of metagenomic and microbiome knowledge, efforts such as the Ecuadorian Microbiome Project require steady support of government agencies related to scientific and technological development. Steadiness in budget provisions and independence from the transitory political periods have both been a foundational component of the United States strategy for science and technology, since the publication of its first national scientific program [80]. The simplification and rationalization of the current national regulations and processes for the procurement of imported technology and machines are also essential. It is particularly critical to the success of bioprospecting projects that the processes for obtaining permits and contracts to access biological samples and information get streamlined; or otherwise, these bureaucratic establishments will remain unsurpassable obstacles to development [83][84].
The national strategies to quantify and valorize biological diversity cannot any longer neglect the important realm of microorganisms, on which most of the current and future biotechnologies depend. We make a call for local teams in Ecuador and abroad, that are involved in research on microbial diversity, to join the Ecuadorian Microbiome Project and help strengthen this important area of scientific and technological development.
Technological progress towards an economy based on biocapitals is especially crucial for developing countries that depend on non-renewable commodities such as petroleum and raw agricultural materials with little added technological value. How nations assign value to their biological resources, such as those held in the microbiome, is particularly critical nowadays in the age of rapidly expanding technologies that can read, design, and translate the digital language of life. The economic welfare of nations may increasingly come to depend on how rapidly they adapt to new paradigms of technological biocapitals and bioliteracy. Modern biotechnology is at the heart of political, economic, and social debates, both internationally and nationally. The state of microbiome and metagenomic research in Ecuador and the South American region remains to be developed, such that the observed gap will no longer represent a risk to technological independence and economic productivity.