Relationship between economic diversification and CO2 emissions: ARDL-EC modeling in South Africa

ABSTRACT Economic diversification transforms an economy to utilize multiple sectors for growth. The problem is that developing countries have been slow in shifting away from mineral-energy complex-based development. However, sector-heterogeneity either reduces or increases exposure and/or adaptive capabilities to greenhouse gas emissions. The objective of the study was to ascertain the relationship between economic diversification and carbon dioxide (CO2) emissions in South Africa. The study utilized the autoregressive distributed lag-error correction model (ARDL-ECM) and included macroeconomic variables such as gross domestic product (GDP), population, foreign direct investment and trade balance. The Tress Index of 10 industries was utilized to measure economic diversification. Annual data from 1993 to 2020 were used in the study. The study found that in the short run, CO2 Granger caused economic diversity but in the long run, there was no association. Population also had a long- and short-run relationship with economic diversification, while CO2 emissions had a long- and short-run relationship with GDP. The study concludes that there is a unidirectional causality between economic diversification and CO2 emissions in the short run and no relationship in the long run. Economic diversification should be considered in national, economic and climate change policies of the country.


Introduction
Sub-Sahara African (SSA) economies are still not diversified, relying on the production and export of unprocessed primary products (Proctor 2014). Economic diversification is a strategy in the transformation of an economy to utilize multiple income sources spread over all sectors (UNFCCC 2016b(UNFCCC , 2016a. It aids in economic growth and reduces vulnerability to technological changes, market fluctuations, and economic crisis (Ferraz et al. 2021). The primary purpose of economic diversification is to improve economic tolerance, alleviating poverty, building resilience against fluctuations in economic activity, job creation, and reducing vulnerability to income loss. It is also a strategy that reduces or enhances exposure and/or adaptation to greenhouse gas (GHG) emissions, such as carbon dioxide (CO 2 ), which have been utilized as proxies for global warming and climate change (Shahzad et al. 2020;Laverde-Rojas, Guevara-Fletcher, and Camacho-Murillo 2021). This is due to the heterogeneity of GHG emissions based on sectors of GDP structures. The level of heterogeneity is thus propelled by the level of diversification within an economy. Various factors drive economic diversification, and they need to be understood holistically. These include firm productivity, GDP per capita, exchange rate, inflation, FDI, terms of trade, population, human capital, quality of institutions and climate change policies (UNFCCC 2016b(UNFCCC , 2016a. Despite the apparent significance of economic diversification on GHS emissions, there has been a limitation in the literature that highlights the linkages between industrial policy, economic and environmental sustainability (Ferraz et al. 2021).
South Africa is the most diversified economy in SSA, which is industrialized and technologically advanced (DoTIC 2021). There has been growth in South Africa's economy, mainly resulting from advances in the manufacturing and services sector. The significant investments in energy-intensive industries have, however, limited diversification (Winkler and Marquand 2009). The 2007 National Industrial Policy Framework (NIPF) in South Africa endeavored to facilitate diversification and promote industrialization which was broad-based and labor absorbing: intensification of the country's industrialization and movement toward a knowledge economy (TIPS 2016). The NIPF was envisaged to shift away from the Mineral-Energy Complex (MEC) which was capital intensive, de-centralization of industrial activity, and support for Small to Medium Scale Enterprises (SMEs). This resulted in the country's manufacturing economy becoming more diverse. Export-led growth philosophy was adopted as a strategy to accelerate economic growth. Thus, the industrial policy was more export oriented with intensions of job creation (TIPS 2016).
However, there has been a trade-off envisaged from South Africa's growth. The country became the 13th largest CO 2 emitter in the world, and the largest in Africa (du Plooy and Jooste 2011). This is due to its reliance on coal, which accounts for 72% of the country's primary energy consumption. Since 1990, CO 2 emissions in South Africa have increased by 64%, as electricity demand has increased and coal has remained the dominant source of generation. Export of carbon-intensive goods account for 40% of South Africa's emissions (du Plooy and Jooste 2011). This is due to the fact that the country has a large mineral wealth, a strategy of energy-intensive industry investment and a 90%-plus carbon-intense coal-based electricity generation base. This is despite the country pledging at the 2009 Conference of Parties of the UNFCCC to limit GHG emission to a Peak Plateau Decline (PPD) of 34% and 42% reduction by 2020 and 2025, respectively. This PPD was refined as presented in the country's National Climate Change Response Paper (NCCRWP) and National Determined Contribution (NDC) to a level of between 398 -614 Mt CO 2 -equivalent for the years 2020-2025 and 212-428 Mt CO 2 -equivalent for the years 2036-2050 (Trollip and Boulle 2017). Fifty-six percent of CO 2 emission in South Africa is in electricity and heat production, followed by construction and manufacturing (23%), transport (13%), residential buildings, commercial and public services (4%), and other sectors (4%) (Beidari, Lin, and Lewis 2017;Adebayo and Odugbesan 2021;Shikwambana, Mhangara, and Kganyago 2021).
Even though economic growth and diversification remain key developmental goals in South Africa, it has been difficult to diversify away from a mineral-energy complex-based development path inherited from the apartheid era, which is still attracting significant local and foreign direct investment (Winkler and Marquand 2009). This energy system in South Africa is a point of contention between the country's economic development and SDGs commitment to reducing GHGs. Jury (2002) went on to further identify that economic diversification in South Africa did not reduce vulnerability to emission-induced climate change. According to Strambo and Atteridge (2021) as well as Strambo, Burton, and Atteridge (2019), economic diversification plays a role in offsetting the declining coal sector in South Africa. This is despite Meyer (2020) indicating that economic diversification had a significant impact on economic growth and development, which is coalbased, in South Africa. The objective of the study was to ascertain the relationship between economic diversification and CO 2 emission in South Africa. To the best of author's knowledge, little to no studies have been carried out in South Africa that focus on the relationship between economic diversification and CO 2 emissions, even though there are global studies (Ferraz et al. 2021). There is not any available study that considers South Africa in the context of economic diversification and CO 2 emissions. Undertaking studies that evaluate this relationship add value to planning policies aimed at increasing economic diversification as adaptation strategies, a response measure, or means to reduce adverse impacts of response measures to CO 2 emissions, especially at a national level allowing for country-specific policy formulation. There is a need for research on the association between economic diversification and CO 2 emissions in South Africa. This is in recognition of the need to move beyond aggregate growth to include the quality of economic development. Furthermore, it tends to highlight vulnerabilities of economic systems both as perpetrators and/or victims of CO 2 emissions, contributing to sustainability. Understanding the intricacies between economic diversification and CO 2 emissions will allow scrutiny into more inclusive and sustainable production systems.

Literature review
Economic diversification literature have focused on economic development and have just recently explored environmental sustainability (Ferraz et al. 2021). In the literature, there has been interchangeability in assessing economic diversification and economic complexity. According to Hausmann et al. (2013) economic complexity measures the intricate network of interactions and is expressed as the country's productive output. Yet, Inoua (2021) acknowledges that diversity in products reflects diversity in knowhow and productive capacity. However, Ferraz et al. (2021) aptly differentiates economic complexity, which goes beyond economic diversification. Economic complexity relates to not only the number of economic activities, but also the quality and network positions (Ferraz et al. 2021).
According to UNFCCC (2009) emissions, which affect climate change, have an effect on livelihoods and economic activities. These effects have spatial and temporal variations. This is also true for the economies. There is a relationship between economic activities and CO 2 emissions (Laverde-Rojas, Guevara-Fletcher, and Camacho-Murillo 2021). This is due to the fact that industrial production is accompanied by intensive consumption in energy and gas emissions. Environmental degradation can, however, transcend production volumes, as alluded to by Bekhet, Matar, and Yasmin (2017), Bélaïd and Youssef (2017), Cerdeira Bento, Paulo, and Moutinho (2016) as well as Mirza and Kanwal (2017). Other authors have explored product diversification and its relationship with CO 2 emissions (Shahzad et al. 2020;Can, Dogan, and Saboori 2020). These studies found that product diversification had a positive and significant impact on CO 2 emissions. Several studies have been carried out focusing on the role of economic diversification and complexity in reducing CO 2 emissions (Ferraz et al. 2021;Can and Gozgor 2017;Moutinho, Madaleno, and Elheddad 2020;Laverde-Rojas, Guevara-Fletcher, and Camacho-Murillo 2021). Can, Ahmad, and Khan (2021) found that economic complexity promotes CO 2 emissions in a study of 10 newly industrialized countries (NICs) for the period of 1970-2014. In France, a study was conducted by Can and Gozgor (2017) for the period of 1964-2014, and highlighted the long-run negative relationship between economic complexity and CO 2 emissions. In a study covering a period of 1971-2014 in Colombia, Laverde-Rojas, Guevara-Fletcher, and Camacho-Murillo (2021) found that the country does not yet benefit from economic complexity in reducing environmental degradation. Ikram et al. (2021) found that in Japan, there was a bi-directional relationship between economic complexity and ecological footprint. In China, Sarwar et al. (2019) found that for the period of 2004-2015, industrialization had a positive effect on CO 2 emissions. The identified literature highlighted that technological and innovative products reduce emissions through reducing energy consumption. Wang et al. (2021) support this with the findings of long-run cointegration between green growth and technological innovation in China. This has been through the promotion of the eco-friendly technology. Thus, there are several benefits which include economic growth while reducing environmental impact, providing an adequate environment for energy transition and managing pollution control. The literature also identified that there is a prioritization of products that reduce environmental impact. There is also a concentration of investment in sectors that reduce environmental impact (Ferraz et al. 2021;Guo et al. 2021). However, most of the literature concentrates on economic complexity relative to economic diversification. Economic complexity studies emphasize on knowledge-based activities such as intensive technological products or industrial sectors, neglecting other activities such as agriculture (Ferraz et al. 2021). Okamoto (2013) circumvented this shortfall in a study evaluating the impact of structural change from manufacturing to a service economy on CO 2 emissions in Japan. Furthermore, such studies have been scanty in a developing country such as South Africa. In South Africa, Meyer (2020) assessed the relationship between economic diversification and growth, while Wewege (2013) identified growth paths accounting for climate change. Kwakwa and Adusah-Poku (2020) concentrated on the manufacturing sector contribution to CO 2 emissions, whereas Ngarava et al. (2019) assessed the possibilities of agricultural sufficiency given CO 2 emissions. In as much as, there are differing focuses of the studies, there were also various methodologies that were used, with Meyer (2020) utilizing a Fisher-Johansen framework, Wewege (2013) utilizing the Economic Complexity methodology, and Ngarava et al. (2019) employing the Environmental Kuznets Curve (EKC). Ferraz et al. (2021) advocates methodological integration in economic diversity and environmental sustainability literature to comprehend 'development traps, constraints and opportunities for economic catch-up and leap-frogging ahead'.

Theoretical framework
The Environmental Kuznets Curve (EKC) theory stipulates that environmental degradation increases as income increases until saturation, whereafter, it starts to decrease with an increase in income (Moutinho, Madaleno, and Elheddad 2020). This is the so-called inverted U-shape ( Figure 1) (Can and Gozgor 2017). From the EKC, initial economic development is associated with scale effects especially in primary industries such as agriculture (Ngarava et al. 2019). The next stage in the development process involves industrialization and extensive structural effects which emit intensive pollution. The last stage involves reduction in emissions based on the improved technology. More economic transformation is exhibited in this stage with its associated diversification. This is due to the fact that technological improvements are associated with new products and systems (Can and Gozgor 2017).
The EKC theory draws from two narratives: (i) interaction between scale, structure, and technology, and (ii) quality of life increases based on income increase (Ngarava et al. 2019). Increase in scale is associated with the increase in CO 2 emission. Structural changes also increase emissions, but to a lower extent and eventually lower it. Technology and R&D improves the efficiency and quality of environment. Other authors, such as Baek (2015), argue that instead of an inverted U-shape, the EKC is an inverted N-shape, with a last stage of increase in environmental degradation in a developed economy ( Figure 1). In the quality-of-life proposition, more attention is given to better healthy consumption, environmental protection, and government intervention in environmental protection. The current study hypothesizes that economic diversification can lead to high CO 2 emissions, which is highly undesirable. However, high CO 2 emissions can also lead to economic diversification policies as mitigation strategy to curb unwarranted emissions. Thus, there is bi-directional dependence on each other. Various measures have been used in measuring diversification within an economy (Laverde-Rojas, Guevara-Fletcher, and Camacho-Murillo 2021). Product diversification was utilized by Mania (2020). Economic Complexity Index (ECI) was developed by Hidalgo and Hausmann (2009), enabling quantification of a country's production through trade data interpretation. In this instance, a country's income levels correlate with economic complexity. Authors, such as Neagu and Teodoru (2019), Doğan, Saboori, and Can (2019) as well as Can and Gozgor (2017), have utilized the ECI to ascertain whether product diversity has an impact on environmental degradation. There has been a divergence in the results obtained, offering an opportunity to ascertain how the interaction between environmental degradation and economic diversification plays out in a developing country such as South Africa.

Methodology
Economic diversification can be measured as the share of sectors in GDP, share of sectors in exports, dependence of a country on the export of a good, and the employment share of sectors. Measurement can be either from a country's absolute specialization (e.g. diversification index, Gini index, Herfindahl-Hirshman index, entropy index, and ogive index) or from a country's economic structure from a reference group of industries (e.g. inequality in productive sectors, relative Gini coefficient, and Theil index) (UNFCCC 2016a, 2016b). Tress Index (TI), as utilized by Meyer (2020), was used in the study. The current study is the first to utilize the TI in its association with CO 2 emissions. The TI indicates the level of concentration or diversification in a region's economy (Quantec easydata 2021). It has a range of 0-100, where 0 represents a totally diversified economy, while a 100 represents a concentrated economy which is vulnerable to exogenous variables such as adverse climate change (Spocter 2012). A time series of the TI, which is increasing, reflects an expanding dependence of the local economy on a few or single economic activities, and increases vulnerability to climate change. The TI was ideal because it did not rely on exports, unlike the export diversification index. Furthermore, it considered the complexities within the economy relative to the Herfindahl-Hirshman index and concentrated on the contribution of each sector based on weighted averages relative to the entropy index which assumes an equi-proportional distribution (Bebczuk and Daniel Berrettoni 2006;Espoir 2020;Takada et al. 2020). There are 5 steps involved in calculating the TI (Quantec easydata 2021): (1) Calculating the contribution of each sector to the GDP; (2) Sector ranking based on contribution; (3) Multiplying each sector by its appropriate weighting; (4) Summing up the total weighted values of the sectors; and (5) Subtracting from the total and then dividing by the difference between the highest and lowest potential total weighted values.
The study utilized the TI of 10 industries, as prescribed by Quantec easydata (2021), which included (i) finance, insurance, real estate, and business services; (ii) manufacturing; (iii) general government; (iv) wholesale and retail trade, catering, and accommodation; (v) transport, storage, and communication; (vi) community, social, and personal services; (vii) agriculture, forestry, and fishing; (viii) construction; (ix) electricity, gas, and water; and (x) mining and quarrying. The TI index was calculated as follows: where P i is the percentage contribution of sector i to the overall GDP, W i is the weight of sector i based on ranked contribution to overall GDP (the higher the contribution, the higher the weight), W l is the lowest potential weight (which is 550 for 10 industries) and W h is the highest potential weight. The difference between the highest and lowest potential weights is 4.5 for 10 sectors (Quantec easydata 2021).
The study utilized the autoregressive distributed lagerror correction model (ARDL-ECM) to estimate the relationship between economic diversification and CO 2 emission, as well as other variables such as gross domestic product (GDP), population, foreign direct investment (FDI), and trade balance. This was modeled on the function shown in Equation (2).
The ARDL-ECM was utilized by authors such as Can, Dogan, and Saboori (2020), Coulibaly and Akia (2017) as well as Laverde-Rojas, Guevara-Fletcher, and Camacho-Murillo (2021). According to Coulibaly and Akia (2017), ARDL distinguishes between endogenous and explanatory variables. Its model's long-term estimates are super-coherent in small samples, providing unbiased coefficients and valid results even when explanatory variables are endogenous. It also has an advantage of application regardless of the order of the variables, either I(0) or I(1), and simultaneously estimates both long-and short-run parameters (Kohler 2013). The ARDL model was specified as follows (Adeleye 2018): where Y ′ i was a vector and the variables in (X ′ t ) ′ were allowed to be purely I(0) or I(1) or co-integrated; b and w were coefficients; a was the constant; i = 1, . . . , k; p, q were optimal lag orders; m it was a vector of the error termsunobservable zero mean white noise vector process. This was reduced to the following form: where TI, GDP, CO 2 , Pop, FDI, TRD , and 1 are Tress Index, gross domestic product per capita, carbon dioxide emission (kilo-tons), population, foreign direct invest, trade balance, and error terms, respectively. All variables except the TI were at constant 2000 level. All variables were taken into logarithmic form before estimating the models. All the data were annual from 1993 to 2020, due to the fact that the TI recording in South Africa was started in 1993. The data were obtained from Quantec easydata (2021). These variables were included in the model due to the fact that they affect CO 2 emissions and economic diversification (Can, Dogan, and Saboori 2020). Tress Index (TI), FDI, and trade balance were expected to have a negative association with CO 2 emissions, while GDP and population were expected to have a positive association. Increase in TI indicates a concentrated economy which is less advanced and technologically friendly, therefore as diversification increases, CO 2 emission will decrease. Increase in FDI and trade balance will also improve the mix of activities in the economy and product diversity, respectively, thereby improving diversification and reducing CO 2 emission. Carbon dioxide (CO 2 ), GDP, FDI, population, and trade balance were expected to have a positive association with TI. Increase in TI indicates a more concentrated economy which is energy intensive. This is also the case of increase in GDP, while population increase induces pressure and demand on natural resources. A positive trade balance allows for specialization, reducing economic diversification. The ARDL bounds model, which was used in the study, was based on the Wald statistic (F-statistic) for cointegration analysis (Odhiambo 2012). The null hypothesis of no cointegration was tested against the alternative of the existence of cointegration. If the F-statistic exceeded the critical bounds value, the H 0 was rejected, while it was not rejected when it is lower. If the statistic fell between the upper and lower bounds, then it was inconclusive. If no cointegration was detected, the following ARDL p, q 1 , q 2 , q 3 , q 4 , q 5 model was specified as follows: If cointegration was detected, the error correction model (ECM) was specified as follows: where ECT t−1 was the error correction term which should be negative and statistically significant. Post estimation techniques were employed after analyzing the short-run and long-run relationships. Diagnostic tests of normality, collinearity, heteroscedasticity, linearity, and stability were performed. The Jarque-Bera test was used to determine the distribution of the model. The Breush-Godfrey Serial Correlation test was used to examine the residuals for serial correlation and determine whether the residuals were independent. Heteroscedasticity was assessed for equality of variance spread through the Breusch-Pagan Godfrey test, while the Ramsey RESET Linearity test was used to ascertain linear relationships. The structural stability of the models was tested using the CUSUM of squares test.

Results
Descriptive statistics Table 1 shows that for the period 1993-2020, there was a relatively more economic diversification in a Tress index of 10 industries relative to 50 industries. There was an average of 421 475.5 kilotons of CO 2 emissions, with US$3.45 billion FDI inflows. The country's GDP alternated between US$115 and US$416 billion, while it was a net importer of US$10 thousand worth of goods and services.
In the country, there was a gradual increase in the Tress Index and CO 2 emissions from 1993 to 2020 (Figure 2). The Tress Index increased by an average of 0.13 units annually, while the CO 2 emission increased by 7081 kilo-tons annually.
In South Africa, there has been a gradual economic concentration in Western Cape and Gauteng Provinces, as exhibited by the increase in the Tress indices ( Figure  3). Provinces, such as North West, Mpumalanga and   Limpopo, have been diversifying their economies since 2000.

Empirical results
A unit root test was initially performed to test for stationarity and determine the order of integration. The unit root test in Table 2 shows that the variables were integrated at different orders, making the ARDL model ideal. Based on the Phillips-Perron (PP) Test, FDI, population and trade balance were stationery at levels, while economic diversity, CO 2 emissions, and GDP were stationery at first difference. Table 3 illustrates the lag selection criteria. This was performed to avoid bias and inefficiency when too few or too much lag lengths are selected. There were mixed lags from the different models based on the Akaike Information Criterion (AIC) and Schwarz Criterion (SC). A lag of 0 was obtained for CO 2 emissions and FDI models, while 1 was for GDP model and 4 for economic diversity, population, and trade balance models, respectively. The ARDL short-run and long-run were carried out using the AIC.
The ARDL Bounds test, indicated in Table 4, shows that there was a long-run equilibrium cointegration between the variables in all models, except the GDP model, which is inconclusive due to the fact that the Fstatistic lies between the lower and upper bound at the 10% level. Thus, the ECM was necessary to specify the long-run models, except for the GDP model. Table  4 shows that in the short run, GDP had a positive relationship with TI, while GDP t−1 , population, and TRD t−1 had a negative relationship with TI. In the short run, CO 2 emission had a positive relationship with GDP and Pop t−1 , while it had a negative relationship with Pop. In the short run, TI t−1 had a negative relationship with Pop and a positive relationship with TRD.
In the long run, Pop had a negative causal relationship with the ED, while GDP had a positive causal association with CO 2 emission (Table 5). Pop and GDP also had a positive association in the long run. A 1% increase in population increased economic diversity by 0.20%, whereas a 1% increase in economic concentration decreases the population by 4.74%. An increase of 1% in the GDP increased the CO 2 emissions by 0.33%. However, a 1% increase in CO 2 emissions increases GDP by 2.01%. Finally, a 1% increase in GDP increases population by 0.07%.
The error correction terms indicate that there were long-run causal relationships in the models. In the economic diversity model, the reversion to long-run equilibrium was at an adjustment speed of 68%, and it will take 1.47 years ( 1 68% ) to achieve this equilibrium. The speed of adjustment was at 84%, 98%, 36%, 5%, and 117% in the CO 2 emission, FDI, GDP, population, and trade balance indices models, respectively (Table 6). Time to restore equilibrium in the CO 2 emission model was 1.19 years, while it will take 1.02, 3.78, 20, and 0.85 years in the FDI, GDP, population, and trade balance models, respectively. Table 7 shows that CO 2 emission Granger caused economic diversification and FDI, while economic diversification caused FDI and population. Population Granger caused economic diversification, while GDP also Granger caused economic diversification.
From the diagnostics tests in Table 8, the Jarque-Bera normality test showed that the error terms were normally distributed in all models except the trade balance model. Thus, the estimates had minimum variance property. The Breusch-Godfrey Serial Correlation LM Test showed that the error terms were independent in all models, except for the FDI model, and thus do not rely on the previous periods' value. The Breusch-Pagan   Godfrey test indicated that there was no heteroscedasticity, except for the FDI model, and thus there was no constant variance and they were homoscedastic. The models were well specified, as indicated by the Ramsey RESET Linearity Test, except for the GDP and trade balance models, indicating linear relationship. The CUSUM of squares test showed that all models were structurally stable, except for the trade balance model, and thus suitable for long-run decisions. Thus, the Tress, CO 2 , and population indices models were normally distributed, homoscedastic, had no collinearity, well-specified, and stable. However, there was collinearity and heteroscedasticity in the FDI index model as well as no normality and instability in the trade balance index model. The GDP model was not well specified. The diagnostic tests verify that the models for Tress index and CO 2 were valid.

Discussion
In both the short and long run, Tress index had negative causality with population index. This indicates that the economy becomes more diverse as the population increases in both the short and long run. Yakushnina (2019) explains this relationship through the need of social infrastructure as the population grows. The social infrastructure is mainly through social security, health care, education, housing, and communal services, among others. The productive infrastructure will also require to be expanded through the development of the economic basis and improvement of the standard of living. This tends to lead to a more diversified economy. The quality-of-life needs of a growing portion of the population, as well as the need to improve job opportunities, reinforce the need for a diversified economy (Yakushnina 2019). However, Hidalgo (2009) postulates that economic complexity is a result of the complexity of an economic structure of only a proportion of a population. Rightly so, a class/ social structure within a population also tends to drive diversity in needs, which mirrors complexity within an economy. The social structure within South Africa changed significantly since the end of apartheid in 1994. There was an increase in the black middle class. According to Yameogo et al. (2014) improving conditions of the middle class of emerging economies was  necessitated by the expansion of manufacturing sectors in countries that had relied on the extractive and agricultural (primary) industries. However, this expansion has increased structural transformation and economic diversity within such countries. The study, however, found that economic diversity had no association with CO 2 emissions in the long run. This was in line with Laverde-Rojas, Guevara-Fletcher, and Camacho-Murillo (2021) findings in Colombia of the lack of benefit in increased economic complexity. This was contrary to Neagu and Teodoru (2019) who found economic complexity having an impact on greenhouse gas emissions, especially in countries with the lower level of economic complexity in the EU. This was observed in the short-run Granger causality test in the current study. The findings were also contrary to Dogan et al. (2021) as well as Doğan, Saboori, and Can (2019) who found economic complexity increasing environmental degradation in the lower and higher middle-income countries, while controlling CO 2 emissions in high-income countries. In France, Can and Gozgor (2017) identified that economic complexity suppresses CO 2 emissions in the long run. The major explanation can be in the differing measures of economic complexity and economic diversification. Economic complexity goes beyond economic diversification to include the quality and network positions in a knowledge economy (Ferraz et al. 2021). The result can also be due to the fact that South Africa has not reached the latter stages of development where technological advances and their associated diversification outweigh scale effects in growth. Thus, in the short run, CO 2 Granger caused economic diversity but in the long run, there was no association. In a study in Japan, Okamoto (2013) highlighted that structural change in an economy improves the environment. Gross Domestic Product (GDP) also caused economic diversity, while population had bi-directional causality with economic diversity, in the short run.
Carbon dioxide (CO 2 ) emissions index had a positive association with GDP index in both the short and long run. An increase in GDP increases CO 2 emission in both the short and long run. The findings were similar to Dogan et al. (2021), Simbi et al. (2021) as well as Menyah and Wolde-Rufael (2010) who highlighted that GDP was positively associated with CO 2 emissions. This is due to the fact that economic progress and development incite CO 2 emission increase through the scale effect of growth trajectory (Joshua, Bekun, and Sarkodie 2020). This is due to the dependence on highly emitting energy sources (Dogan et al. 2021). In South Africa, it is difficult to reduce CO 2 emissions without compromising economic growth (Menyah and Wolde-Rufael 2010). Waheed, Sarwar, and Wei (2019) highlight that this was mainly evident in developing countries and not evident in developed countries. However, Bekun, Emir, and Sarkodie (2019) found that in South Africa, an increase in economic activities decreased CO 2 emissions. Acheampong (2018) actually found out that it is CO 2 emissions   Population index had positive causality with GDP in both the short and long run. In the short and long run, a rise in population increased GDP. Dogan et al. (2021) attributes this to an increase in several human activities within an economy as the population increases. However, Peterson (2017) attests that economic growth in high-income countries will be stifled by low population growth, while high population growth in low-income countries will slow down development. Joshua, Bekun, and Sarkodie (2020) denotes that population increase is associated with population explosion with urbanization increasing commercial activities. However, Omri, Nguyen, and Rault (2014) found to the contrary, indicating that population had a negative effect on CO 2 emissions.

Conclusion
The objective of the study was to ascertain the relationship between economic diversification and CO 2 emission in South Africa. The study found that in the short run, CO 2 Granger caused economic diversity but in the long run, there was no association. The study fails to reject the hypothesis of no bi-directional relationship between economic diversification and CO 2 emission, and concludes of a unidirectional causality in the short run and no relationship in the long run. The study suggests that South Africa is still in the latter stage of the scale phase of the EKC model, due to economic diversification (with its associated growth) having indifference relationship with CO 2 emissions. On one hand, the long-and short-run relationship between economic diversification and population augment this conclusion based on the social and productive infrastructural transformation pushing this relationship. Furthermore, the short-run relationship suggests that the country is still in the scale phase where expansion and growth (with its associated diversification) drive CO 2 emissions. Policy implications include the consideration of economic diversity in national, economic, and climate change policies such as the National Development Plan (NDP), National Industrial Policy Framework (NIPF), and National Determined Contributions (NDC). A balance needs to be struck between the scale effects of economic growth and structural, as well as technological, effects incorporating social and productive transformation and the associated economic diversification. This will overall have a reductive effect on CO 2 emissions. Economic diversification should be considered as an environmental pollution policy tool in the short run. This can be through active promotion of environment-friendly economic transformation, for example, utilization of renewable products and services, which will meet the country's NDP, NIPF, and NDCs. The conclusion that the country is in the latter phase of the scale phase in the EKC theory indicates that the country will still be energy reliant in the short term. Policy should be focused on transitioning to renewable energy to improve growth while improving environmental sustainability. Furthermore, the EKC theory should be further expanded to include the effects of economic diversification and complexity to improve its utilization in policy.

Disclosure statement
No potential conflict of interest was reported by the author(s).