The Basilicata region (Southern Italy): a natural and ‘human-built’ open-air laboratory for manifold studies. Research trends over the last 24 years (1994–2017)

Abstract The Basilicata region (Southern Italy) is characterized by a peculiar environmental as well as biological, geological, and cultural heritage features that are evident in national and regional parks as well as places of monumental and archaeological relevance, including the Matera UNESCO Site. Basilicata hosts the highly studied Vulture volcanic complex and the largest hydrocarbon reservoir in continental Europe. Furthermore, the region falls among the Italian regions most prone to landslides and floods and it is located in a seismotectonic background responsible of strong earthquakes. Therefore, the territory of Basilicata can be considered an open-air laboratory, both natural and “human-built”. The laboratory, with its features, has attracted many scientists worldwide and over time. The researchers have focused their attention on manifold studies. That being stated, the article aims to analyse the outputs of the scientific investigations targeting the territory of Basilicata within the last 24 years (1994–2017) with a bibliometric approach. The Thomson Reuters’ Science Citation Index Expanded and the Social Sciences Citation Index were the two bibliographic databases considered. Once the pertinent articles were extracted from the two citation indexes, the authors analysed the publication trends, Web of Science categories, countries, and hot topics.


Introduction
Basilicata is a small region of Southern Italy with distinctive environmental, biological, geological, and cultural heritage features that mark out the territory, the regional and national parks, and the archaeological and monumental places, including the Matera World Heritage Site. As a result, the territory of Basilicata can be considered an open-air laboratory, both natural and "human-built". Thus, scientists have performed manifold studies considering a wide range of disciplines and adopting integrated research strategies (Multi-Inter-Trans-Disciplinary) (sensu Stock and Burton 2011). Just to the purpose to shed light on these research activities, the work considers the bibliometric study of articles having the Basilicata region (or its portions) as a target. The Thomson Reuters' Science Citation Index Expanded (SCI-EXPANDED) and the Social Sciences Citation Index (SSCI) were the two bibliographic databases considered. In detail, we analysed the articles published during the time-span of 24 years, from 1994 to 2017.
The general aim of bibliometric studies is to investigate research areas and infer how they have changed over time so as to get some clues about future research tendencies (e.g. Marx and Bornmann 2013;Niu et al. 2014;Gizzi et al. 2015;Knapczyk et al. 2018). Besides this, the results of this study can also stimulate the scientific community to build interdisciplinary and transdisciplinary research networks, according to what is recommended by international organizations (e.g. Gleed and Marchant 2016), so as to further sustain the competitiveness of research projects within international scenarios.
The article is made up of four main parts: the first part deals with the main features of the Basilicata region; the second introduces the methodological approach followed and the interpretation criteria of the data; the third discusses the results of the bibliometric analysis, with a brief review of the main research activities; and the fourth draws the conclusions and lists the perspectives of the study.

Features of the Basilicata region: a bird's eye view
Basilicata is a region of Southern Italy with an area of about 10,000 km 2 and 570,000 inhabitants (http://dati.istat.it/Index.aspx?QueryId=18564, last access on 27 June 2018). It is one of the smaller regions in Italy, embracing 131 communes in the two provinces of Potenza and Matera. The chief-town of the region is Potenza (Figure 1).
The territory is characterized mostly by mountains and hills, and by plains in a very little portion. In the southern part of the region, crossing the border with the Calabria region, the largest national park in Italy, the Pollino National Park, is present. Among the regional parks the 'Murgia Materana', that includes ancient evidences of the human prehistoric presence as well as the Rupestrian Churches, is worthy to be mentioned as it was inscribed in the UNESCO World Heritage Site together with the 'Sassi of Matera' (The Sassi and the Park of the Rupestrian Churches of Matera, https://whc.unesco.org/en/list/670, last access 27 June 2018). The Sassi represent the most ancient part of Matera town. They are surrounded by the Gravina stream flowing along the deep natural gorge that strongly marks out the local landscape. The Sassi symbolizes an extraordinary example of a rich historical stratigraphy. In point of fact, stable settlements have been documented in an important way and uninterruptedly on the top of Civita (the first inhabited nucleus) since the II millennium BC and they have grown along the right side of the Gravina stream whereon are still lying the two Sassi: Caveoso and Barisano (Gizzi et al. 2016).
Looking at the northern part of the region, the Pleistocene Vulture volcanic complex is present and, even it is not active, it constitutes a lively field for studies and an attractive touristic site for the presence of two volcanic lakes (Lago Grande and Lago Piccolo).
The plain portion of the region extends towards the south-east areas and the Ionian coast where, due also to the fertility of the soils crossed by the main rivers (Sinni, Agri, Basento, and Bradano rivers flowing into the Ionian sea), most of the agricultural activities are developed.
The orographic setup has influenced the natural vegetation cover accordingly, by crossing from forests, grasses, and pastures mostly located in mountains to less vegetated and dry areas in the central-eastern part of the region up to the Mediterranean scrub, in the coastal zones. In particular, the Pollino massif hosts a wide variety of natural vegetation which testifies the different climatic conditions affecting the territory. In this area, included in the homonymous National Park, the main feature of the natural vegetation is represented by the presence of Pinus leucodermis (from 800 up to 2,000 m above the sea level) which is a testimonial of a biological species which lives only in this portion of the territory.
As regards the human activities, the agriculture is developed mostly along the coastal zones, cultivated by orchards and cereals, and in the northern part where vineyards and olive groves are the dominant cultures in the volcanogenic Vulture basin. The vineyards in Basilicata cover about 4,021 ha, out of which 2,879 in Potenza province and 1,142 in Matera province (Schedario Viticolodati 2010), and have been recognized by important European and national acknowledgements due to their quality and production area.
By a geological point of view, Basilicata is part of the Southern Apennines which represent a NW-SE oriented segment of an arc-shaped thrust-and-fold belt (Gueguen et al. 1998). The geological history of the Southern Apennines is connected with the complex geodynamic evolution of the Mediterranean Sea, reflecting the interaction between the African and the European plates.
The main geologic features that can be identified in Basilicata are: (1) the Apennines stricto sensu, with silico and carbonatic deposits, evaporites, and ophiolitic deposits, Upper Jurassic -Cretaceous to Middle Pliocene; (2) the Bradanic Trough, to the east of the Apennine chain, with Lower Pliocene and Quaternary deposits (gravel, clay and sand); (3) the Apulia foreland with the Murgia Plateau whose western limits outcrop in Basilicata, mostly composed by carbonatic deposits, Lower Cretaceous to Upper Miocene.
Due to its geological features, Basilicata hosts the largest hydrocarbon reservoir in continental Europe, which has been studied since the beginning of the 20th century by many geologists and exploited by oil companies (Lazzari 2013).
Basilicata is one of the Italian regions most prone to hydrogeological instability, in which nearly 50% of the towns are classified as exposed at high risk of landslides or floods (e.g. Clarke and Rendell 2006;Di Maio et al. 2010;Lazzari 2011). The area affected by flood risk is 260 km 2 wide (2.6% of the whole territory) and mainly includes the hilly areas and the coastal belt, which are exploited in agriculture, the main regional economic source (e.g. Piccarreta et al. 2013). Moreover, the nature of cropping terrains coupled with the climate dynamics have brought several inner areas to desertification, due to the widespread forms of land degradation (badlands) including rills, gullies, debris flows and loss of soil fertility (e.g. . This is probably due to the type of soil and to the deforestation, necessary to make room for pastures and the growing of cereals. Besides this, the Basilicata region is considered to be a zone of moderate-high seismic activity. Presently, most of communes fall in medium and high seismic zones (http://zonesismiche.mi.ingv.it/class2004.html, last access, 21 May 2018). The seismic hazard of the region is well testified by numerous strong historical earthquakes (Rovida et al. 2016), the last of which occurred on 23 November 1980 (Mw 6.8), causing about 3000 deaths with very heavy damage to residential buildings, cultural heritage, productive, and industrial tissue. Furthermore, the seismic event also caused significant primary and secondary effects on the territory and environment (e.g. Esposito et al. 1998;Blumetti et al. 2002;Gizzi et al. 2012;Gizzi et al. 2014). Due to the hazard, studies have also been addressed to the management of the risk, so as to reduce the seismic vulnerability of buildings (e.g. Tilio et al. 2012).
From the demographic and socio-economic point of view, the Basilicata region is characterized by a very low population density-less than 55 inhabitants per km 2 (http://noi-italia.istat.it/, last access on 27 June 2018)-, showing a sharp decrease in number of inhabitants in the last two decades. As for its industrial setting, Basilicata has a high percentage of micro-enterprises, higher than the national and southern region average, while the percentage of larger firms (between 10 and 19 employees) is slightly below the national average. Most of the micro-enterprises are individual firms whose business is linked to commerce and services. There are a few large international companies in the fields of oil and gas extraction activity (ENI, Total, and Shell), the automotive industry (FIAT) and agri-food (Ferrero and Barilla) (Regione Basilicata 2009).
Due to the human activities, the attention towards the environmental quality has been increased during the last decade with a particular focus on soil and air quality of the industrial areas located in Agri and Basento valleys, and Tito Scalo (Caggiano et al. 2005;Chianese et al. 2006).
With regard to the research and innovation system, Basilicata is characterized by numerous top-tier research centres, such as the University of Basilicata, some institutes of the National Research Council (CNR), the National Agency for New Technologies, Energy and Sustainable Economic Development (ENEA), the Centre for Space Geodesy (CGS) of the Italian Space Agency (ASI), and private consortia aimed at promoting R&D to SMEs.

Material and methods
In order to perform the bibliometric investigation, we taken the SCI-EXPANDED and SSCI citation indexes as a reference to extract the suitable articles (including items published as proceeding papers).
With the intention of pulling out the appropriate records from the two databases, we considered some terms to be used to search for topics (titles, abstracts, author keywords, and keyword plus) of the indexed articles. Numerous search terms were selected, spanning from the general words such as Basilicata and Lucania Ã , to the names of 131 communes (including the main hamlets) of the region, the name of the main rivers, mountains, lakes, and valleys. So as to avoid possible fake results, the abstracts of the articles resulted in the search output were manually analysed each time that a new search string was added to the previous.
The search query built with these terms has allowed to automatically extract the articles from the two databases. Despite the easiness of the approach, well consolidated in bibliometric studies, the fixed query could not be satisfactory as the articles that do not include geographically constraint terms in the topics (e.g. the name of the communes) would not have been caught by the query. In order to increase the chance of building as wide a dataset as possible, we considered additional terms to be included in the query, such as 'south Italy', 'south of Italy', 'southern Italy', and 'southern Apennine Ã '. However, with the intention of identifying the articles targeting the Basilicata territory, the use of non-specific terms forced us to check by hand each article, analyzing the fulltexts in a quick way. These indirect search criteria allowed to increase the dataset obtained by the automatic extraction by about 30%.
Once the records were selected from the two databases, they were downloaded (June 1, 2018) as plain text. The conventional analysis of the data, such as the publication trend and the subject categories, was made through the MS Excel software. As concerns the country co-authorship and the co-occurrence term networks they were performed through the VOSviewer software (version 1.6.8) that builds distance-based maps (www.vosviewer.com; Waltman et al. 2010). Details on this software will be given in the dedicate section (4.2.).

Publication trend, subject categories, and countries
The total number of articles is 1,592. The mean yearly number is equal to about 66. The lowest number of articles was in 1994 (16), the highest in 2015 (120). Until 2003 the annual number of articles was less than 50 with a yearly average of 33. In 2004 the annual number of articles increased quite significantly, counting 81 items. Since 2008 the number of yearly items shows a general increase even if the fluctuations in the trend are not negligible ( Figure 2).
The number of Web of Science categories is 115, in the whole reference period . The noteworthy number of categories involved depends on two main aspects: the first confirms the circumstance that Basilicata is an open-air laboratory, useful to attract scientists acting in different research areas; the second is due to the fact that journals where the articles are published can be assigned to multiple categories.
Researchers from fifty-nine countries all around the world authored or co-authored the articles. The most productive countries ( Figure 3) were Italy, France, Germany, and England with more than 100 articles. The international collaboration network of the countries that signed at least 10 articles is shown in Figure 4. In the figure, the nodes identify the countries, the size of nodes is proportional to the total number of articles from each country and the line thickness represents the cooperation link strength between two countries. By analyzing the network, it emerges that Italy collaborated with all top-productive countries (total link strength equal to 425) even if it had more cooperation with France (link strength equal to 88) and Germany (link strength equal to 69).

Co-occurrence term maps and hot issues
The co-occurrence networks are used to give an image of possible relationships between words. In order to build the co-occurrence term map we used the VOSviewer software that considers the text mining technique to identify the noun phrases from titles and/or abstracts. After that, the noun phrases are classified on the basis of a relevance score: high relevance score is assigned when terms co-occurs mainly with a limited set of other noun phrases so showing a more specific meaning  Figure 3. Countries that authored or co-authored the articles. For each country the number of articles is indicated. For the sake of readability, data for England, Scotland, Wales, and Northern Ireland are grouped as UK. Articles from Taiwan are grouped as China (see also Figure 4).
in the domain under investigation. In this way, noun phrases with low relevance score are those tend to be too general and meaningless for the domain of interest and therefore, excluded by the analysis. Therefore, VOSviewer grouped the high relevance noun phrases together into clusters to identify possible research topics or subfields. The default option of the software is to select 60% most relevant terms among the noun phrase that occurred 10 times at least (Van Eck and Waltman 2011; Van Eck and Waltman 2014). The software produces three typologies of maps: network visualization, overlay visualization, and density visualization. The network visualization shows the items by their label and also by a circle. For each term, the size of the term label and the size of the term circle depends on the weight of the term. Furthermore, the colour of an item is determined by the cluster to which the item belongs. The overlay visualization is the same of the network visualization except that items are coloured in a different way, depending on the user choice. The density visualization highlights the density of an item or a cluster.
To build our co-occurrence term maps, we used the titles and abstracts to extract the noun phrases. After that, we built four co-occurrence maps: the first three maps, presented as network visualizations, concern the analysis over four 8 years long time windows : 1994-2001, 2002-2009, and 2010-2017. These maps make available information about the temporal changes of research fields, identifying their modifications over the years. The fourth map consists of the overlay visualization map regarding the whole time-span covered by this study , to give a general overview about the change of research fields over time.
From a methodological point of view, according to the previous experiences (e.g. Gizzi and Leucci 2018), in order to assure the building of maps and clusters having a . Overlay visualization of the co-authorship of the 16 countries that signed a number of articles equal to or greater than 10 (data on the productivity of all 59 countries are shown in Figure 3). The colours identify the average publication year of the articles for each country: we can infer that some countries (e.g. Canada and Russia) take a curtain call in more recent years.
number of terms suitable to represent the article content, the VOSviewer parameters such as the percentage of most relevant terms, the minimum number of term occurrence to be included in the map, and the cluster resolution were adjusted considering that the elaborations for earlier periods rely on a database made up of a fewer number of articles (Table 2). In order to select the significant terms, the option of inserting a thesaurus text file was considered to merge different spellings of the same term and different terms referring to the same concept. Furthermore, words considered too general and not relevant were deleted by hand. That being stated, we stress that the elaborations that will be discussed in the following sections must be considered as strictly correlated to the 'threshold' values adopted in the data elaboration (Table 2).
According to what reported before, the diameter of a circle and the size of labels in the visualizations reflect the number of articles that have the corresponding term in their title or abstract. The terms that co-occur many times (that is the number of articles in which both words occur together is high) tend to be placed close to each other in the map, so that some clusters can be identified. Each of these is marked by some leading terms.
It must be underlined that in order to assure the 'morphological' comparison of the maps they were rotated and flipped horizontally and/or vertically. Furthermore, for the sake of readability the maps were graphically edited to add the keyword labels to some elements. Lastly, the visualization scale and the label size variation were adjusted for each map separately, so as the diameter of the circles cannot be used to quantitatively compare the term occurrences between the maps.

1994-2001 co-occurrence map
The 1994-2001 map presents three main clusters. Adopting a simplified scheme, the first most populated (red) cluster can be associated to the Apennine geology (APG) studies, the second (green) group calls for climate (CLT) researches, the third (blue) bunch hosts articles on seismological aspects (SEISM) ( Figure 5).
The APG and CLT clusters are of a comparable size (34 items against 30, respectively). The CLT cluster shows words where each of them is quite close to each other, so that one might infer that the field of research is already quite well structured. Density of terms decrease progressively from APG to SEISM cluster, the last showing the lowest number of terms (18).
The APG group, quite well connected with the SEISM group, can be identified by geological, geodynamic, seismotectonic, and geochemical studies of the Southern Apennines. One of the first work regards the structural analysis of the  (Gialanella et al. 1994;Marsella et al. 1995;Pescatore et al. 1999;Mazzoli et al. 2001). The Anzi-Calvello and Sant'Arcangelo basin(s) were also considered in the research. In detail, the tectonic evolution and the stratigraphicsedimentary analyses were performed (Bonini and Sani 2000;Zavala 2000). The structure and the evolution of a segment of the Lucanian Apennines, extended from the Pollino to the Agri Valley and the evolution of the erosional landsurfaces of the Campanian-Lucanian Apennines were analysed by Monaco et al. (1998) and Amato and Cinque (1999), respectively. Furthermore, the analysis of the exploration data related to the oil fields in the Agri Valley allowed to analysed the deep structure [47] of the Campanian-Lucanian Arc (Noguera and Rea 2000). In order to date the active faulting and constraint possible model [39] of earthquake sources of the Agri Valley, some studies considered the analysis of fault [27] properties by the acquisition of quantitative data aimed at assessing the structure permeability and the fractal properties of the faults (e.g. Cello 2000;Cello et al., 2000;Giano et al. 2000). The study of the horizontal strain [10] rate field from GPS data and site velocity [11] of the Central-Southern Apennines were also analysed by the researchers (e.g. Serpelloni et al. 2001). Further studies involved the analysis of historical and recent seismicity of the Potentino area allowing to identify at least four seismogenic zones (Alessio et al. 1995). Moving towards the boundary between the Basilicata and Calabria regions, geological studies considered the seismic potential of the Pollino fault. By paleoseismological trenches, Michetti et al. (1997) identified traces of two historical earthquake(s) [6] with a magnitude 6.5-7.0 so as to infer that the seismic hazard of the area was higher than the Italian seismic catalogue suggested at that time. The Pollino was also one of three Italian areas considered in further research aimed at clarifying whether the Massif is a silent seismogenetic area or the documentary sources missed to record the earthquakes (Valensise and Guidoboni 2000).
Several studies also regarded the stress analysis by using borehole breakouts and focal mechanisms of the Southern Italy and the static stress changes imposed by 23 November 1980 earthquake in the Southern Apennines (e.g. Amato and Montone 1997;Nostro et al. 1997).
In the left-lower portion of the red cluster, close to the green cluster, we can identify a subgroup of terms related to the articles dealing with the chemical composition and the mineralogical-petrographical-physical properties of sample(s) [30], including xenolith garnet(s) [7], sampled at the undersaturated alkaline-potassic to ultrapotassic Mount Vulture strato-volcano rocks. These investigations were made also in view of drawing geodynamic, volcanological, and tectonic deductions (e.g. Marini et al. 1994;Melluso et al. 1996;Stoppa and Principe 1997;Peccerillo 1998;Jones et al. 2000;Rosatelli et al. 2000). The CLT green cluster is mainly related to the paleoclimate studies performed on Monticchio lake records. The researchers identified environmental and climatic changes during the last glacial period, including the Dansgaard-Oeschger and Heinrich event(s) [6]. This was possible by the use of high-resolution sedimentological, microstratigraphical, geochemical, physical, radiocarbon, pollen, diatom, and plant macrofossil analyses performed on a large number of core samples of the lacustrine sequence (e.g. Watts et al. 1996aWatts et al. , 1996bZolitschka and Negendank 1996;Hajdas et al. 1997;Allen et al. 1999;Bindi et al. 1999;Huntley et al. 1999;Nimmergut et al. 1999;Ramrath et al. 1999;Allen et al. 2000). Also the use of geomagnetic records were considered as proxies of climate change until about 100 kyr BP (e.g. Creer and Morris 1996;Brandt et al. 1999).
The SEISM blue bunch, located at the right side of the map, mirrors studies that can be referred to the analysis of seismic precursors, considering the geoelectrical, seismoacoustic, geochemical and thermal fluctuation(s) [18] (e.g. Cuomo et al. 1998;Cuomo et al. 2000;Lapenna et al. 2000;Italiano et al. 2001;Telesca et al. 2001;Tramutoli et al. 2001).

2002-2009 co-occurrence map
Analyzing the 2002-2009 map four main clusters can be identified ( Figure 6). The three clusters that were included in the map of the previous time period increased their significance, as we can argue from the number of terms and the word density in each of group. Furthermore, a new bunch of words (AGR-ENV-VEG-LAN) appears in the lower side of the map, identifying studies related to agriculture, environment, vegetation, and land that, however, taken individually do not reach the sizes to constitute independent clusters.
The APG (red) cluster confirms to be the largest cluster of the map (51 items). It gains more terms as respect to the previous time span showing a raise in differentiation of the studies regarding the Southern Apennines. Among the new terms, in the upper-left side of the cluster, shale [11] subtends some geochemistry studies useful, for example, to understand the genesis and the paleoenvironment of shales having different ages (e.g. Di Leo et al. 2002;Cavalcante et al. 2003). Also the structural setting [10] (upper-right side) seems to be an emerging term, so as such features of the thrust belt [29] or Mount Vulture [36] Volcano were also considered (e.g. Bentivenga et al. 2005;Fedi et al. 2005;Piedilato and Prosser 2005;Schiattarella et al. 2005).
Moving the attention to the central-left side of the cluster, mantle xenolith [6] and carbonatite [9] emerge, pointing out that they can be referred to petrographic and geochemistry studies on Vulture rocks with consequent interpretations also in geodynamic (subduction [15]) perspective (e.g. Downes et al. 2002;D'Orazio et al. 2007).
The other clusters of the map show a comparable size. The new (2nd) purple group, that counts 44 items, shows words that we can further subdivide into some sub-clusters, such as agriculture-food, environmental, vegetation, and land studies (AGR-ENV-VEG-LAN). As a matter of fact, in the lower part of the cluster we find the terms that mainly reproduce the agricultural-food studies. We refer, for example, to the analyses of the red Aglianico [8] wine [13] and those studies related to the scrutiny of aspects linked to the fermentation processes, the compositional causes of the sweet-like off-flavour or the analysis about the effects of heat and/or light on the composition of some volatile compounds (e.g., Romano et al. 2003;Lechiancole et al. 2006;Tat et al. 2007;Cellamare et al. 2009;D'Auria et al. 2009). Other studies consider the use of spectroscopic techniques to differentiating the Italian wines produced in different regions, comparing the Aglianico wines from Basilicata with those of the neighbouring Campania region (Viggiani and Castiglione Morelli 2008). The necessity of the viticulture zoning of the 'Aglianico del Vulture' was the subject of further research (Caputo et al. 2006).
Research in agriculture include the investigation about biological features, nutritional quality, and antioxidant activity of the common bean (Phaseolus vulgaris L. [7]) landraces [6] (e.g., Piergiovanni and Pignone 2003;Heimler et al. 2005;Lioi et al. 2005). Also the irrigation [9] was a matter to be considered in agriculture by the analysis of the role that can be played by wastewater (e.g. Lopez et al. 2006;Palese et al. 2009), or the impact of irrigation on cultivations (e.g. Metzidakis et al. 2008;Favati et al. 2009) and the usefulness of irrigation to mitigate desertification process avoiding the loss of soil productivity (Costantini et al. 2009).
In the central-lower part of the cluster we can identify terms that call for vegetation studies. These terms exceed the boundary of the group, as shown by the interpenetration between the purple and green clusters. Pollino National Park [10] was the subject of studies regarding the investigations on structure, environment relationship, biological features of plant [26], and animal populations (e.g. Mura 2004;Pandolfi et al. 2004;Bellusci et al. 2005;Gargano and Bernardo 2006).
At the right side of the cluster we find, in a quite isolated position, the word ndvi [8] (Normalized Difference Vegetation Index). As we can see, the word is very close to the blue cluster, so showing a relationship with some of these terms. Really, the connection with the blue cluster is due to the link with detection [14], algorithm [19] and dfa [8] (detrended fluctuation analysis) method, this last was also adopted to examine the seismic precursor data. The Normalized Difference Vegetation Index was used to detect archaeological crop marks and archaeological buried remains (Lasaponara andMasini 2006, 2007), analyse vegetation [22] phenology, land [11] degradation (e.g. Bonfiglio et al. 2002;Lanfredi et al. 2003), and fire susceptibility (Lasaponara 2005).
As concerns the ENV subgroup, in the upper portion of the bunch, several studies concern the presence and/or the source origin of heavy metal(s) [8] in marine sediments (e.g. Buccolieri et al. 2004Buccolieri et al. , 2006, atmosphere (e.g. Ragosta et al. 2002Ragosta et al. , 2006Ragosta et al. , 2008, soil (e.g. Chianese et al. 2006), and plant or animal species (e.g. Caggiano et al. 2005;Santoro et al. 2009).
Towards the left side of the cluster, we find the join of terms that 'simulate' the climate studies (CLT). The green group, that counts 40 terms (4th), shows some new terms such as Younger Dryas [8]. The term indicates one of the most well known examples of abrupt climate change during the shift from the last cold glacial stage to a warmer interglacial state. During this transition, temperatures in the Northern Hemisphere suddenly returned to the near-glacial conditions. This near-glacial period is called Younger Dryas from the flower Dryas octopetala that was common in Europe during this phase. Studies about the Younger Dryas in the sediments of Lago Grande di Monticchio were performed by pollen [12] investigations so showing that there was no evidences at Monticchio of a Younger Dryas-like oscillation during the penultimate deglaciation (e.g. Brauer et al. 2007;Allen and Huntley 2009). The cluster also refers to articles having the changes in temperature(s) [39] and precipitation(s) [12] during historical times as a focus, also analyzing their consequences (e.g. Lucarini et al. 2004;Piccarreta et al. 2004;Toreti and Desiato 2008).
Observing the map, at the antipode of the green cluster, we find the 3rd cluster (SEISM-MLD) that by 43 terms mirrors the seismological studies and other research fields that do not reach the size useful to build independent clusters. Therefore, the cluster that in the previous time period incorporated seismological (SEISM) studies expands, including also specific seismotectonic investigations (at the border with the APG red cluster) and other research fields that for the sake of brevity we can classify as multidisciplinary (MLD).
At the right-lower side, a number of terms can be referred to investigations on earthquake precursors with special attention to the geoelectrical and natural electromagnetic signals recorded in the seismic areas of Giuliano and Agri Valley analysed by means of the multifractal detrended fluctuation analysis (MF-DFA [8]) (e.g. Telesca et al. 2004;Balasco et al. 2007). These articles are in line with similar studies referred to the earthquake 'cluster' of the 1994-2001 time span. In the central-lower part of the bunch, other terms such as network, seismicity call for the study of data recorded from seismic networks to investigate the wave attenuation in the Southern Apennines (e.g. Bianco et al. 2002;Castro et al. 2004), infer seismotectonic aspects in the Lucanian Apennines and foredeep (e.g. Frepoli et al. 2005;Brozzetti et al. 2009;Maggi et al. 2009), and analyse the temporal and/or spatial relations between seismicity and human activity in Agri Valley (e.g. Valoroso et al. 2009). Furthermore, towards the end of the 2002-2009 period, the ISNet seismic network was in advanced phase of completion. The system was designed to provide real-time or near real-time earthquake alert notification for early-warning applications and post-event rapid evaluation of ground-shaking maps (Weber et al. 2007). Other studies refer to 1930 and 1980 Irpinia earthquake(s) [11]. As regards the 1930 earthquake, the articles involve both investigations aimed at clarifying the features of the seismic sources by the use of historical waveforms (Pino et al. 2008) and the analysis of damage effects caused by the earthquake in some urban areas, among which Potenza (Gizzi and Masini 2007). Considering the 1980 Irpinia seismic event, studies face the analysis of the fault structure, the faulting geometry, the coseismic displacement or the combined analysis of some geophysical, geochemical, and hydrogeological anomalies recorded around the time of the seismic event (Armigliato and Tinti 2003;Improta et al. 2003;Amoruso et al. 2005;Pulinets et al. 2007). For both these two events systematic analysis and cataloguing of the environmental effects such as surface faults, ground cracks, slope failures, and liquefactions were performed by, for example, Porfido et al. (2007). The area affected by the 1857 Basilicata earthquake, that struck the Agri Valley [18] and the Diano Valley was also analysed considering, for example, the potential seismogenetic source (e.g. Cello et al. 2003;Burrato and Valensise 2008) and the site amplification effects (e.g. Gallipoli et al. 2003).
In the central-left portion of the cluster, the term building [18] is correlated with earthquake [76] (link strength-LS-equal to 7), damage (LS = 5) [12], and frequency (LS = 3) so showing that the group of terms calls for seismic risk analysis and earthquake engineering studies such as the earthquake loss scenarios (e.g. Dolce et al. 2006) or the analysis of fundamental frequencies and damping for buildings in Potenza (Gallipoli et al. 2009). Other studies regard the seismic behaviour of base-isolated buildings (e.g. Braga and Laterza 2004;Cardone et al. 2006) and the investigation about the soil-structure interaction (e.g. Mucciarelli et al. 2003;Gallipoli et al. 2004). Close to these terms, we can also note landslide [20]. Other terms of the area close to landslide are, for example, geophysical investigation [8], electrical resistivity tomography [11], reactivation [6], and damage [12]. The articles that subtend two or more of these words indicate that geomorphological, geophysical, numerical, topographical, historical-documentary, and dendrochronological methods or techniques as well as approaches based on the artificial intelligence were considered to investigate, monitor the landslides, and manage the hazard in urban areas as well as in cultural heritage sites (e.g. Rizzo 2002;Troncone 2005;Perrone et al. 2004;Caniani et al. 2008;Gizzi 2008;Guida et al. 2008, Naudet et al. 2008.

2010-2017 co-occurrence map and synoptic temporal pattern
The 2010-2017 map shows five main clusters, one more than in the previous time span (Figure 7). The new yellow cluster concerns the tephrochronology/tephrostratigraphy studies (TEPH).
The 1st most populated cluster (purple) is made up of 80 terms and it can be thought of as subdivided in three sub-clusters (AGR-LAN-ENV). In the lower part of the bunch we can identify the sub-group related to the agriculture(-food) studies (AGR) whose words trace the prosecution of the research on both Aglianico Vitis vinifera [new term, with 7 occurrences] and wine from it derived. Several aspects are investigated, such as the ampelographic and pedigree features of the Vitis as well as the nutraceutical properties of the wine (e.g. Alba et al. 2011;De Lorenzis et al. 2013;De Nisco et al. 2013). The agriculture sub-cluster also considers the irrigation issues in relationship with the cultivations and yield(s) [new term, 40] (e.g. Katerji et al. 2010;Palese et al. 2010;Candido et al. 2015).
The central-right side of the cluster refers to the sub-group whose words trace the land as subject (LAN). Some articles deal with land degradation [10], desertification [new, 11], and management [52] issues (e.g. Salvati et al. 2011;Basso et al. 2012;Salvati et al. 2013;Kelly et al. 2015). Other words consider the urban sprawl issue for different localities of the Basilicata region (e.g. Amato et al. 2015;Di Palma et al. 2016).
In the upper-left side of the cluster, it is possible to identify words that trace the environmental studies (ENV). We refer to the research in pollution [new, 16] of river sediment due to metals (e.g. Medici et al. 2011), soil pollution in industrial areas (e.g. D'Emilio et al. 2010), air pollution (e.g. Caggiano et al. 2011;Pavese et al. 2012;Trippetta et al. 2014), groundwater [new, 10] (and) contamination [10] due to dumpsites (e.g. Caniani et al. 2015). Further investigations regarded the dispersion and pollution caused by asbestos fibres (e.g. Beneduce et al. 2012) and the pollution of soil and groundwater due to agricultural chemicals and pathogens (e.g. Coppola et al. 2014).
Adjacent to the AGR-LAN-ENV cluster we find the 2nd blue cluster (62 items, SEISM-MLD) that growths in complexity as respect to the previous time span.
In the upper part, where seismological studies emerge (SEISM), the investigations on seismic precursors disappear and some new terms including seismic hazard [11], seismic swarm [7], and isnet [ISNet,9] catch the eye. The acronym ISNet indicates the Irpinia Seismic Network, planned to estimate in real-time the epicentre and magnitude, provide ground-motion parameters, assess the ground shaking expected and study the seismogenetic faults in the Southern Apennines. The network and its technological features as well as the analysis of data recorded from the network were the subject of numerous articles in the 2010-2017 time span (e.g. Emolo et al. 2011;Satriano et al. 2011;Vassallo et al. 2012). Another new term of the cluster is reservoir [27] (lower-left area) that joined with seismicity [32] calls for the research about the induced seismicity caused by wastewater re-injection (e.g. Improta et al. 2015;Buttinelli et al. 2016) or annual fluctuations of water level in the Pertusillo artificial lake reservoir of Agri Valley (e.g. Stabile et al. 2015).
In the central-lower part of the cluster the MLD studies start, among which physical geography studies, engineering field-related investigations, and cultural heritage researches.
Close to reservoir we can also find drainage network [new, 8] that considered jointly with landslide [42] subtends studies regarding the relationships between landslide occurrence and drainage network (e.g. Santangelo et al. 2013).
The lower part of the cluster shows other new terms such as SAR [13, new] (Synthetic Aperture Radar) sensor(s) [new, 10] that was considered to study the Montescaglioso landslide (Raspini et al. 2015). SAR, jointly with other techniques, was also taken as a tool for the monitoring [42] of the Musumeci bridge in Potenza chief-town (Proto et al. 2010;Catapano et al. 2012;Fornaro et al. 2013;Stabile et al. 2013), the investigating of archaeological site(s) [14] (e.g. Chen et al. 2015), and the analysis of flood(s) [new, 10] (e.g. D'Addabbo et al. 2016). As regards this last, the use of flood maps allowed to identify the flood-prone areas over the Bradano River basin (Manfreda et al. 2015).
The term energy calls for energy performance analysis of Matera public and historical building(s) [27] (e.g. Cardinale et al. 2013;Rospi et al. 2017). However, the term energy [22] is also related to the adjacent purple cluster. This can be argued analysing some articles that deal with the energy consumption in relationship to the agricultural crop(s) [new, 29] (e.g. Cozzi et al. 2015;Pergola et al. 2017) or availability of biomass [new, 14] (Delivand et al. 2015) (right part of the purple cluster).
Cultural heritage [new, 8] is another new compound term that emerges in the lower-right side of the blue cluster. Cultural heritage calls for studies performed on Matera UNESCO site, considering the monitoring approach based on the ICT technology (Gribaudo et al. 2017), the conservation state of wall(s) [12] of monuments or building(s) of the Sassi of Matera (e.g. Danese et al. 2010;Gizzi et al. 2016). Other research in the cultural heritage field consider the use of the in situ survey(s) [63] for monitoring the heritage by means of gpr [new, 7] (ground-penetrating radar) or low cost technologies (e.g. Masini et al. 2010;Sileo et al. 2017). Furthermore, the analysis of slope [46] processes and hazard [new, 26] of archaeological site(s) [14] was also considered (e.g. Lazzari and Gioia 2017).
The closeness of the MLD sub-cluster to the SEISM-related research (upper portion) is due to multiple factors such as the cause-effect relationship between earthquakes and landslides, the circumstance that these natural hazards can cause damage to building(s) or cultural heritage. Other reason is the sharing of some approaches, methods, and survey techniques to investigate on landslides, earthquakes, and buildings, such as the hazard analysis, the monitoring, the mapping, the use of geophysical survey techniques such as gpr.
The emerging yellow cluster (3rd, TEPH) of tephrocronology-tephrostratigraphy studies is made up of 49 terms.
In the northern part of Basilicata, archaeological and volcanic studies were performed by intertwining and overlapping each other towards important discovers dealing with evidences of the Early Acheulian culture in Western Europe as the oldest Middle Pleistocene human fossil discovered in Italy so far, found and dated in tephra [48] at Notarchirico site, in the Venosa basin (Lef evre et al. 2010). The tephra examined in this area were well preserved in the medium-sized maar lake 'Lago Grande di Monticchio', occupying a crater of Mount Vulture. The sediment sequence of the Monticchio lake represents one of the most relevant archive since it provides both eruptive evidence from adjacent volcanic centres and robust chronologies. Here, geochemical, tephrostratigraphy, and chronology studies have been carried out during 2010-2017, by allowing to reconstruct the eruptive history of Vulture and other Italian volcanoes during the late Quaternary, and the physical link to other Holocene archives located in the Central and Eastern Mediterranean (e.g. Lake Ohrid [7]). The chronology of the sediments in Monticchio lake has allowed to provide paleoclimatic records back to 133ka BP, the end of the penultimate Glacial (Wulf et al. 2012). The transition from stadial to interstadial conditions at Monticchio is well documented by the distinct lithological changes in the sediments, and this natural archive has contributed to understand the effects caused by the Heinrich stadial H4 and the correlations recorded also in the Eastern Mediterranean area. In this framework, the correlation between the Campanian Ignimbrite tephra layer and the 40-38 ka varved sediments was studied allowing to elaborate considerations on the existence or absence of correlation between the onset of a millennial-scale cold stadial H4 and the occurrence of the Campanian Ignimbrite [9] eruption (Wutke et al. 2015). In the Monticchio lake the distal tephra fallout layers from volcanic sources in Central and Southern Italy, which are located between 100 and 540 km away, were indeed found such as from the Campanian Ignimbrite (Campi Flegrei [9]), Monte Epomeo Green Tuff (Ischia Island [7]), and Alban Hills (Roman Province).
The 4th red APG cluster is made up of 36 items. Among the new terms, conodont [new, 10] (lower-left part of the cluster) identifies the use of the minute tooth-like fossil as biostratigraphical constraints of the Lagonegro basin [14] succession at different sites (e.g. Giordano et al. 2010). Further studies regard, for example, the investigation of sedimentary and/or tectonic [31] aspects of geological formations or morphological units (Cavalcante et al. 2015;Sinisi et al. 2016;La Bruna et al. 2017).
The 5th green cluster, linked to the yellow one, is made up of 30 items. It has the least number of words and it is mainly related to terms focusing on both paleo/recent climate-environment change and vegetation studies (CLT-VEG).
Many studies consider the pollen [19] records of Lago Grande di Monticchio to perform paleoclimate comparisons and correlations between different sites at national Figure 8. Co-occurrence term in overlay visualization . The overlay colour of each circle corresponds to the average publication year of all the articles that include the corresponding word. In this map, the terms with the blue colour represent the research activities with older average publication year and the terms with the yellow colour show the terms with more recent average publication year. or international scale for the last glacial and interglacial stages (Pini et al. 2010;Bogot a et al. 2011;Joannin et al. 2012;Pickarski et al. 2015). As a matter of fact, the sediments of Lago Grande di Monticchio have been cored since 1982 and the relative pollen record are very well-documented (e.g. Watts, 1985;Allen and Huntley 2009). Other studies refer to a series of investigations about the paleoenvironment changes and climate patterns of specific marine successions (e.g. Marino et al. 2015).
Climate [59] analysis of data recorded over the last decades was also performed at regional and local scale in Basilicata to investigate different aspects such as the changes in precipitation and the Urban Heat Island phenomenon (e.g. Longobardi et al. 2016;Giorgio et al. 2017).
The other group of terms can be identified with different studies such as those on plant physiology or related to the relationship between changes in vegetation and natural/human conditioning factors (e.g. Paletto et al. 2014;Rita et al. 2014;Colangelo et al. 2017;Giarrizzo et al. 2017). This group of terms is connected with the left side of the (AGR-LAN-ENV) purple cluster.
To illustrate the evolution of research areas over time we can also show the terms analysis of the publications, but with time information. To this purpose, Figure 8 shows a co-occurrence term map based on titles and abstracts of the articles in the period 1994-2017. The colour of a term in the overlay visualization indicates the average publication year of the articles in which the term occurs. Terms occurring in recent articles are coloured green and yellow. These terms can be found mainly in the lower part of the overlay visualization where agriculture-food, land, environmental, and vegetation studies (lower-right area of the map), and the tephrochronologytephrostratigraphy investigations (lower-left) can be identified as 'new' research fields. On the contrary, the lower average publication year (blue colour) is related to some 'mature' research fields such as the Apennine geology studies and seismological investigations (central-upper part of the map).

Conclusion
The article has considered the bibliometric analysis of the research outputs concerning the Basilicata region during the 1994-2017 time span. Two databases were considered: the Thomson Reuters' Science Citation Index Expanded (SCI-EXPANDED) and the Social Sciences Citation Index (SSCI) accessed via the Web of Science Core Collection.
The total number of articles was 1,592 and the number of Web of Science categories involved was 115. Fifty-nine countries all around the world authored or co-authored the articles. Italy, France, Germany, and England were the most productive countries and Italy had collaboration with all top-productive countries, in particular with France and Germany.
The number and the complexity of the research fields increased progressively over time. The first part of the time span investigated has shown the development of three main research fields regarding the Apennine geology, the paleoclimate, and the seismology. The period from 2002 to 2009 was characterized by the 'entry' of agriculture, environment, vegetation, and land studies with a growth in diversification of those researches that marked out the second half of the nineties of the last century. Furthermore, the studies of the first decade of 2000s include landslide-flood investigations and other engineering-related and cultural heritage researches. From 2010 until 2017 a well-defined cluster of tephrocronology/tephrostratigraphy studies emerge with a significant increase in diversification of the agriculture/land/environment studies.
The relatively recent age of the agriculture-food/land/environment studies and the progressive evolution of these researches over the last 16 years in Basilicata might be considered a clue that these investigations, taken on the whole, will also significantly engage the scientific community in the next years. Quite similar considerations can be made for the group of earthquake-related studies/hydrogeological instability investigations/cultural heritage researches (SEISM-MLD in the maps). These bibliometric remarks can be strengthen by considering the policy that is going to be accomplished in the Region according to the European Commission for programming 2014-2020. Really, the European Commission requested to all the European regions, as a mandatory requirement for the application of the European Regional Development Fund (ERDF) Operational Program 2014-2020, to adopt the Research and Innovation Smart Specialization Strategy (RIS3) (EU Regulation no. 1303/ 2013). RIS3 of Basilicata has identified five priorities, among which there are bio-economy (including agri-food), earth observation, and cultural heritage (http://www.pofesr.basilicata. it/fesr2014-20/?page_id=60). On these priorities, the European Regional Development Fund (ERDF) Operational Program 2014-2020 of Basilicata has allocated more than 800 million Eur, most of them to be addressed to the implementation of experimental research and development projects in each priority of RIS3 (http://europa.basilicata.it/fesr/programma/pianofinanziario/).
From the outlook angle, the outcomes of this study can supply a support to scientists that are approaching in some research areas having the territory of the Basilicata region as focus. Furthermore, also due to the presence of numerous and different skilled top-tier research centres within the region, the results summarized in this article can encourage the scientific community to start new research activities by the building of interdisciplinary and transdisciplinary networks of scientists so as to further uphold the international competitiveness of research projects.
However, the readers have to be attentive to the results of this study that, likewise all bibliometric studies, have some constraints. Firstly, the unfeasibility to prepare all-inclusive search queries to search for articles might have left out some relevant articles. In addition, all the findings and discussions included here have to be strictly considered as a result of the content analysis of the two ISI databases taken as a reference that do not index some journals, so that some articles published are out of the investigations performed here. Lastly, the article supplies an overview of the main research fields related to the territory of Basilicata. In order to gain more clues and perform specific analysis, more thorough studies for each research activity are requested.