ESSD - recent papers

Tracking vegetation phenology across diverse biomes using Version 3.0 of the PhenoCam Dataset

2025-11-26T23:10:10+01:00

Tracking vegetation phenology across diverse biomes using Version 3.0 of the PhenoCam Dataset
Adam M. Young, Thomas Milliman, Koen Hufkens, Keith L. Ballou, Christopher Coffey, Kai Begay, Michael Fell, Mostafa Javadian, Alison K. Post, Christina Schädel, Zakary Vladich, Oscar Zimmerman, Dawn M. Browning, Christopher R. Florian, Minkyu Moon, Michael D. SanClements, Bijan Seyednasrollah, Mark A. Friedl, and Andrew D. Richardson
Earth Syst. Sci. Data, 17, 6531–6556, https://doi.org/10.5194/essd-17-6531-2025, 2025

Vegetation phenology plays a significant role in driving seasonal patterns in land-atmosphere interactions and ecosystem productivity, and is a key factor to consider when modeling or investigating ecological and land-surface dynamics. To integrate phenology in ecological research ultimately requires the application of carefully curated and quality controlled phenological datasets that span multiple years and include a wide range of different ecosystems and plant functional types. By using digital cameras to record images of plant canopies every 30 min, pixel-level information from the visible red-green-blue color channels can be quantified to evaluate canopy greenness (defined as the green chromatic coordinate, G_CC), and how it varies in space and time. These phenological cameras (i.e., “PhenoCams”) offer a pragmatic and effective way to measure and provide phenology data for both research and education. Here, in this dataset descriptor, we present the PhenoCam dataset version 3 (V3.0), providing significant updates relative to prior releases. PhenoCam V3.0 includes 738 unique sites and a total of 4805.5 site years, a 170 % increase relative to PhenoCam V2.0 (1783 site years), with notable expansion of network coverage for evergreen broadleaf forests, understory vegetation, grasslands, wetlands, and agricultural systems. Furthermore, in this updated release, we now include a PhenoCam-based estimate of the normalized difference vegetation index (cameraNDVI), calculated from back-to-back visible and visible+near-infrared images acquired from approximately 75 % of cameras in the network, which utilize a sliding infrared cut filter. Both G_CCG_CCG_CCith potential applications to the evaluation of satellite data products, earth system and ecosystem modeling, and understanding phenologically mediated ecosystem processes. The PhenoCam V3.0 data release is publicly available for download from the Oak Ridge National Lab Distributed Active Archive Center: the source imagery used to derive phenology information is available at https://doi.org/10.3334/ORNLDAAC/2364https://doi.org/10.3334/ORNLDAAC/2389

In situ surface cloud measurement dataset from four cloud spectrometers during the Pallas Cloud Experiment (PaCE) 2022

2025-11-26T23:10:10+01:00

In situ surface cloud measurement dataset from four cloud spectrometers during the Pallas Cloud Experiment (PaCE) 2022
Konstantinos Matthaios Doulgeris, Ville Kaikkonen, Harri Juttula, Eero Molkoselkä, Anssi Mäkynen, and David Brus
Earth Syst. Sci. Data, 17, 6497–6506, https://doi.org/10.5194/essd-17-6497-2025, 2025

This data paper presents an overview of the cloud spectrometers deployed during the Pallas Cloud Experiment (PaCE) in autumn 2022, a coordinated measurement campaign in the Finnish subarctic that took place between 12 September and 15 December 2022. Four cloud spectrometers – the Cloud and Aerosol Spectrometer (CAS); the Forward Scattering Spectrometer Probe (FSSP-100); the Cloud Droplet Analyzer (CDA); and ICEMET – were operated as ground-based setups, providing high-resolution in-cloud measurements of droplet size distributions and key microphysical properties, such as number concentration (N_c), liquid water content (LWC), median volume diameter (MVD), and effective diameter (ED). The dataset is complemented by meteorological observations of temperature, humidity, wind speed, and visibility at a 1 min resolution. The measurements collected during PaCE 2022 offer valuable insights into aerosol–cloud interactions and cloud evolution in subarctic cloud systems. This dataset is suitable for researchers in cloud microphysics, atmospheric science, and climate modeling, as well as for instrument calibration and validation in future campaigns. The data can also be integrated with complementary concurrent in situ aerosol, remote sensing, UAV, and balloon-borne observations during PaCE 2022 to provide a more comprehensive understanding of cloud microphysics and atmospheric processes in the subarctic environment. The dataset is publicly available at https://doi.org/10.5281/zenodo.15045294

Harmonised boundary layer wind profile dataset from six ground-based Doppler wind lidars in a transect across Paris, France

2025-11-26T23:10:10+01:00

Harmonised boundary layer wind profile dataset from six ground-based Doppler wind lidars in a transect across Paris, France
William Morrison, Dana Looschelders, Jonnathan Céspedes, Bernie Claxton, Marc-Antoine Drouin, Jean-Charles Dupont, Aurélien Faucheux, Martial Haeffelin, Christopher C. Holst, Simone Kotthaus, Valéry Masson, James McGregor, Jeremy Price, Matthias Zeeman, Sue Grimmond, and Andreas Christen
Earth Syst. Sci. Data, 17, 6507–6529, https://doi.org/10.5194/essd-17-6507-2025, 2025

Doppler wind lidars (DWL) offer high-resolution wind profile measurements that are valuable for understanding atmospheric boundary layer (ABL) dynamics. Here six ground-based DWL, deployed in a multi-institutional effort along a 40 km transect through the centre of Paris (France), are used to retrieve horizontal wind speed and direction through the ABL at 18–25 m vertical and 1–60 min temporal resolution. Data are available for June 2022–March 2024 (three DWL) and two Intensive Observation Periods (six DWL) across 9 weeks in September 2023–December 2023. Data from all sensors are harmonised in terms of quality control, file format, as well as temporal and vertical resolutions. The quality of this DWL dataset is evaluated against in-situ measurements at the Eiffel Tower and radiosonde profiles. This unique, spatially dense, open dataset will allow urban boundary layer dynamics to be explored in process-studies, and is further valuable for the evaluation of high-resolution weather, climate, inverse and air pollution models that resolve city-scale processes. The dataset is available at https://doi.org/10.5281/zenodo.14761503

Marine Heat waves – Multiple Analysis / Definitions (MHW-MAD): A Multi-Definition Global Marine Heatwave Dataset from Satellite Sea Surface Temperature data

2025-11-26T23:10:10+01:00

Marine Heat waves – Multiple Analysis / Definitions (MHW-MAD): A Multi-Definition Global Marine Heatwave Dataset from Satellite Sea Surface Temperature data
Alexander Hayward, Nishka Dasgupta, Ronan McAdam, Mark R. Payne, Roshin P. Raj, Giulia Bonino, Sourav Chatterjee, Vincent Combes, Dimitra Denaxa, Francesco De Rovere, Pia Englyst, Veera Haapaniemi, Paul Hargous, Jacob Høyer, K. Ajith Joseph, Beatriz Lopes, Ana Oliveira, João Paixão, Fabiola Silva, Saradhy Surendran, Artemis Zegna-Rata, and Steffen Olsen
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-590,2025
Preprint under review for ESSD (discussion: open, 0 comments)
Marine heatwaves (MHWs) are prolonged anomalies of warm sea surface temperature (SST) that can disrupt marine ecosystems, physical climate processes, and human coastal activities. MHW definitions vary due to different stakeholders requirements, such as ecological scientists and climate scientists having differing yet specific thresholds and metrics. Here we introduce a new global dataset of daily MHW metrics: climatological baselines, threshold exceedances, SST anomalies, and categorical event classifications of severity, derived from the European Space Agency SST Climate Change Initiative (ESA SST CCI) climate data record (CDR; 1982–2021) version 3.0 and an extension from 2022–2024 provided as an interim climate data record (iCDR). Building on the widely used definition of MHWs, periods in which SST exceeds the local 90th percentile for 5 or more days, our dataset extends this framework by incorporating multiple baseline climatologies (including fixed 30-year periods and rolling 30-year windows, as well as the period for reanalysis 1993–2016), varied percentile thresholds (90th, 95th, 99th), and both raw and linearly detrended SST anomalies. We also implement alternative event duration criteria (minimum 10-day and 30-day persistence) to classify longer-lasting warm events. All data products are provided at daily resolution on a 0.05° (~5 km) grid, with outputs including daily climatological percentiles, SST anomalies and binary MHW flags with severity category indices. This comprehensive dataset provides a consistent foundation for detecting and analysing MHWs across time and space, enabling researchers to assess how methodological choices affect MHW characterisation. By offering multiple definitions in parallel, the dataset facilitates intercomparison studies and supports applications from climate monitoring and model evaluation to marine ecological impact assessment, thereby providing users with pre-made indices for extremes.

Multi-spatial scale assessment and multi-dataset fusion of global terrestrial evapotranspiration datasets

2025-11-25T23:10:10+01:00

Multi-spatial scale assessment and multi-dataset fusion of global terrestrial evapotranspiration datasets
Yi Wu, Chiyuan Miao, Yiying Wang, Qi Zhang, Jiachen Ji, and Yuanfang Chai
Earth Syst. Sci. Data, 17, 6445–6460, https://doi.org/10.5194/essd-17-6445-2025, 2025

Evapotranspiration (ET) is an important component of the terrestrial water cycle, carbon cycle, and energy balance. Currently, there are four main types of ET datasets: remote sensing–based, machine learning–based, reanalysis–based, and land–surface–model–based. However, most existing ET fusion datasets rely on a single type of ET dataset, limiting their ability to effectively capture regional ET variations. This limitation hinders accurate quantification of the terrestrial water balance and understanding of climate change impacts. In this study, the accuracy and uncertainty of thirty ET datasets (across all four types) are evaluated at multiple spatial scales, and a fusion dataset BMA (Bayesian model averaging)-ET, is obtained using BMA method and dynamic weighting scheme. ET from FLUXNET2015 as reference, the study recommends remote sensing- and machine learning-based ET datasets, especially Model Tree Ensemble Evapotranspiration (MTE), Penman-Monteith-Leuning (PML) and Process-based Land Surface Evapotranspiration/Heat Fluxes (PLSH), but the optimal selection depends on season and vegetation type. At the basin scale, most of ET datasets demonstrate superior performance. Relative uncertainty based on remote sensing and machine learning is low at the grid point scale. The fusion dataset BMA-ET accurately captures trends in ET, showing a global terrestrial increasing trend of 0.65 (0.51–0.78) mm yr⁻¹https://doi.org/10.5281/zenodo.15470621

In situ-measured benthic fluxes of dissolved inorganic phosphorus in the Baltic Sea

2025-11-25T23:10:10+01:00

In situ-measured benthic fluxes of dissolved inorganic phosphorus in the Baltic Sea
Astrid Hylén, Nils Ekeroth, Hannah Berk, Andy W. Dale, Mikhail Kononets, Wytze K. Lenstra, Aada Palo, Anders Tengberg, Sebastiaan J. van de Velde, Stefan Sommer, Caroline P. Slomp, and Per O. J. Hall
Earth Syst. Sci. Data, 17, 6423–6443, https://doi.org/10.5194/essd-17-6423-2025, 2025

Sedimentary recycling of phosphorus is a key aspect of eutrophication. Here, we present data on benthic fluxes of dissolved inorganic phosphorus (DIP) from the Baltic Sea, an area with a long eutrophication history. The presented dataset contains 498 individual fluxes measured in situ with three types of benthic chamber landers at 59 stations over 20 years, and data cover most of the Baltic Sea subbasins (Hylén et al., 2025, https://doi.org/10.5281/zenodo.14812160). The dataset further contains information about bottom-water dissolved oxygen (O₂) concentrations, sedimentary organic carbon (OC) content and sediment type. The DIP fluxes differ considerably between basins depending on OC loading and the level of O₂₂₂⁻¹₂

Survey data of damaged residential buildings and business premises from the 2022 record-breaking flood in the Marche region, Italy

2025-11-25T23:10:10+01:00

Survey data of damaged residential buildings and business premises from the 2022 record-breaking flood in the Marche region, Italy
Sara Rrokaj, Chiara Arrighi, Marta Ballocci, Gabriele Bertoli, Francesca da Porto, Claudia De Lucia, Mario Di Bacco, Paola Di Fluri, Alessio Domeneghetti, Marco Donà, Alice Gallazzi, Andrea Gennaro, Mohammed Hammouti, Gianluca Lelli, Sara Mozzon, Natasha Petruccelli, Elisa Saler, Anna Rita Scorzini, Simone Sterlacchini, Gaia Treglia, Debora Voltolina, Marco Zazzeri, and Daniela Molinari
Earth Syst. Sci. Data, 17, 6487–6496, https://doi.org/10.5194/essd-17-6487-2025, 2025

Accurate flood damage data are essential for developing reliable flood risk assessments and designing effective risk management strategies. However, empirical flood damage data remain limited, particularly at the object level, hindering the calibration and validation of predictive models. Existing datasets are often highly aggregated and lack the granularity required for detailed analysis. This paper presents two comprehensive, micro-scale datasets documenting flood damage to 256 buildings, comprising both residential buildings and business premises, surveyed in the aftermath of the 2022 flood event in the Marche region of Italy. The georeferenced datasets include information on hazard characteristics, buildings' vulnerability features, physical damage description across structural and non-structural components, indirect damage, and implemented mitigation measures. In addition, original survey forms are provided to support future data collections in different contexts. Datasets and survey forms are available at the link: https://doi.org/10.5281/zenodo.15591850

A comparative analysis of EDGAR and UNFCCC GHG emissions inventories: insights on trends, methodology and data discrepancies

2025-11-25T23:10:10+01:00

A comparative analysis of EDGAR and UNFCCC GHG emissions inventories: insights on trends, methodology and data discrepancies
Manjola Banja, Monica Crippa, Diego Guizzardi, Marilena Muntean, Federico Pagani, and Enrico Pisoni
Earth Syst. Sci. Data, 17, 6461–6486, https://doi.org/10.5194/essd-17-6461-2025, 2025

Tracking greenhouse gas (GHG) emissions is essential for understanding the drivers of climate change and guiding global mitigation strategies. The Emissions Database for Global Atmospheric Research (EDGAR) and submissions by Parties to the United Nations Framework Convention on Climate Change (UNFCCC) are two key sources of GHG emissions data. While EDGAR provides comprehensive and globally consistent estimates, UNFCCC submissions are based on nationally reported inventories, which adhere to specific guidelines and reflect country-specific circumstances and practices. This study presents a detailed comparison between EDGAR and UNFCCC GHG emissions inventories, focusing on G20 countries, which account for nearly 80 % of global emissions, as well as Annex I countries, including the EU27. By examining sectoral discrepancies, methodological variations, and the impact of reporting timelines, the paper identifies key areas of alignment and divergence in emissions estimates. While CO₂₄₂O estimates exhibit substantial discrepancies due to differences in methodologies, emission factors, uncertainties, and reporting practices. Our findings emphasise the need for enhanced methodological harmonization and more frequent reporting, particularly in non-Annex I countries, where limited capacity and irregular updates reduce comparability. Addressing these inconsistencies is crucial for improving transparency, aligning national and independent datasets, and strengthening climate policy decisions under the Paris Agreement (UNFCCC Secretariat, 2021b).

The MAESTRO turbulence dataset derived from the SAFIRE ATR42 aircraft

2025-11-25T23:10:10+01:00

The MAESTRO turbulence dataset derived from the SAFIRE ATR42 aircraft
Louis Jaffeux, Marie Lothon, Fleur Couvreux, Dominique Bouniol, Grégoire Cayez, Lilian Joly, Jérémie Burgalat, Cyrille De Saint-Léger, Hubert Bellec, Olivier Henry, Dyaa Chbib, Tetyana Jiang, and Sandrine Bony
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-586,2025
Preprint under review for ESSD (discussion: open, 0 comments)
The MAESTRO airborne field campaign took place between August 10 and September 10 2024 over the North Atlantic tropical ocean near the Cabo Verde Islands. Its goal was to investigate the processes that control the mesoscale organization of clouds with a payload of probes and sensors, as well as vertically and horizontally-pointing radars and lidars. A particular attention was paid to the role of coherent structures in the boundary layer and mesoscale cloud organization. This focus motivated the acquisition of high-resolution measurements of temperature and water vapor to capture turbulence dynamics in the subcloud layer. To achieve this, six hygrometers and four temperature sensors were deployed, including a new fast-rate hygrometer called FAST-WAVE. This article describes the turbulence dataset, prepared on the basis of these measurements. It consists in 25 Hz segmented time series of calibrated water vapor mixing ratio, temperature, and three-dimensional wind, their corresponding fluctuations, as well as turbulent moments, and integral length scales. In total, 40 hours of stabilized legs data were gathered in a wide range of mesoscale and local cloud conditions, with nearly 13 hours consisting of high-quality boundary-layer samples. This paper describes the methodological choices made for all the computations, calibrations, and corrections that were applied to the original measurements. The collection of NetCDF files composing this dataset is publicly available on the AERIS website.

SEEPS4ALL: an open dataset for the verification of daily precipitation forecasts using station climate statistics

2025-11-25T23:10:10+01:00

SEEPS4ALL: an open dataset for the verification of daily precipitation forecasts using station climate statistics
Zied Ben-Bouallègue, Ana Prieto-Nemesio, Angela Iza Wong, Florian Pinault, Marlies van der Schee, and Umberto Modigliani
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-553,2025
Preprint under review for ESSD (discussion: open, 0 comments)
Forecast verification is an essential task when developing a forecasting model. How well does a model perform? How does the forecast performance compare with previous versions or other models? Which aspects of the forecast could be improved? In weather forecasting, these questions apply in particular to precipitation, a key weather parameter with vital societal applications. Scores specifically designed to assess the performance of precipitation forecasts have been developed over the years. One example is the Stable and Equitable Error in Probability Space (SEEPS, Rodwell et al., 2010). The computation of this score is however not straightforward because it requires information about the precipitation climatology at the verification locations. More generally, climate statistics are key to assessing forecasts for extreme precipitation and high-impact events. Here, we introduce SEEPS4ALL, a set of data and tools that democratize the use of climate statistics for verification purposes. In particular, verification results for daily precipitation are showcased with both deterministic and probabilistic forecasts.

Measurements of water droplets in a turbulent wind tunnel

2025-11-25T23:10:10+01:00

Measurements of water droplets in a turbulent wind tunnel
Wiebke Frey, Silvio Schmalfuß, Frank Stratmann, and Dennis Niedermeier
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-658,2025
Preprint under review for ESSD (discussion: open, 0 comments)
In order to study the behaviour of cloud droplets at the cloud-clear interface, the “Solving The Entrainment Puzzle” (STEP) project examined a droplet stream in the Turbulent Leipzig Aerosol Cloud Interaction Simulator (LACIS-T). LACIS-T comprises two particle free air streams, that are turbulently mixed, and during the experiment one air stream was resembling in-cloud conditions, whereas the other air stream was set to out-of-cloud conditions. A droplet stream was injected by a droplet generator into the mixing plane of the two air streams. Droplet size distributions were observed with a phase Doppler anemometer at various levels in the measurement section of LACIS-T, corresponding to different residence times of the droplets in the turbulent environment. Additionally, observations were made using different flow speeds in the two air streams to create shear flows in the wind tunnel. The experiment was accompanied by computational fluid dynamics simulations to provide a full 3d representation of meteorological fields and turbulence parameters.This manuscript provides a description of the laboratory settings and instrumentation, the experimental design, the simulations, and a general overview of the data. We invite the scientific community for joint data analysis and numerical studies using the data which is freely available from the Eurochamp Data Centre, see Table 2 in the Data availability section for details.

HIStory of LAND transformation by humans in South America (HISLAND-SA): annual and 1 km gridded data for soybean, maize, wheat, and rice (1950–2020)

2025-11-24T23:10:10+01:00

HIStory of LAND transformation by humans in South America (HISLAND-SA): annual and 1 km gridded data for soybean, maize, wheat, and rice (1950–2020)
Binyuan Xu, Hanqin Tian, Shufen Pan, Xiaoyong Li, Ran Meng, Óscar Melo, Anne McDonald, María de los Ángeles Picone, Xiao-Peng Song, Edson Severnini, Katharine G. Young, and Feng Zhao
Earth Syst. Sci. Data, 17, 6353–6377, https://doi.org/10.5194/essd-17-6353-2025, 2025

South America is a global hotspot for land use and land cover (LULC) change, marked by dramatic agricultural land expansion and deforestation. While previous studies have documented land use and land cover changes in South America over recent decades, there is still a lack of spatially explicit and time-series maps of crop types that capture shifts in crop distribution. Therefore, developing high-resolution, long-term, and crop-specific datasets is crucial for advancing our understanding of human–environment interactions and for assessing the impacts of agricultural activities on carbon and biogeochemical cycles, biodiversity, and climate. In this study, we integrated multi-source data, including high-resolution remote sensing data, model-based data, and historical agricultural census data, to reconstruct the historical dynamics of four major commodity crops (i.e., soybean, maize, wheat, and rice) in South America at an annual timescale and 1 km × 1 km spatial resolution from 1950 to 2020. The results showed that soybean and maize cultivation expanded rapidly in South America by encroaching on other vegetation (i.e., forest, pasture/rangeland, and unmanaged grass/shrubland) over the past 70 years, whereas wheat and rice areas remained relatively stable. Specifically, soybean is one of the most dramatically expanded crops, increasing from essentially zero in 1950 to 48.8 Mha in 2020, resulting in a total loss of 23.92 Mha of other vegetation. In addition, the area of maize increased by a factor of 2.1 from 12.7 Mha in 1950 to 26.9 Mha in 2020. The newly developed crop type dataset provides important insights for assessing thehttps://doi.org/10.5281/zenodo.14002960

SHEDIS-Temperature: linking temperature-related disaster impacts to subnational data on meteorology and human exposure

2025-11-24T23:10:10+01:00

SHEDIS-Temperature: linking temperature-related disaster impacts to subnational data on meteorology and human exposure
Sara Lindersson and Gabriele Messori
Earth Syst. Sci. Data, 17, 6379–6403, https://doi.org/10.5194/essd-17-6379-2025, 2025

International databases of disaster impacts are crucial for advancing disaster risk research, particularly as climate change intensifies the frequency and intensity of many natural hazards – including temperature extremes. However, many widely-used disaster impact databases lack information on the physical dimension of the hazards associated with an impact, and on the exposure to such hazards. This hinders analysing drivers of severe disaster outcomes. To bridge this knowledge gap, we present SHEDIS-Temperature, a dataset that provides Subnational Hazard and Exposure information for temperature-related DISaster impact records (https://doi.org/10.7910/DVN/WNOTTC; Lindersson and Messori, 2025). This open-access dataset links temperature-related impact records from the Emergency Events Database (EM-DAT) with subnational data on their locations, associated meteorological time series, and population maps. SHEDIS-Temperature provides hazard and exposure data for 2835 subnational locations associated with 382 disaster records from 1979–2018 in 71 countries. Detailed hazard metrics, derived from 0.1° 3 hourly data, encompass absolute indicators, such as the heat stress measure apparent temperature accounting for humidity and wind speed, as well as percentile-based indicators of when and where temperatures exceeded local thresholds. Population exposure data include annual population figures for impacted subnational administrative units and person-days of exposure to threshold-exceeding temperatures. Outputs are available at grid-point level as well as zonally aggregated to administrative subdivision units, and disaster-record levels. Technical validation against a station-based dataset indicated minor systematic biases – slightly overestimated minimum and underestimated maximum temperatures – but confirmed high consistency between datasets, with correlation coefficients ≥0.9≤2 °C. By providing comprehensive attributes across the hazard-exposure spectrum, SHEDIS-Temperature supports interdisciplinary research on past temperature-related disasters, offering valuable insights for future risk mitigation and resilience strategies.

Hourly precipitation fields at 1 km resolution over Belgium from 1940 to 2016 based on the analog technique

2025-11-24T23:10:10+01:00

Hourly precipitation fields at 1 km resolution over Belgium from 1940 to 2016 based on the analog technique
Elke Debrie, Jonathan Demaeyer, and Stéphane Vannitsem
Earth Syst. Sci. Data, 17, 6405–6421, https://doi.org/10.5194/essd-17-6405-2025, 2025

High-resolution gridded precipitation data is scarce, especially at time intervals shorter than daily. However hydrological applications for example benefit from a finer temporal resolution of rainfall information. In this context, we introduce an hourly precipitation dataset for Belgium, featuring a resolution of 1 km. An hourly high-resolution gridded precipitation product over Belgium can provide valuable insights into the dynamics of both short-term and long-term rainfall events, which can be used for wide-ranging applications such as flash flood forecasting and warning systems, studying precipitation extremes and trends, validating weather and climate models or detecting changes in precipitation patterns due to climate change. Similar products such as EURADCLIM (Europe) (Overeem et al., 2023)(Winterrath et al., 2018), both radar-based gauge-adjusted datasets, have already been created and published. Both datasets are high spatial resolution dataset (2 and 1 km, respectively).

A high resolution precipitation grid of hourly precipitation data for Belgium covering the period from 1940 to 2016 using the analog technique, is created. The analogs are sampled from the period 2017–2022 for which high resolution radar data precipitation fields are available. The initial step involves identifying the criteria, i.e. atmospheric parameters such as atmospheric pressure, temperature and humidity, that can be used to determine analogous days. These atmospheric parameters are obtained from the ERA5 observational data provided by the European Centre for Medium-Range Weather Forecasts (ECMWF). In a second step, hourly precipitation data for suitable analog days are extracted from our radar database, and then used to create the high resolution grid of hourly precipitation for Belgium from 1940 to 2016. Data from rain gauges on the Belgian terrain were used for validation of the candidate precipitation analogs.The dataset for this project lists the top 25 analog days for 1940–2016 based on similarities in weather patterns. The analogs are ranked based on how closely they match to their target day.The database is relying on the Zarrmedian field datasethttps://doi.org/10.5281/zenodo.14965710) (Debrie et al., 2025).

A Global Dataset of Forest Disturbance Regimes Derived from Satellite Biomass Observations

2025-11-24T23:10:10+01:00

A Global Dataset of Forest Disturbance Regimes Derived from Satellite Biomass Observations
Siyuan Wang, Hui Yang, Sujan Koirala, Maurizio Santoro, Ulrich Weber, Claire Robin, Felix Cremer, Matthias Forkel, Markus Reichstein, and Nuno Carvalhais
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-670,2025
Preprint under review for ESSD (discussion: open, 0 comments)
Forests play a central role in the global carbon cycle by serving as critical carbon sinks for atmospheric CO₂. Yet, the stability and continued capacity of these sinks are increasingly threatened by a growing number of disturbances. Accurately representing the stochastic nature of disturbance remains a major challenge and a key source of uncertainty in our understanding of carbon cycle dynamics. This study presents a novel framework for deriving disturbance regimes characterized by extent (μ), frequency (α), intensity (β), as well as background mortality (K_b) directly from landscape features of high-resolution satellite biomass data. These regimes reflect the characteristics of long term disturbances at the landscape scale rather than the properties of any single event. Our analysis inverts the forward model framework developed by Wang et al. (2024), which used a machine learning model trained on a massive synthetic dataset of over 8 million forward model simulations to link known disturbance regimes to spatial biomass patterns. Instead of predicting patterns from regimes, we use observed satellite biomass patterns to infer the underlying disturbance regimes. To ensure robustness, we first identified the optimal spatial resolution for aggregating both simulation and satellite data, minimizing discrepancies in feature value ranges and reducing extrapolation risk. Using this framework, we produced the first globally consistent, observationally constrained dataset of forest disturbance regime parameters and their associated uncertainties, provided at both a 25 × 25 km² Copernicus Electronic Production Support Office 2025-11-24T23:10:10+01:00 2025-11-24T23:10:10+01:00

BEACH: Barbados and Eastern Atlantic Combined High-altitude dropsonde datasets

2025-11-24T23:10:10+01:00

BEACH: Barbados and Eastern Atlantic Combined High-altitude dropsonde datasets
Helene Marie Gloeckner, Theresa Mieslinger, Nina Robbins-Blanch, Geet George, Lukas Kluft, Tobias Kölling, Sandrine Bony, Julia Windmiller, and Bjorn Stevens
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-647,2025
Preprint under review for ESSD (discussion: open, 0 comments)
As part of the ORCESTRA field campaign in August and September 2024, 1191 dropsondes were deployed over the Eastern and Western Atlantic ITCZ from the HALO aircraft coordinated by the PERCUSION and MAESTRO subcampaigns. Here, we describe the hierarchy and processing of the resulting Barbados and Eastern Atlantic Combined High-altitude (BEACH) dropsonde datasets. The Level 0 dataset contains measured meteorological variables, such as relative humidity (RH), temperature (T), pressure (p), eastward (u), and northward (v) wind data as output by the AVAPS system. The corresponding ASPEN quality-controlled data is called Level 1. Level 2 adds further measurement-specific quality control flags. Level 3 builds the core of BEACH including all quality controlled dropsonde profiles interpolated to a common 10 m altitude grid and concatenated into a single dataset. We further derive mesoscale vorticity, divergence, and vertical velocities from 87 circular flight patterns in Level 4 using the regression method. These area-averaged variables will guide our understanding of mesoscale processes acting within the ITCZ, one of the main goals of ORCESTRA. All data levels are openly available on IPFS, while the processing code is made public on GitHub.

History of anthropogenic Phosphorus inputs (HaPi) to the terrestrial biosphere from 1860 to 2020

2025-11-24T23:10:10+01:00

History of anthropogenic Phosphorus inputs (HaPi) to the terrestrial biosphere from 1860 to 2020
Zihao Bian, Hao Shi, Rui Li, Fei Lun, Francesco Tubiello, Nathaniel Mueller, Shiyu You, Rong Hao, Jiageng Ma, Longhui Li, Changchun Huang, Bing He, Yuanzhi Yao, and Hanqin Tian
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-507,2025
Preprint under review for ESSD (discussion: open, 0 comments)
Nitrogen and Phosphorus (P) are essential nutrients for sustaining life on Earth and have been increasingly applied in global agriculture to meet the growing demand for food production. Quantifying the spatial and temporal dynamics of nutrient inputs to the terrestrial biosphere is crucial for analyzing nutrient flows in crop-livestock systems, managing nutrient resources sustainably, and mitigating nutrient-related environmental impacts. Here, built upon our previous work mapping global nitrogen inputs (History of anthropogenic Nitrogen inputs, HaNi), this study presents the History of anthropogenic P inputs (HaPi) dataset, a comprehensive quantification of human-driven P fluxes to terrestrial ecosystems. HaPi covers the period from 1860 to 2020 and has a spatial resolution of 5 arc-minutes (about 10 km at the equator) with an annual time-step. This harmonized dataset integrates seven components, including P fertilizer application on croplands and pastures, manure P application on croplands and pastures, manure P deposition on pastures and rangelands, and atmospheric P deposition. The results reveal that the global total P input increased from 3.8 Tg yr^–1in the 1860s to 40.9 Tg yr^–1https://doi.org/10.6084/m9.figshare.29930279.v1 Copernicus Electronic Production Support Office 2025-11-24T23:10:10+01:00 2025-11-24T23:10:10+01:00

DANRA: The Kilometer-Scale Danish Regional Atmospheric Reanalysis

2025-11-24T23:10:10+01:00

DANRA: The Kilometer-Scale Danish Regional Atmospheric Reanalysis
Xiaohua Yang, Carlos Peralta, Bjarne Amstrup, Kasper Stener Hintz, Søren Borg Thorsen, Leif Denby, Simon Kamuk Christiansen, Hauke Schulz, Sebastian Pelt, and Mathias Schreiner
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-610,2025
Preprint under review for ESSD (discussion: open, 0 comments)
The DANish regional atmospheric ReAnalysis (DANRA) is a novel high-resolution (2.5 km) reanalysis dataset covering Denmark and its surrounding regions over a 34-year period (1990–2023). Denmark’s complex coastline, with over 400 islands and an extensive 7,400 km coastline, means that most municipalities experience mixed land-sea variability. This complexity requires a regional climate reanalysis that can resolve fine-scale coastal and inland features, as well as their impact on climate variability. DANRA is based on the HARMONIE-AROME Numerical Weather Prediction (NWP) model and assimilates a comprehensive set of observations, with a particular focus on Denmark. Compared to global reanalyses such as the ECMWF Reanalysis v5 (ERA5), DANRA demonstrates superior performance in representing essential climate variables, including near-surface weather parameters during both extreme and ordinary conditions. We illustrate these improvements in the representation of several extreme weather cases over Denmark, such as the December 1999 hurricane-force storm, the July 2022 national temperature record, and the August 2007 cloudburst in South Jutland. DANRA is made to support climate adaptation, impact modelling, and the training of next-generation data-driven atmospheric forecasting models. DANRA is distributed as Zarr dataset freely accessible from an object store (doi.org/10.5281/zenodo.17294179), maximizing its usability for climate adaptation, impact modelling, and data-driven research.

Tethered balloon-borne measurements to characterise the evolution of the Arctic atmospheric boundary layer at Station Nord

2025-11-24T23:10:10+01:00

Tethered balloon-borne measurements to characterise the evolution of the Arctic atmospheric boundary layer at Station Nord
Henning Dorff, Holger Siebert, Komal Navale, André Ehrlich, Joshua Müller, Michael Schäfer, Fan Wu, and Manfred Wendisch
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2025-651,2025
Preprint under review for ESSD (discussion: open, 0 comments)
We present a comprehensive balloon-borne measurement dataset collected during a dedicated Arctic observation campaign conducted from 19 March to 18 April 2024 in the transition from polar night to polar day at the Villum Research Station (Station Nord, STN, Greenland). The objective of the observations was to characterise the temporal evolution of the Arctic atmospheric boundary layer (ABL), focusing on key transition periods, including cloud development, low-level jet evolution, and day to night shifts. Data were collected by the Balloon-bornE moduLar Utility for profilinG the lower Atmosphere (BELUGA) tethered-balloon system performing in-situ measurements of temperature, humidity, wind speed, turbulence, and thermal infrared irradiance from the surface to several hundred meters altitude, with frequent profiling in high vertical resolution. Twenty-eight research flights delivered more than 300 profiles, with up to 8 profiles per hour, complemented by daily radiosonde launches. This paper specifies the BELUGA instrumentation at STN, data processing procedures, and the publicly available Level-2 data (BELUGA and radiosonde), provided in instrument-separated data subsets listed in a data collection (https://doi.pangaea.de/10.1594/PANGAEA.986431). One major application of the data is to evaluate different model types (such as numerical weather prediction, single-column, large-eddy simulations) in representing processes controlling the Arctic ABL. To prepare such evaluations, we give an overview of the observations, environmental conditions during the campaign, and highlight specific events that are valuable for model comparison. We introduce an event in which temperature rates influence the ABL inversion, radiative heating-rate profiles associated with transitions between cloudy and cloud-free conditions, and an observed Arctic low-level jet compared with reanalysis, offering insights into the Arctic ABL evolution.

The Greenland Ice-Marginal Lake Inventory Series from 2016 to 2023

2025-11-21T23:10:10+01:00

The Greenland Ice-Marginal Lake Inventory Series from 2016 to 2023
Penelope How, Dorthe Petersen, Kristian K. Kjeldsen, Katrine Raundrup, Nanna B. Karlsson, Alexandra Messerli, Anja Rutishauser, Jonathan L. Carrivick, James M. Lea, Robert S. Fausto, Andreas P. Ahlstrøm, and Signe B. Andersen
Earth Syst. Sci. Data, 17, 6331–6351, https://doi.org/10.5194/essd-17-6331-2025, 2025

Ice-marginal lakes form at the edge of the Greenland Ice Sheet and its surrounding peripheral glaciers and ice caps (PGIC), where outflowing glacial meltwater is trapped by a moraine, or by the ice itself, and create a reservoir that is in contact with the adjacent ice. While glacial meltwater is typically assumed to flow directly into the ocean, ice-marginal lakes temporarily store a portion of this runoff, influencing glacier dynamics and ablation, ecosystems, and downstream hydrology. Their presence, and change in abundance and size, remain under-represented in projections of sea level change and glacier mass loss. Here, we present an eight-year (2016–2023) inventory of 2918 automatically classified ice-marginal lakes (≧0.05 km²) across Greenland, tracking changes in lake abundance, surface extent, and summer surface temperature over time. Fluctuations in lake abundance were most pronounced at the north (22 %) and northeast (14 %) PGIC margins and the southwest Ice Sheet margin (8 %). Over the study period, an increase in surface lake area was evident at 283 lakes, a decreasing trend was evident at 240 lakes, and 1373 remained stable (±0.05 km²). The northeast region contained the largest lakes, with a median size of 0.40 km²km²°C°C−809±5 %. Surface temperature estimates showed strong agreement with in situ measurements (r²=0.87, RMSE =1.68 °C, error ±1.2 °C). This dataset provides a crucial foundation for quantifying meltwater storage at ice margins and refining sea level contribution projections while supporting research on glacier-lake interactions, Arctic ecology, and environmental management. The inventory series is openly accessible on the GEUS Dataverse (https://doi.org/10.22008/FK2/MBKW9N, How et al., 2025) with full metadata and documentation, and a reproducible processing workflow.