Author: Benjamin Stocker

Workflow

  1. analysis/make_tidy_cmip6.R: Make original global files tidy. Save as nested data frames, where each row is one grid cell and time series is nested in the column data. Separate files are written for each longitudinal band. Writes files: evspsbl/evspsbl_mon_CESM2_ssp585_r1i1p1f1_native_LON_[+-XXX.XXX].rds into folder defined as outdir.

  2. analysis/apply_cwd_global.R: Script for parallel applying the CWD algorithm (or anything else that operates on full time series) separately for on gridcell. Distributes by longitudinal band. Reading files written in previous step and writing files (in the example): cwd/CWD_result_LON_[+-XXX.XXX].rds into folder defined as outdir.

    src/apply_cwd_global.sh: Bash script that calls apply_cwd_global.R , to be used as an alternative and containing submission statement for HPC.

    Note: This step creates data at the original temporal resolution. Data is not collected at this stage to avoid memory limitation.

  3. analysis/get_cwd_annmax.R: Script for parallel applying function for determining annual maximum. Reading files written in previous step and writing files (in the example):

    cwd_annmax/CWD_result_LON_[+-XXX.XXX]_ANNMAX.rds

  4. collect_cwd_annmax.R: Script for collecting annual time series of each gridcell and writing the global annual data into a NetCDF file. Since annual maximum is much smaller data and can now be handled by reading all into memory. This uses the function write_nc2() from the package {rgeco}. Install it from here. Writes single file containing global data with annual resolution as NetCDF:

    /data_2/scratch/fbernhard/CMIP6ng_CESM2_ssp585/cmip6-ng/tidy/cwd_annmax_global.nc

Writes file containing global data with annual resolution: Note: Adjust paths and file name prefixes for your own case in scripts (located in subdirectory analysis/)