8 Exercises

8.1 Phenology trends and algorithms

These exercises cover all materials up to Chapter 5. A proper understanding of these chapters is required to complete these exercises. Exercises are at times formulated in long form, not simple bullet points, in order to partially mimic formal descriptions as you would find in a methods section of an academic journal, or a reference manual.

8.1.1 Physical geography and phenology

Interpret the results of Section 4.3 and the fit model as shown in the collapsed note at the end of the section.

What does the intercept indicate?
How can you interpret the slope?
Convert the established relationship with altitude, to one with temperature
- How would you go about this?

8.1.2 Temporal and spatial anomalies

For a location near the Adirondacks in the North-Eastern United States (Figure 8.1) gather phenology data on both the greenup and maximum canopy development of a location centered on 43.5\(^\circ\)N and 74.5\(^\circ\)W. Gather data for all pixels 100 km around this location for years 2001 to 2010. Similarly, download land cover data for the year 2010 for the same spatial extent, and only consider IGBP broadleaf and mixed forest classes in your analysis.

For the years 2001 - 2009 calculate the long term mean (LTM) and standard deviation (SD) of the phenology metrics. Calculate location with an early greenup for 2010 (< LTM - 1 SD) and locations with late maturity (> LTM + 1 SD).

Describe the observed patterns and speculate about the underlying reasons. In addition, download a digital elevation map for the United States (30s resolution), and compare differences in altitude (e.g. a boxplot) across locations where you do or do not see any patterns in phenology.

Figure 8.1: The location of the Adirondack mountains in the North-Eastern United States

Note

Use details in Chapter 2, Chapter 3 and Chapter 4 to answer these questions, by considering all data products and methods mentioned. Additional meta-data will need to be consulted online to provide context or data conversion instructions in some cases. Where necessary consult the relevant scientific literature.

8.1.3 Scaling the calculation of phenology metrics

In this exercise you will be required to download external data manually. First you will have to sign up for a NASA EarthData login to access the required data. The NASA EarthData login provides access to a wide range of data products.

Once signed in, download the data MCD13C1 product for the year 2022. The MCD13C1 product provides vegetation indices (VI), such as the Enhanced Vegetation Index (EVI) and Normalized Difference Vegetation Index (NDVI) data products on a down-sampled climate model grid (CMG, at 0.05\(^\circ\) or ~5km resolution). This down-sampled data product reduces the volume of data to download and process, but should allow you to explore broad continental scale patterns when calculating vegetation (or land surface) phenology metrics.

With all data downloaded:

Combine all EVI data (23 layers) into a single compressed geotiff file (write to file)
Read in the geotiff file to work faster from memory
For a first trial crop the full dataset to 26\(^\circ\)W, 20\(^\circ\)E, 31\(^\circ\)N, 70\(^\circ\)N
Apply the algorithm as outlined in Section 5.4.1 using a start of season greenup signal of 25% the seasonal EVI amplitude
Assess the performance of the algorithm across the globe and discuss its consistency.
- Where does it fail? How does it fail?
- Where necessary, inspect point locations to explore potential issues.
- If possible, address any issues by altering the original algorithm.
Could you scale this globally?
- How long would it take?
- Can you improve calculation times?

Plot the global phenology maps, and its various iterations in the R markdown notebook.

Note

All information for this exercise can be found in Chapter 3 and Chapter 5. As before, additional meta-data will need to be consulted online to provide context or data conversion instructions in some cases.

8.2 Phenology modelling

How can you improve the model used to regionally scale the results in Chapter 6?
- Provide at least three (3) ways to improve the model used.
Implement at least one of these methods
Statistically compare the results with the MODIS MCD12Q2 phenology product
- compare the data spatially
- describe why you might or might not see the same patterns
- consider that 2010 was a ‘special’ year for the north east of the US

8.3 Land-Use and Land-Cover modelling

How can you improve the model used to map LULC?
- Provide at least four (4) ways to improve the model used.
Implement at least two of these methods yourself
Demonstrate improved model skill by submitting your model results to our internal leaderboard
- the leaderboard requires you to submit a CSV file with labels
  - training data can be downloaded here
  - unlabelled out-of-sample test data can be downloaded here
  - submissions should be made as a pull request to the AGDS2 course repository
  - your CSV file with labels should be stored in data/leaderboard/fall_2023
  - the file should be named xyz_results.csv (replace xyz with your github handle)

Note

Use the appropriate APIs to download the required data.