--- title: "Building a Dynamic TOPMODEL" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{Building a Dynamic TOPMODEL} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) ``` The purpose of this vignette is to provide an outline of the steps needed to build a dynamic TOPMODEL implementation using the dynatopGIS package. # Implementation notes The dynatopGIS package implements a structured, object orientated, data flow. The steps outlined below create a `dynatopGIS` catchment object to which actions are then applied to generate a model. The `dynatopGIS` package is written using the object orientated framework provided by the `R6` package. This means that some aspects of working with the objects may appear idiosyncratic for some R users. In using the package as outlined in this vignette these problems are largely obscured, except for the call structure. However, before adapting the code, or doing more complex analysis users should read about `R6` class objects (e.g. in the `R6` package vignettes or in the Advanced R book). One particular gotcha is when copying an object. Using ```{r eval=FALSE} my_new_object <- my_object ``` creates a pointer, that is altering `my_new_object` also alters `my_object`. To create a new independent copy of `my_object` use ```{r eval=FALSE} my_new_object <- my_object$clone() ``` # Getting started The dynatopGIS packages works through a number of steps to generate a Dynamic TOPMODEL object suitable for use in with the `dynatop` package. Each step generates one or more layers which are saved as raster or shape files into the projects working directory (which is not necessarily the R working directory). A record of these layers is kept in the json format meta data file. This vignette demonstrates the use of the `dynatopGIS` package using data from the Swindale catchment in the UK. To start first load the library ```{r load_library} library("dynatopGIS") ``` For this vignette we will store the data into a temporary directory ```{r tempory_dir} demo_dir <- tempfile("dygis") dir.create(demo_dir) ``` and initialise the analysis by creating a new object specifying the location of the meta data file, which will be created if it doesn't exist. ```{r, initialisation} ctch <- dynatopGIS$new(file.path(demo_dir)) ``` # Adding catchment data The basis of the analysis is a rasterised Digital Elevation Model (DEM) of the catchment and a vectorised representation of the river network with attributes. Currently these can be in any format supported by the ```terra``` library. However, within the calculations used for sink filling, flow routing and topographic index calculations the raster DEM is presumed to be projected so that is has square cells such that the difference between the cell centres (in meters) does not alter. For Swindale the suitable DEM and channel files can be found using: ```{r, data_files} dem_file <- system.file("extdata", "SwindaleDTM40m.tif", package="dynatopGIS", mustWork = TRUE) channel_file <- system.file("extdata", "SwindaleRiverNetwork.shp", package="dynatopGIS", mustWork = TRUE) ``` Before adding either the DEM or channel a raster map of the catchment outline must be provided. This defines not only the catchment boundaries but also, if required, subcatchments, each of which must be given a unique number. The projection and resolution of this map is used in all subsequent GIS processing In this example the catchment map is generated from the DEM, which, by convention must contain a edge rows and columns containing only `NA` values. ```{r, add_catchment} dem <- terra::rast(dem_file) dem <- terra::extend(dem,1) ## pad with NA values catchment_outline <- terra::ifel(is.finite(dem),1,NA) ctch$add_catchment(catchment_outline) ``` Either the DEM or channel files can be added to the project first. In this case we add the DEM with ```{r, add_dem} ctch$add_dem(dem) ``` Adding river channel data is more complex. The `add_channel` method requires a `SpatVector` object (or a file name that can be loaded as a `SpatVect` object). Each vector object is treated as a length of river channel which requires the following properties - *name* - a label for the channel length - *endNode* - a label for the downstream end of the river length - *startNode* - a label for the upstream end of the river length - *length* - the length in meters - *area* - surface area in square meters - *width* - width of the channel - *slope* - bed slope of the channel Additional properties are currently kept but ignored with the exception of *id* which is overwritten. Since it is possible that these properties are present in a data file under different names some basic preprocessing may be required. The `convert_channel` function is designed to help with this. To illustrate this let us examine the river network for Swindale ```{r, channel_current} sp_lines <- terra::vect(channel_file) head(sp_lines) ``` Some of the main properties are present under appropriate names (startNode, endNode, length) but the remainder are missing. Also the river network is defined as a series of lines, rather then polygons. The `convert_channel` function addresses these shortcomings by - changing the names of the required properties - buffering the line objects to create polygons The `convert_channel` function takes a named vector giving the variable names to be use for the properties. If we want to carry over the identifier as the name we could call `convert_channel` with as follows: ```{r, channel_properties} property_names <- c(name="name1", endNode="endNode", startNode="startNode", length="length") chn <- convert_channel(sp_lines,property_names) ``` Since the data set for Swindale does not contain a channel width or slope default values are used and warnings issued. The river network can then be added to the project by ```{r, add_channel} ctch$add_channel(chn) ``` # Getting and plotting catchment information The `dynatopGIS` class has methods for returning and plotting the GIS data in the project. The names of all the different GIS layers stored is returned by ```{r, list_layers} ctch$get_layer() ``` These can be plotted (with or without the channel), for example ```{r, plot} ctch$plot_layer("dem", add_channel=TRUE) ``` or returned, for example ```{r, get_layer} ctch$get_layer("dem") ``` All layers are returned as `SpatRast` objects with the exception of the `channel_vect` layer which is returned as a `SpatVect` object. # Filling sinks For the hill slope to be connected to the river network all DEM cells must drain to those that intersect with the river network. The algorithm implemented in the `sink_fill` method ensures this is the case. In calling the `sink_fill` method a flow direction algorithm is specified and the resulting flow paths recorded. If subcatchments are present in the catchment map then only flow paths within the subcatchment are considered. The algorithm of used by the `sink_fill` method is iterative and the execution time of the function is limited by capping the maximum number of iterations. If this limit is reached without completion the method can call again with the "hot start" option to continue from where it finished. For Swindale, where the example DEM is already partially filled the algorithm only alters a small area near the foot of the catchment. ```{r, sink_fill} ctch$sink_fill() terra::plot( ctch$get_layer('filled_dem') - ctch$get_layer('dem'), main="Changes to height") ``` ## Determining Ordering The computational scheme in the `dynatop` package works with an ordered sequence of HRUs constructed such that the sequence moves downslope to catchment outlet. This is achieved by banding the channel reaches and hillslope cells such that the catchment outlet(s) are in band 1, those cells or reaches draining only into band 1 are in band 2 and so forth. banding is achieved by the following call ```{r, band} ctch$compute_band() ctch$plot_layer("band") ``` # Computing properties Two sets of properties are required for Dynamic TOPMODEL. The first set is those required within the evaluation of the model; gradient and contour length. The second set are those used for dividing the catchment up into Hydrological Response Units (HRUs). Traditionally the summary used for the separation of the HRUs is the topographic index, which is the natural logarithm of the upslope area divided by gradient. These are computed using the formulae in [Quinn et al. 1991](https://doi.org/10.1002/hyp.3360050106). The upstream area is computed by routing down slope with the fraction of the area being routed to the next downstream pixel being proportional to the gradient times the contour length. The local value of the gradient is computed using the average of a subset of between pixel gradients. For a normal 'hill slope' cell these are the gradients to downslope pixels weighted by contour length. In the case of pixels which contain river channels the average of the gradients from upslope pixels weighted by contour length us used. These properties are computed in an algorithm that passes over the data once in descending height. It is called as follows ```{r, calc_atb} ctch$compute_properties() ``` The plot of the topographic index shows a pattern of increasing values closer to the river channels ```{r, plot_atb} ## plot of topographic index (log(a/tan b)) ctch$plot_layer('atb') ``` Although not used in ordering the HRUs `dynatopGIS` also provides the ability to compute flow distances for the hill slope cells. The calculation of three distances is supported - *shortest flow length* - the shortest length based on the pixel flow paths to a channel - *Dominant flow length* - the distance to a channel moving in the dominant (largest fraction) flow direction from any grid cell - *Expected flow length* - the distance to the channel based on a weighted average of the down-slope flow lengths. Weights are given by the fraction of flow in each direction. The computation, in this example for the shortest flow length, is initiated with ```{r, flow_length} ctch$compute_flow_lengths(flow_routing="shortest") ``` The additional layers can be examined as expected ```{r, flow_length_plot} ctch$get_layer() ctch$plot_layer("shortest_flow_length") ``` ## Adding additional layer Properties may come in additional GIS layers. To demonstrate the addition of an additional layer we will extract the filled dem ```{r ,extract_filled} tmp <- ctch$get_layer("filled_dem") ``` then separate it into a layers representing land above and below 500m. ```{r height layer} ## T tmp <- terra::ifel(tmp<=500,NA,-999) ``` The resulting raster object can now be added to the project with ```{r, add_height_layer} ctch$add_layer(tmp, "greater_500") ctch$get_layer() ``` # Classifying into Hydrological Response Units Methods are provided for the classification of the catchment areas of similar hydrological response. The classifications generated in this process are augmented with a further distance based separation when generating a `dynatop` model (see following section). By definition each channel length is treated as belonging to a single class. To classify the hillslope two methods can be used. The `classify` method of a `dynatopGIS` allows a landscape property to be *cut* into classes. For example to cut the topographic index for Swindale into 21 classes: ```{r, atb_split} ctch$classify("atb_20","atb",cuts=20) ctch$plot_layer("atb_20") ``` Providing a single value to the cuts argument determines the number of classes. The values used to cut the variable can be extracted from the meta data with ```{r,atb_splt_get_class} ctch$get_method("atb_20") ``` The `combine_classes` method of a `dynatopGIS` allows classes to be combined in two ways, which are applied in the order shown - *pairing* - where unique combinations of classes create one new class - *burning* - where a single class is imposed upon an area To demonstrate a pairing combination consider combining the atb classes generated above with the classification provided by the distance band ```{r, atb_20_band} ctch$combine_classes("atb_20_band",c("atb_20","band")) ctch$plot_layer("atb_20_band") ``` Additionally the land greater then 500 in altitude can be burnt in with ```{r, atb_20_band_burn} ctch$combine_classes("atb_20_band_500",pairs=c("atb_20","band"),burns="greater_500") ctch$plot_layer("atb_20_band_500") ``` The each class in the combined classification the values of the classes used in the computations can be returned ```{r see_class} head( ctch$get_method("atb_20_band_500")$groups ) ``` Note that by giving the area to be burnt in a negative value when it was generated above we have ensured that the values do not clash with those generated by the cuts which (except potentially when a cut is NA) which will always be positive. # Generating a dynamic TOPMODEL A Dynamic TOPMODEL suitable for use with the ```dynatop``` package can be generated using the `create_model` method. This uses an existing classification to generate the model. The required model structure is given in the vignettes of `dynatop` package and is not described here in details. Since `dynatop` simulations make use of ordered HRUs to work downslope, a classification which used a distance layer (see earlier section) which represents the ordered downslope sequencing of the pixels is recommended. Even if a distance layer is not used in the classification one must be given to the `create_model` method, so the resulting HRUs can be ordered. **Currently only the 'band' distance metric as used below will produce valid model**. For example, in the case of the division of Swindale by topographic index into 21 classes and the bands directly the resulting model can be generated by ```{r, model_atb_split} ctch$create_model(file.path(demo_dir,"new_model"),"atb_20") ``` Looking at the files within the `demo_dir` folder ```{r, model files} list.files(demo_dir,pattern="new_model*") ``` shows that an addition raster map of the HRUs has been created in `new_model.tif` along with a file `new_model.rds` containing a model suitable for `dynatop`. The values on the map correspond to the `ìd` of the HRUs in the `dynatop` model.