Configuration File
This page describes the settings available in the configuration file for HySetter. The configuration file should be in YAML format. You can download an example configuration file here.
Project (project)
The project section is required and contains basic information about your project.
- name (required): The name of your project. Data will be stored in data_dir/name.
- data_dir (required): The directory where data will be stored.
Example:
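A minimal sketch of this section. Both values are illustrative placeholders, not defaults:

```yaml
project:
  name: demo       # hypothetical project name; data would go to data/demo
  data_dir: data   # hypothetical directory
```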
Area of Interest (aoi)
The aoi section is required and defines the area of interest for your project. You should specify only one of the following options:
- huc_ids: A list of Hydrologic Unit Code (HUC) IDs. It can be a mix of different HUC levels.
- nhdv2_ids: A list of NHDPlusV2 catchment IDs (featureid).
- gagesii_basins: A list of GAGES-II basin IDs.
- mainstem_main: NHDPlusV2 catchments belonging to the main flowlines upstream of the provided mainstem ID.
- mainstem_tributaries: NHDPlusV2 catchments belonging to the tributaries upstream of the provided mainstem ID.
- geometry_file: Path to a file containing geometry data. Can be parquet, feather, or any format that geopandas.read_file accepts.
Additional AOI options:
- nhdv2_flowlines (optional): Boolean. If true, retrieve the NHDPlusV2 flowlines within the AOI.
- streamcat_attrs (optional): A list of valid StreamCat attributes to get for flowlines within the AOI.
- nldi_attrs (optional): A list of valid NLDI attributes to get for flowlines within the AOI.
Example:
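A minimal sketch that selects the AOI by HUC ID and also requests flowlines. The HUC ID is an illustrative placeholder, and quoting it is an assumption made here so that YAML keeps the leading zero:

```yaml
aoi:
  # Exactly one AOI option is given (huc_ids); the ID is a placeholder.
  huc_ids:
    - "07080101"
  # Optional: also retrieve NHDPlusV2 flowlines within the AOI.
  nhdv2_flowlines: true
```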
Forcing Data (forcing)
The forcing section is optional and defines settings for retrieving forcing data.
- source: The data source. Options are daymet, gridmet, or nldas2.
- start_date: The start date for the data retrieval (YYYY-MM-DD format).
- end_date: The end date for the data retrieval (YYYY-MM-DD format).
- variables: A list of variables to retrieve. Valid variable names depend on the chosen source.
Example:
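A minimal sketch using Daymet. The dates are arbitrary, and quoting them is an assumption made here so that the YAML parser keeps them as strings. prcp and tmin are Daymet variable names; other sources use different names:

```yaml
forcing:
  source: daymet
  start_date: "2020-01-01"   # illustrative date range
  end_date: "2020-01-31"
  variables:
    - prcp   # precipitation
    - tmin   # daily minimum temperature
```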
Topography (topo)
The topo section is optional and defines settings for topographic data retrieval and processing.
- resolution_m: The desired resolution in meters. Use 10, 30, or 60 for faster retrieval from 3DEP's static files.
- derived_variables: A list of derived variables to compute. Options are slope, aspect, and curvature.
Example:
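A minimal sketch that requests 30 m data and two of the three derived variables. The particular choices are illustrative:

```yaml
topo:
  resolution_m: 30   # one of the resolutions served from 3DEP's static files
  derived_variables:
    - slope
    - aspect
```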
Soil Data (soil)
The soil section is optional and defines settings for soil data retrieval.
- source: The data source. Options are soilgrids or gnatsgo.
- variables: A list of variables to retrieve. Valid options depend on the chosen source.
Example:
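A minimal sketch of the section layout. The variable name below is a placeholder, not a real variable, because valid names depend on the chosen source:

```yaml
soil:
  source: soilgrids
  variables:
    - var_name   # placeholder; replace with a variable valid for the source
```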
National Land Cover Database (nlcd)
The nlcd section is optional and defines settings for retrieving NLCD data.
- cover: A list of years for land cover data.
- impervious: A list of years for impervious surface data.
- canopy: A list of years for canopy data.
- descriptor: A list of years for descriptor data.
Valid years:
- Cover, Impervious, Descriptor: 2021, 2019, 2016, 2013, 2011, 2008, 2006, 2004, 2001
- Canopy: Any year from 2011 to 2022 (inclusive)
Example:
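A minimal sketch that picks years from the valid lists above. The specific years are an illustrative choice:

```yaml
nlcd:
  cover: [2019, 2021]
  impervious: [2019]
  canopy: [2021]       # any year from 2011 to 2022 is valid for canopy
  descriptor: [2019]
```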
National Inventory of Dams (nid)
The nid section is optional and defines settings for retrieving NID data.
- within_aoi: Boolean. If true, only return dams within the defined AOIs. If false or omitted, store the full NID database.
Example:
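A minimal sketch that restricts the result to dams inside the AOIs:

```yaml
nid:
  within_aoi: true   # false or omitted would store the full NID database
```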
Streamflow Data (streamflow)
The streamflow section is optional and defines settings for retrieving streamflow data.
- start_date: The start date for data retrieval (YYYY-MM-DD format).
- end_date: The end date for data retrieval (YYYY-MM-DD format).
- frequency: The data frequency. Options are daily or instantaneous.
- within_aoi: Boolean. If true, get streamflow for all stations within the AOIs.
- use_col: A column name from the AOIs GeoDataFrame to use as the station IDs to query NWIS for streamflow. When provided, within_aoi is ignored.
Example:
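A minimal sketch that requests daily streamflow for all stations within the AOIs. The dates are arbitrary, and quoting them is an assumption made here so that they stay strings:

```yaml
streamflow:
  start_date: "2020-01-01"   # illustrative date range
  end_date: "2020-12-31"
  frequency: daily
  within_aoi: true
  # use_col is omitted here; when provided, within_aoi is ignored.
```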
This configuration file allows you to customize various aspects of data retrieval and processing for your project. Adjust the settings as needed to suit your specific requirements.
Remote Raster Data (remote_raster)
The remote_raster section is optional and defines settings for retrieving remote raster data. Any number of name/URL pairs can be specified. Each URL should point to a raster file or a VRT file that references multiple raster files. The data will be downloaded and stored in the data_dir directory. Note that each name is sanitized (stripped, lowercased, and spaces replaced with _) because it is used in filenames (e.g., data_dir/twi/twi_geom_1.tif). As with other gridded data, two additional options can be specified:
- crop: Boolean. Whether to crop the data to the geometry of the AOI. Default is true.
- geometry_buffer: Buffer distance in meters to add to the geometry of the AOI before requesting the data. Default is 0.
Example:
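A rough sketch with one name/URL pair and the two options written out with their defaults. The URL is a hypothetical placeholder, and placing the pair directly under remote_raster alongside the options is an assumption; the downloadable example configuration file shows the exact layout:

```yaml
remote_raster:
  crop: true           # default
  geometry_buffer: 0   # default, in meters
  # Hypothetical name/URL pair; the nesting level is an assumption.
  twi: https://example.com/path/to/twi.vrt
```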