Landscape processes are critical components of soil formation and play important roles in determining soil properties and spatial structure in landscapes. We propose a new approach using stepwise principal component regression to predict soil redistribution and soil organic carbon across various spatial scales.
Landscape topography is a critical factor affecting soil formation and plays an important role in determining soil properties on the earth surface, as it regulates the gravity-driven soil movement induced by runoff and tillage activities. The recent application of Light Detection and Ranging (LiDAR) data holds promise for generating high spatial resolution topographic metrics that can be used to investigate soil property variability. In this study, fifteen topographic metrics derived from LiDAR data were used to investigate topographic impacts on redistribution of soil and spatial distribution of soil organic carbon (SOC). Specifically, we explored the use of topographic principal components (TPCs) for characterizing topography metrics and stepwise principal component regression (SPCR) to develop topography-based soil erosion and SOC models at site and watershed scales. Performance of SPCR models was evaluated against stepwise ordinary least square regression (SOLSR) models. Results showed that SPCR models outperformed SOLSR models in predicting soil redistribution rates and SOC density at different spatial scales. Use of TPCs removes potential collinearity between individual input variables, and dimensionality reduction by principal component analysis (PCA) diminishes the risk of overfitting the prediction models. This study proposes a new approach for modeling soil redistribution across various spatial scales. For one application, access to private lands is often limited, and the need to extrapolate findings from representative study sites to larger settings that include private lands can be important.
Soil redistribution (erosion and deposition) exerts significant impacts on soil organic carbon (SOC) stocks and dynamics. Increasing efforts have been devoted to investigating how SOC is detached, transported, and deposited over the landscape1,2,3. Carbon (C) sequestration and SOC distribution are influenced by gravity-driven soil movement induced by water erosion4,5,6. In cultivated fields, soil translocation by tillage is another important process contributing to C redistribution7,8,9. Tillage erosion causes a considerable net downslope movement of soil particles and leads to a within-field soil variation10. Both water and tillage erosion are significantly affected by landscape topography, which determines the locations of erosional and depositional sites11. Therefore, effective soil erosion regulation and C dynamic investigation in agricultural lands calls for a better understanding of topographic controls on soil erosion and movements.
Several studies have investigated the impacts of topography on soil redistribution and associated SOC dynamics9,12,13,14,15,16,17. Van der Perk et al.12 reported that topographic factors explained 43% of variability in soil redistribution. Rezaei and Gilkes13 found higher SOC in soils on a shady aspect, due to lower temperatures and less evaporation when compared to other aspects in rangelands. Topography may have more significant impacts on soil redistribution in agricultural lands with traditional tillage treatment than those with minimum tillage, due to the interactions between landforms and tillage practices9. However, these findings were primarily derived from field observations, which present difficulties in investigating soil properties at a broader spatial scale. There is a pressing need to develop new strategies to effectively understand spatial patterns of soil properties at watershed and regional scales.
The objective of this study is to develop efficient models to simulate soil redistribution and SOC distribution. Topography-based models using topographic metrics as predictors have been developed to quantify soil erosion and deposition processes. Compared with empirical- or process-based erosion models that employed discrete field samplings to simulate soil erosion18,19, topography-based models could be developed based on topographic information derived from digital elevation models (DEMs) with high resolutions. This approach allows for continuous soil property simulations at the watershed or regional scale. In the past several decades, accuracy of topographic information has substantially improved, with increasing availability of high resolution remotely sensed data. Although previous studies have employed topography-based models to simulate soil properties12,20,21,22, most of these investigations used a single topographic metric or single category of topographic metrics (local, non-local, or combined topographic metrics), which may not have sufficiently explored topographic impacts on soil microbial activity. Therefore, to gain a better understanding of topography controls on soil erosion and C dynamics, we examined a comprehensive set of topographic metrics including local, non-local, and combined topographic metrics and developed multi-variable topography-based models to simulate soil property dynamics. Applications of these models are expected to provide scientific support for better soil erosion control and agricultural land management.
Topographic metrics are generally categorized into one of three categories: a) local topographic metrics, b) non-local topographic metrics, or c) combined topographic metrics. Local topographic metrics refer to local features of one point on the land surface. Non-local topographic metrics refer to the relative locations of selected points. Combined topographic metrics integrate local and non-local topographic metrics. A set of topographic metrics affecting soil erosion and deposition were used in this study to investigate the topographic controls on soil movement and C stocks (Table 1). Specifically, we used four local topographic metrics [slope, profile curvature (P_Cur), plan curvature (Pl_Cur), general curvature (G_Cur)], seven non-local topographic metrics [flow accumulation (FA), topographic relief, positive openness (POP), upslope slope (UpSl), flow path length (FPL), downslope index (DI), catchment area (CA)], and three combined topographic metrics [topographic wetness index (TWI), stream power index (SPI), and slope length factor (LS)].
1. Topographic Analyses
2. Field Data Collection
3. Topography-Based Model Development
We used the Walnut Creek Watershed (WCW) as a testbed to assess feasibility of topography-based models in investigating soil redistribution and SOC dynamics. The watershed is in Boone and Story counties within the state of Iowa (41°55'-42°00'N; 93°32'-93°45'W) with an area of 5,130 ha (Figure 2). Croplands is the dominant land use type in the WCW, with a relatively flat terrain (mean 90 m, topographic relief 2.29 m). Chisel plowing, disking, and harrowing operations are the principal tillage practices in the crop fields26,27; however, tillage directions vary due to differences in management practices.
Four hundred and sixty crop field locations were randomly selected to derive topographic information in the WCW (Figure 2). 100 out of the 460 locations, including two 300 m transects (each have 9 sampling locations), were selected to conduct field samplings and for analysis of SOC and soil redistribution levels. In addition, two small-scale field sites with topographic landscape, soil types, and tillage practices similar to the WCW were selected for more intensive samplings. At each small-scale field site, a 25 × 25 m grid cell was created, and 230 sampling locations were located at grid nodes (Figure 3). Topographic metrics and soil property information were calculated for the 230 locations.
The topographic metrics in the WCW were generated following the above protocol. The WCW is characterized with low-to-moderate topography (elevation ranging from 260 to 325 m) with a relative low slope (ranging from 0 to 0.11 radian), upslope slope (0 to 0.09 m), and moderate curvatures (profile curvature: -0.009 to 0.009 m-1, plan curvature: -0.85 to 0.85 m-1, general curvature: -0.02 to 0.02 m-1). The vertical elevations of DEMs were enlarged 100 times to increase the distinguishability of the relatively low field-scale relief found in the WCW for creating the positive openness metrics (POP100). After conversion, the range of positive openness increased from 0.08 radians (POP: 1.51-1.59 radians) to 0.86 radians (POP100: 0.36-1.22 radians).
For the topographic relief, we generated seven relief maps with following radiuses: 7.5 m, 15 m, 30 m, 45 m, 60 m, 75 m, and 90 m. Two relief principal components were selected based on results of PCA on the seven relief variables. The first showed coarse resolution relief variation with relief45m as the main variable. We defined this component as the large-scale relief (LsRe). The second component, which was highly correlated with relief7.5m and presented fine resolution relief variation, was defined as the small-scale relief (SsRe).
Results of correlation analyses between topographic metrics and SOC density/soil redistribution are presented in Table 2. The TWI and LsRe showed the highest correlations with SOC density and soil redistribution rates, respectively. Spatial patterns of the two metrics are presented in Figure 4. Details of the TWI and LsRe can be better observed from the transect area. Both metrics showed high values in depressional area and low values in sloping and ridge areas. However, differences between the two metrics occurred in ditch areas, where the TWI exhibited extremely high values but the values of LsRe were not different from adjacent areas.
After generating the fifteen topographic metrics, we used PCA on these topographic variables over the 460 sampling sites in the WCW. The first seven topographic principal components (TPCs) that explained more than 90% variability of the whole topographic dataset were selected. Five TPCs that were final selected to build topography-based models are listed in Table 3. For the first principal component (TPC1), G_Cur showed the highest loading. Slope, TWI, Upsl, and LS_FB were the most important metrics in TPC2, with loadings larger than 0.35. In the TPC3, FA, SPI, and CA were important metrics, with loadings of 0.482, 0.460, and 0.400, respectively. FPL (-0.703) and Pl_Cur (0.485) were the most important in the TPC6. The main metrics with high loadings in the TPC7 were SsRe (0.597), DI (0.435), FPL (0.407), and Pl_Cur (0.383).
Collinearity of topographic variable was checked by examining VIF. Of the 15 metrics, slope, TWI, and G_Cur were removed due to the high VIFs. Based on soil redistribution rates and carbon density data from sites 1 and 2, SOLSR models were developed using all 15 metrics (SOLSRf) and the 12 metrics with collinear covariate removed (SOLSRr) (Table 4). Generally, over 70% and 65% of variability in SOC density and soil redistribution rates were explained by the SOLSRf models, respectively. For the models with collinear covariate removed (SOLSRr), simulation efficiencies were slightly lower than SOLSRf models (68% for SOC density and 63% for soil redistribution). NSEs were slightly lower and RSR were slightly higher in SOLSRr models than in SOLSRf models.
For SPCR models, similar simulation efficiencies as SOLSRr are observed in Table 4. However, fewer independent variables were selected in SPCR models (less than 5 variables) than the SOLSRf and SOLSRr models (more than 6 variables). TPCs 1, 2, 3, and 7 were selected as the independent variable combinations for the SOC model and TPCs 1, 2, 3, 6, and 7 were selected as the combination for the soil redistribution model.
We found that the SPCR models had the best predictions and the SOLSRr models showed the poorest performances at the watershed scale. The coefficients of determination (r2) by comparing SOC density prediction to observation increased from: 1) 0.60 in SOLSRf and 0.52 in SOLSRr to 0.66 in SPCR, and 2) NSE increased from 0.21 in SOLSRf and 0.16 in SOLSRr to 0.59 in SPCR; while RSR reduced from 0.87 in SOLSRf and 0.91 in SOLSRr to 0.64 in SPCR. Soil redistribution rate prediction in SPCR accounted for 36% of the variability in the measured variable and was higher than the predictions by SOLSRf (34%) and SOLSRr (0.35%). A higher NSE and lower RSR in SPCR (NSE = 0.33, RSR = 0.82) compared to SOLSRf (NSE = 0.31, RSR = 0.83) and SOLSRr (NSE = 0.32, RSR = 0.82) also demonstrated a better performance in soil redistribution rate simulation by SPCR.
According to the model performance evaluations, SPCR models were selected to generate SOC density and soil redistribution rate maps at the watershed scale. The maps revealed consistent patterns between model simulations and field measurements (Figure 5). The high consistencies between simulations and observations were more evident along the transects. Both SOC density and soil redistribution rates showed high correlations with landscape topography. High values of SOC density can be found in footslope and depositional areas, where soil deposition occurred, while low values of SOC density were observed in sloping areas, where soil erosion took place.
Figure 1: The Slope, Aspect, Curvature module in the System for Automated Geoscientific Analysis (SAGA). The polygons show the locations of study areas. Please click here to view a larger version of this figure.
Figure 2: Location of Walnut Creek Watershed and sampling sites in the watershed (Iowa). This figure was adapted from previous work17. Please click here to view a larger version of this figure.
Figure 3: Location of sampled sites a) 1 and b) 2 (z-axis 15x elevation). This figure was adapted from previous work17. Please click here to view a larger version of this figure.
Figure 4: Topographic metric maps. (a) Topographic wetness index (TWI) and (b) large-scale topographic relief (LsRe) in the Walnut Creek Watershed and transect area (z-axis 15 x elevation). Please click here to view a larger version of this figure.
Figure 5: Soil redistribution rate (t ha-1 year-1) maps and SOC density (kg m-2) maps. Shown are soil redistribution maps (a) within the Walnut Creek Watershed and (b) along two transects. Shown are SOC density (kg m-2) maps (c) within the Walnut Creek Watershed and (d) along two transects using the stepwise principal component analysis models (z-axis 15x elevation). Please click here to view a larger version of this figure.
Variables | Significance |
Slope (radian) | Runoff velocity, soil water content28,29 |
Profile Curvature (m-1) | Flow acceleration, soil erosion, deposition rate11,30 |
Plan Curvature (m-1) | Flow convergence and divergence, soil water content30 |
General Curvature (m-1) | Runoff velocity , soil erosion, deposition29 |
Flow accumulation | Soil water content, runoff volume20 |
Topographic Relief (m) | Landscape drainage characteristics, runoff velocity and acceleration21,31 |
Positive Openness (radian) | Landscape drainage characteristics , soil water content32 |
Upslope Slope (m) | Runoff velocity33,34 |
Flow Path Length (m) | Sediment yield, erosion rate35 |
Downslope Index (radian) | Soil water content36 |
Catchment Area (m2) | Runoff velocity and volume33,37 |
Topographic Wetness Index | Soil moisture distribution28,38,39 |
Stream Power Index | Soil erosion, Convergence of flow40 |
Slope Length Factor | Flow convergence and divergence28,40 |
Table 1: Significances of selected topographic metrics.
Slope | P_Cur | Pl_Cur | G_Cur | FA | LsRe | SsRe | POP | Upsl | FPL | DI | CA | TWI | SPI | LS_FB | ||
(radian) | (m-1) | (m-1) | (m-1) | (m) | (m) | (radian) | (m) | (m) | (°) | (m2) | ||||||
SOC | -0.687 | -0.159 | -0.333 | -0.288 | 0.165 | 0.698 | -0.171 | -0.451 | -0.315 | 0.499 | 0.413 | 0.588 | 0.735 | 0.165 | -0.453 | |
***,† | ** | *** | *** | *** | ***,† | *** | *** | *** | *** | *** | ***,† | ***,‡ | *** | *** | ||
SR | -0.65 | -0.205 | -0.274 | -0.282 | 0.156 | 0.687 | -0.099 | -0.427 | -0.217 | 0.487 | 0.361 | 0.565 | 0.647 | 0.156 | -0.438 | |
***,† | *** | *** | *** | ** | ***,‡ | * | *** | *** | *** | *** | ***,† | ***,† | *** | *** | ||
P_Cur, Pl_Cur, and G_Cur are profile curvature, plan curvature and general curvature, respectively; FA is flow accumulation; RePC1 and RePC2 are topographic relief component 1 and 2, respectively; POP100 is positive openness; Upsl is upslope slope; FPL is flow path length; DI is downslope index; CA is catchment area; TWI is topographic wetness index; and SPI is stream power index; and LS_FB is slope length factor (field based). | ||||||||||||||||
* P < 0.05, ** P < 0.005, *** P < 0.0001. | ||||||||||||||||
†Correlation coefficient >0.5, ‡Highest correlation coefficient for each soil property. |
Table 2: Spearman's rank correlation (n = 560) between selected topographic metrics and soil organic carbon (SOC) density and soil redistribution rates (SR).
TPC1(25%) | TPC2(24%) | TPC3(14%) | TPC6(5%) | TPC7(4%) | |
Slope | 0.062 | 0.475† | -0.035 | -0.013 | -0.183 |
P_Cur | -0.290 | 0.000 | 0.346 | -0.070 | -0.002 |
Pl_Cur | -0.283 | 0.107 | -0.001 | 0.485† | 0.383† |
G_Cur | -0.353† | 0.054 | 0.275 | 0.025 | 0.100 |
FA | 0.297 | -0.042 | 0.482† | 0.179 | 0.131 |
LsRe | 0.309 | -0.193 | -0.237 | 0.113 | -0.116 |
SsRe | 0.234 | 0.266 | -0.118 | 0.084 | 0.597† |
POP100 | -0.330 | 0.092 | 0.258 | -0.292 | 0.217 |
Upsl | 0.187 | 0.419† | -0.143 | -0.066 | 0.012 |
FPL | 0.147 | -0.168 | -0.088 | -0.703† | 0.407† |
DI | 0.103 | -0.220 | -0.164 | 0.184 | 0.435† |
CA | 0.326 | -0.128 | 0.4† | -0.160 | -0.092 |
TWI | 0.053 | -0.465† | -0.067 | 0.185 | -0.047 |
SPI | 0.345 | -0.014 | 0.46† | 0.169 | 0.080 |
LS_FB | 0.256 | 0.396† | 0.050 | 0.011 | -0.072 |
P_Cur, Pl_Cur, and G_Cur are profile curvature, plan curvature and general curvature, respectively; FA is flow accumulation; RePC1 and RePC2 are topographic relief component 1 and 2, respectively; POP100 is positive openness; Upsl is upslope slope; FPL is flow path length; DI is downslope index; CA is catchment area; TWI is topographic wetness index; and SPI is stream power index; and LS_FB is slope length factor (field based). | |||||
†Loadings> 0.35. |
Table 3: Variable loadings in the principal components (TPCs) calculated for topographic metrics (n = 460) in Walnut Creek Watershed.
Model | R2adj | NSE | RSR | |||||
Stepwise principal component regression (SPCR) | ||||||||
SOC | 2.932-0.058TPC2-0.025TPC3+0.051TPC7+0.037TPC1† | 0.68 | 0.69 | 0.56 | ||||
SR | 2.111+0.013TPC1+0.032TPC7-0.028TPC2-0.016TPC3-0.010TPC6 | 0.63 | 0.63 | 0.61 | ||||
Stepwise ordinary least square regression (SOLSRf) | ||||||||
SOC | 2.755+0.021TWI+0.0004FPL-6.369G_Cur-5.580Slope+ 0.011LsRe+0.091DI+0.013SsRe+0.125LS_FB | 0.7 | 0.71 | 0.55 | ||||
SR | 2.117+0.007LsRe-3.128Slope+0.109DI+0.010SsRe+0.0002FPL+ 0.801Upsl -4.442P_Cur | 0.65 | 0.65 | 0.59 | ||||
Stepwise ordinary least square regression with collinear covariate removed (SOLSRr) | ||||||||
SOC | 2.951+0.033LsRe-2.869Upsl+0.0006FPL+0.028SsRe+0.124DI-0.163LS_FB+0.007SPI-10.187P_Cur | 0.68 | 0.68 | 0.56 | ||||
SR | 2.042+0.016LsRe-0.146LS_FB+0.118DI+0.017SsRe+0.0003FPL+ 0.070POP | 0.63 | 0.64 | 0.6 | ||||
† The order of TPCs is based on the stepwise selection steps | ||||||||
R2adj is adjusted coefficient of determination; NSE is Nash-Sutcliffe efficiency; RSR is ratio of the root mean square error (RMSE) to the standard deviation of measured data. | ||||||||
TPC represents topographic principal component. TWI is topographic wetness index; FPL is flow path length; P_Cur, Pl_Cur, and G_Cur are profile curvature, plan curvature and general curvature, respectively; LS_FB is slope length factor (field based); LsRe and SsRe are large-scale and small-scale topographic reliefs, respectively; DI is downslope index; and Upsl is upslope slope. |
Table 4: Models of soil organic carbon (SOC) density and soil redistribution rates (SR) for agricultural fields based on topographic metrics at sites 1 and 2.
The SOLSRf models had slightly better performances than the SPCR models in calibration at the field scale. However, some of the topographic metrics, such as SPI and CA (r > 0.80), are closely correlated with each other. The collinearity may add uncertainties to model predictions. Because of the multicollinearity among predictors, small changes in the input variables can significantly affect the model predictions41. Therefore, the SOLSRf models tended to be unstable and showed low efficiencies in simulations of SOC density and soil redistribution rate at the watershed scale. The SPCR models substantially outperformed the SOLSRf models in prediction of SOC distribution at the watershed scale. TPCs eliminate the multicollinearity by converting the fifteen topographic metrics into mutually independent (orthogonal) components. The conversion also uncovered underlying relationships among topographic metrics. As indicated by the high loadings (> 0.35) of topographic metrics to the components, the TPC1, TPC2, TPC3, TPC6, and TPC7 were associated with runoff velocity, soil water content, runoff volume, flow divergence, and flow acceleration, respectively. Spatial patterns of soil redistribution rates and SOC distribution were highly correlated with soil water content and runoff divergence in the WCW, which is consistent with the study of Fox and Papanicolaou2, which demonstrated that eroded soil from upland could be impacted by flow divergence in a low-relief agricultural watershed.
Moreover, fewer predictor variables in the SPCR models than the SOLSRf and SOLSRr models reduced the risk of over-fitting the prediction models42,43. There were more than six variables in all the SOLSR models, which may increase the difficulty of data interpretation and induce high variance in model simulations41,44,45. This may account for the lower prediction efficiencies in WCW by the SOLSR models than by the SPCR models.
Topography- based SPCR models have advantages in simulating soil redistribution and associated SOC dynamics. First, topographic information can be easily derived from DEMs. Recent increased accessibility of the high spatial resolution LiDAR data can help improve the accuracy of DEM-derived landscape topography and benefit investigations in regions with limited field observations. Second, using a set of topographic metrics and statistical analyses, the topography-based models can efficiently quantify soil redistribution and SOC distribution patterns. Third, the application of principal component can effectively reduce biases associated with multicollinearity of topographic metrics and increase the stability of the stepwise regression models when applied to multiple spatial scales.
However, the SPCA models may be limited by variables during model development. Although application of the LiDAR data increased in ecological studies, the methods to derive useful topographic information have not yet been fully explored. In this study, the TWI and LsRe showed the highest correlations with SOC density and soil redistribution rates, respectively. However, additional topographic variables that are not considered may be equally or more important in explaining soil erosion and C dynamics. Additionally, other factors such as management practices, which may cause soil erosion variability, were not included in this study. For example, when tillage was parallel to the direction of maximum slope, soil erosion may double relative to the erosion in slantwise tillage turning soil upslope46. Therefore, different tillage practices may also be a reason for the reduced prediction efficiencies of the SPCR models.
The study is based on the paper published in Catena17. Instead of a mechanistic-based analysis of topographic influences on soil movement and soil properties as performed in the Catena paper, here we focused on the methods for quantifying topographic metrics and developing topography-based models. We discussed the feasibility and advantages of using topography-based models in studies of the spatial structure of soil properties. Meanwhile, we improved our models by updating algorithms of slope length factor and flow accumulation. The scale of slope length factor measurement was limited to field's area. Additionally, the deterministic infinity algorithm was used for flow accumulation generation. Compared with the method reported in Li et al.17 which generated flow accumulation with a deterministic eight-node algorithm, the infinity algorithm adopted in this study reduces loops in the flow direction angles and proved to be a better algorithm for low relief areas47.
In conclusion, our results demonstrate the feasibility of topography-based SPCR models in simulating SOC distribution and soil redistribution patterns in agriculture fields. As a cost-effective method to estimate SOC stocks and soil redistribution rates, it is applicable to sites with limited observational data and private lands lacking public access. In future studies, the prediction models could be improved with further refinement and availability of LiDAR data and inclusion of additional topographic metrics. The large-scale soil property maps that were developed based on the models will lead to further understanding of the mechanisms underlying the topographic impacts on soil movement in agricultural landscapes and the fate of SOC at the watershed and regional scales.
The authors have nothing to disclose.
This research was supported by the USDA Natural Resources Conservation Service in association with the Wetland Component of the National Conservation Effects Assessment Project (NRCS 67-3A75-13-177).
Light Detection and Ranging (LiDAR) data | http://www.geotree.uni.edu/lidar/ | Collected from the GeoTREE LiDAR mapping project | |
LECO CNS 2000 elemental analyzer | LECO Corp., St. Joseph, MI | ||
Canberra Genie-2000 Spectroscopy System | CANBERRA Industries | ||
Geographic positioning system | Trimble | RTK 4700 GPS | |
ArcGIS | ESRI, Redlands, CA | 10.2.2 | |
Statistical Analysis System | SAS Institute Inc | ||
System for Automated Geoscientific Analysis | University of Göttingen, Germany | v. 2.2.5, http://www.saga-gis.org/ | GNU General Public License |