کاربرد ماشین بردار پشتیبان (SVM) و درخت رگرسیون تقویت شده (BRT) جهت مدل سازی حساسیت فرسایش خندقی درحوضه آبخیز رودخانه شور (شهرستان مُهر) (مقاله علمی وزارت علوم)
درجه علمی: نشریه علمی (وزارت علوم)
آرشیو
چکیده
هدف از این مطالعه توسعه مدل های حساس فرسایش خندقی با اجرای الگوریتم یادگیری ماشینی (ماشین بردار پشتیبان و درخت رگرسیون تقویت شده) در حوضه مُهر است. ابتدا، مناطق خندقی شناسایی و پس از آن 13 متغیر مستعد کننده فرسایش خندقی (شیب، جهت شیب، شاخص رطوبت توپوگرافی، شاخص قدرت جریان، شاخص زبری سطح، فاصله از آبراهه، تراکم زهکشی، فاصله از جاده، کاربری اراضی، پوشش گیاهی، متوسط بارندگی سالانه، زمین شناسی و بافت خاک) انتخاب شد. ضریب تورم واریانس برای ارزیابی چندخطی بین متغیرها استفاده شد. در نهایت نقشه حساسیت فرسایش خندقی در محیط (R) تهیه شد. همچنین تأثیر ویژگی های فیزیکی و شیمیایی خاک بر فرسایش خندقی با استفاده از رگرسیون چند متغیره بررسی شد. از نظر اهمیت متغییرها، در مدل SVM کاربری اراضی و در مدل BRT زمین شناسی بیشترین تأثیر را بر فرسایش خندقی دارد. نقشه حساسیت پیش بینی شده با کمک منحنی مشخصه عملکرد گیرنده (ROC) اعتبارسنجی شد. نتایج نشان داد که مساحت زیر منحنی (AUC) در مدل های ماشین بردار پشتیبان و درخت رگرسیون تقویت شده به ترتیب 92/0 و 94/0 محاسبه شد که منجر به پیش بینی دقیقی شد. همچنین نتایج نشان داد متغیر ماسه (299/9)، نسبت جذب سدیم (967/7) و مواد خنثی شونده (185/6) بیشترین تأثیر را بر فرسایش خندقی داردApplication of Support Vector Machine (SVM) and Boosted Regression Tree (BRT) to Model the Sensitivity of Gully Erosion in the Watershed of Shore River Moher City
The aim of this study is to develop sensitive gully erosion models by implementing a machine learning algorithm (Support Vector Machine and Boosted Regression Tree) in the Moher basin. First, gully areas are identified, and then 13 variables predisposing to gully erosion (Slope, Slope Direction, Topographic Wetness Index, Streem Power Index, Terrain Ruggedness Index, Distance from Waterway, Drainage Density, Distance from Road, Land use, NDVI, Avera annual Rainfall, Geology, and Soil Texture) were selected. The variance inflation coefficient was used to evaluate multicollinearity between variables. Finally, a gully erosion sensitivity map was prepared in the environment (R). Also, the effect of physical and chemical characteristics of soil on gully erosion was investigated using Multivariate Regression. Regarding the importance of variables, Geology has the most significant effect on gully erosion in the SVM model, Land use, and the BRT model. The predicted sensitivity map was validated with the help of the receiver operating characteristic (ROC) curve. The results showed that the area under the curve (AUC) in the Support Vector Machine and Boosted Regression Tree models were calculated as 0.92 and 0.94, respectively, which led to accurate prediction. Also, the results showed that the sand variable (9.299), sodium absorption ratio (7.967), and TNV (6.185) have the most significant effect on gully erosion
Extended
Introduction
Currently, many countries are facing severe land degradation and soil erosion. Soil erosion caused by water is a major environmental concern, affecting approximately one billion hectares of land worldwide. Gully erosion is one of the most essential factors in land degradation in dry and semi-arid regions, causing significant soil losses and the transfer of sediments to low-altitude areas. Many researchers investigated gully erosion sensitivity using remote sensing (RS) and geographic information systems (GIS) techniques. Traditional data mining methods cannot establish the relationship between geoenvironmental factors and gully erosion processes. Therefore, machine learning models are highly efficient for evaluating areas sensitive to gully erosion. Support vector machines (SVM) and boosted regression trees (BRT) are machine-learning techniques to model gully erosion. In the lower reaches of a watershed or near agricultural lands, gully erosion caused sediment production and desertification of the region. Using SVM and BRT models, it is possible to prepare a map of gully erosion sensitivity and use zoning to reduce potential damage and manage crises.
Materials and methods
The study area covers approximately 101350 hectares and is located in southern Iran, with an elevation range of 387 to 1672 meters above sea level. The basin is situated between longitude 52°24'52" to 52°59'52"E and latitude 27°22'27" to 41°49'27"N. Initially, a gully erosion points map was created based on 200 gully and non-gully points, identified through field visits and data obtained from the Natural Resources Department of Fars province. Random non-gully points were also selected throughout the basin using a Geographic Information System (GIS). Furthermore, 13 variables were chosen for modeling based on the approach utilized by previous researchers. These variables encompassed slope, slope aspect, drainage density, distance from the stream, land use, geology, soil texture, stream power index (SPI), topographic wetness index (TWI), topographic roughness index (TRI), average precipitation, distance from the road, and vegetation cover. Multicollinearity analysis employing the variance inflation factor (VIF) was conducted to determine the linear correlation between variables. Subsequently, machine learning algorithms, including Support Vector Machine (SVM) and Boosted Regression Trees (BRT), were employed for modeling. Finally, the accuracy of the models was evaluated using the ROC curve. Moreover, the effect of physical and chemical characteristics of soil on gully erosion was investigated using multivariate regression.
Results and discussion
Investigating the collinearity between the selected variables is crucial in creating gully erosion sensitivity maps. Among the 13 selected variables, no significant collinearity was observed in the SVM and BRT models. The areas with low sensitivity to gully erosion are mainly concentrated in the region's northern, northwestern, and southwestern parts. The moderately susceptible areas are in the middle of the basin, while the highly susceptible areas are in the southern and southeastern parts. The SVM model had an AUC value of 92.0% for the training dataset and 93.0% for the test dataset. The BRT model had an AUC value of 96.0% for the training dataset and 94.0% for the test dataset. Both models demonstrated high accuracy.
Soil erosion can cause severe environmental and human damage through natural or human-induced gully formation. In dry and semi-dry areas, the formation and development of gullies are mainly problematic and lead to a decrease in soil quantity and quality and a reduction in agricultural productivity. Additionally, the sediment resulting from soil erosion can accumulate at the basin outlet and create problems for the environment and humans. Therefore, it is essential to investigate the formation and development of gullies and determine their importance in environmental modeling and management.
In the Mohr watershed, gully erosion has become a problem, and identifying vulnerable areas using SVM and BRT models is useful for sustainable land management. Topography, slope, surface runoff, drainage density, soil erosion, soil moisture, vegetation cover, and geomorphological processes affect the formation and expansion of gullies. Most gullies are formed and expanded in the north, northeast, east, and northwest directions. Gully erosion has a positive relationship with drainage density, and gullies are formed in areas with low slopes and high moisture. Soil texture and land use are also important for controlling gully erosion. The type of relationship between erosion factors and surface cover of the region is determined based on land use. The presence of vegetation cover in this area is highly important and effective in controlling erosion. The absence of vegetation cover creates runoff and causes erosion and sedimentation on a large scale. Additionally, the impact of road networks is crucial in soil erosion. The results of geological variables show that Quaternary sediments play an important role in the formation of gully erosion. Quaternary sediments are effective in gully erosion formation due to high levels of gypsum and salt, fine-grained sediment deposits, loess instability, soil structure weakening, and lack of organic matter. Soil texture is also used as a controlling mechanism in gully erosion. The results of physical and chemical characteristics showed that the sand variable (9.299), sodium absorption ratio (7.967), and TNV (6.185) have the most significant effect on gully erosion.
Conclusion
A study was conducted to prepare a gully erosion sensitivity prediction map using machine learning algorithms (SVM and BRT) in the Shoor River watershed (Mohr county). This research investigated 13 gully erosion-sensitive variables and their performance in gully development. The entire dataset was randomly divided into 70/30 for training and validation. The area under the curve (AUC) values for SVM and BRT were 0.92 and 0.94, respectively. In both models, more than half of the watershed was classified as low to very low sensitivity (SVM: 89.22%, BRT: 92.84%), while 2.62% and 3.58% of the area were classified as highly sensitive in SVM and BRT models, respectively. Also, the results of the physical and chemical characteristics of the soil showed that sand (9.299), sodium absorption ratio (7.967), and TNV (6.185) have the most significant effect on the expansion of gully erosion in the study area. Finally, the results showed that machine learning models can determine gully erosion boundaries, and the resulting susceptibility maps can be used as essential tools for protecting and sustainably managing gully erosion-prone areas in the study area.
Funding
There is no funding support.
Authors’ Contribution
All of the authors approved the content of the manuscript and agreed on all aspects of the work.
Conflict of Interest
Authors declared no conflict of interest.
Acknowledgments
We are grateful to all the scientific consultants of this paper.