Published

2026-02-25

Performance of Random Forest in predicting soil loss based on values calculated by USLE

Desempeño del algoritmo Random Forest en la predicción de la pérdida de suelo basada en valores calculados por la USLE

DOI:

https://doi.org/10.15446/esrj.v29n4.121271

Keywords:

NDVI, Machine Learning, Topographic factor (LS), Erosion processes (en)
NDVI, Aprendizaje automático, Factor topográfico (LS), Procesos de erosión (es)

Downloads

Authors

Soil erosion directly affects agricultural productivity and water resource quality, but estimating soil loss is complex and costly. This study proposes a machine learning (ML) approach to predict soil loss using selected factors from the Universal Soil Loss Equation (USLE) and the Normalized Difference Vegetation Index (NDVI). We applied the Random Forest (RF) algorithm to train and validate two models using different combinations of predictors: (1) NDVI, topographic factor (LS), and land cover/management factor (CP); and (2) NDVI, LS, and soil erodibility factor (K). These variables represent land use, conservation practices, and topographic conditions in the Sorocabuçu River Basin (SRB), part of Brazil’s Atlantic Forest biome with high environmental and socioeconomic value. Soil loss was classified into three classes (in ton/ha): low (0–10.0), moderate (10.1–50.0), and high (≥50.1). A total of 3348 samples were randomly selected and proportionally distributed to reflect class representation across the study area. We used a 70/30 train-test split and standardized parameters (50 trees and four variables per node) to enable reproducibility. The model using NDVI, LS, and CP achieved 93.43% accuracy with a kappa index of 0.90. The performance was especially strong for the low-loss class, the most prevalent in the area. The second model using NDVI, LS, and K achieved 97.14% accuracy with a kappa index of 0.90, showing excellent results, particularly for the high-loss class, which poses the greatest environmental risk. These models prove effective in identifying areas at risk of severe erosion using fewer, more accessible parameters. The approach offers a scalable and practical tool for decision-makers, environmental managers, and public agencies to monitor and mitigate soil degradation, particularly in sensitive and ecologically important regions.

La erosión del suelo afecta directamente la productividad agrícola y la calidad de los recursos hídricos; sin embargo, la estimación de la pérdida de suelo es un proceso complejo y costoso. Este estudio propone un enfoque de aprendizaje automático (Machine Learning (ML)) para predecir la pérdida de suelo utilizando factores seleccionados de la Ecuación Universal de Pérdida de Suelo (USLE) y el Índice de Vegetación de Diferencia Normalizada (NDVI). Se aplicó el algoritmo Random Forest (RF) para entrenar y validar dos modelos con diferentes combinaciones de variables predictoras: (1) NDVI, factor topográfico (LS) y factor de cobertura y manejo del suelo (CP); y (2) NDVI, LS y factor de erodabilidad del suelo (K). Estas variables representan el uso del suelo, las prácticas de conservación y las condiciones topográficas en la cuenca del río Sorocabuçu (SRB), ubicada en el bioma de la Mata Atlántica de Brasil, una región de alto valor ambiental y socioeconómico. La pérdida de suelo se clasificó en tres categorías (en t/ha): baja (0–10,0), moderada (10,1–50,0) y alta (≥50,1). Se seleccionaron aleatoriamente un total de 3348 muestras, distribuidas proporcionalmente para reflejar la representatividad de las clases en el área de estudio. Se utilizó una división de los datos del 70% para entrenamiento y 30% para validación, junto con parámetros estandarizados (50 árboles y cuatro variables por nodo) para garantizar la reproducibilidad del análisis. El modelo basado en NDVI, LS y CP alcanzó una precisión del 93,43% y un índice kappa de 0,90, con un desempeño destacado en la clase de baja pérdida de suelo, la más frecuente en el área. El segundo modelo, que utilizó NDVI, LS y K, obtuvo una precisión del 97,14% y un índice kappa de 0,90, mostrando resultados excelentes, especialmente en la clase de alta pérdida de suelo, que representa el mayor riesgo ambiental. Los resultados demuestran que ambos modelos son eficaces para identificar áreas con riesgo de erosión severa utilizando un conjunto reducido de parámetros más accesibles. Este enfoque constituye una herramienta práctica y escalable para la toma de decisiones por parte de gestores ambientales y organismos públicos, contribuyendo al monitoreo y la mitigación de la degradación del suelo, particularmente en regiones sensibles y de gran importancia ecológica.

References

Arantes, L. T., Santos, A. P., Silva, C. V., Nery, L. M., Toledo, M. V. L., Simonetti, V. C., Silva, D. C. C., & Lourenço, R. W. (2024a). Socioeconomic spatial analysis through fuzzy system as a tool for territorial planning applied to watersheds. International Journal of River Basin Management, 1–17. https://doi.org/10.1080/15715124.2024.2387579

Arantes, L. T., Santos, A. P., Silva, D. C. C., & Lourenço, R. W. (2024b). Indicador de vulnerabilidade ao carreamento de sedimentos integrado ao SIG e SR. Geo UERJ, (45). https://doi.org/10.12957/geouerj.2024.74164.

Amundson, R., Berhe, A. A., Hopmans, J. W., Olson, C., Sztein, A. E., & Sparks, D. L. (2015). Soil and human security in the 21st century. Science, 348(6235), 1261071. https://doi.org/10.1126/science.1261071

Andreoti, C. E. (2012). Avaliação da eficiência de um sistema agroflorestal na recuperação de um solo degradado por pastoreio. Dissertação (mestrado – Programa de Pós-graduação em Geografia Física) – Faculdade de Filosofia, Letras e Ciências Humanas – São Paulo, Brasil. https://doi.org/10.11606/D.8.2012.tde-09012013-121619

Bertoni, J., & Lombardi Neto, F. (1999). Conservação do solo. 4. ed. São Paulo, SP: Ícone.

Boardman, J., & Poesen, J. (2006). Soil erosion in Europe. Ed. John Wiley & Sons Ltd. West Sussex. 855 p. https://doi.org/10.1002/0470859202.ch36

Breiman, L. (2001). Random Forests. Journal Machine Learning, 45, 5-32. https://doi.org/10.1023/A:1010933404324

Cheng, Z., Lu, D., Li, G., Huang, J., Sinha, N., Zhi, J., & Li, S. (2018). A Random Forest-Based Approach to Map Soil Erosion Risk Distribution in Hickory Plantations in Western Zhejiang Province, China. Remote Sensing, 10, 1-20. https://doi.org/10.3390/rs10121899

Costa, R. V. F., Leite, M. G. P., Leao, L. P., Nalini Junior, H. A., Silva, D. C. C., & Valente, T. M. F. (2025). Hydrogeochemistry of surface waters in the Iron Quadrangle, Brazil: High-Resolution Mapping of Potentially Toxic Elements in the Velhas and Paraopeba River Basins. Water, 17, 2446. https://doi.org/10.3390/w17162446

Dubreuil, V., Fante, K. P., Planchon, O., & Sant'Anna Neto, J. L. (2017). Les types de climats annuels au Brésil: une application de la classification de Köppen de 1961 a 2015. EchoGéo, 41, 1-27. https://doi.org/10.4000/echogeo.15017

Empresa Brasileira de Pesquisa Agropecuária [EMBRAPA]. (2018). Solo e relevo: Influência no uso da terra. https://www.embrapa.br/agencia-de-informacao-tecnologica/cultivos/eucalipto/pre-producao/escolha-da-area/solo-e-relevo Access: 02 nov. 2025.

FAO – Food and Agriculture Organization of the United Nations. (2023). The State of Food and Agriculture 2023. Revealing the true cost of food to transform agrifood systems. From: https://www.fao.org/documents/card/en/c/cc7724en. Access: 09 ago. 2023.

FAO – Food and Agriculture Organization of the United Nations. (2021). The state of the world land and water resources for food and agriculture 2021. Disponível em: https://www.fao.org/3/cb7654en/online/cb7654en.html. Access: 09 ago. 2023.

Filho, J. P. (2014). Capacidade preditiva de Modelos Credit Scoring em inferência dos rejeitados. Dissertação (Mestrado em Estatística) – Centro de Ciências Exatas e de Tecnologia, Universidade federal de São Carlos, São Carlos, 95p.

Ganasri, B. P., & Ramesh, H. (2016). Assessment of soil erosion by RUSLE model using remote sensing and GIS-A case study of Nethravathi Basin. Geoscience Frontiers, 7(6), 953-961. https://doi.org/10.1016/j.gsf.2015.10.007

Ghimire, B., Rogan, J., & Miller, J. (2010). Contextual land-cover classification: incorporating spatial dependence in land-cover classification models using random forests and the Getis statistic. Remote Sensing Letters, 1(1), 45-54. https://doi.org/10.1080/01431160903252327

Ghosal, K., & Das Bhattacharya, S. (2020). A review of RUSLE model. Journal of the Indian Society of Remote Sensing, 48, 689-707. https://doi.org/10.1007/s12524-019-01097-0

Glaros, A. G., & Kline, R. B. (1998). Understanding the accuracy of tests with cutting scores: The sensitivity, specificity, and predictive value model. Journal of clinical psychology, 44(6), 1013-1023. https://doi.org/10.1002/1097-4679(198811)44:6%3C1013::aid jclp2270440627%3E3.0.co;2-z

He, Q., Zhao, H., Feng, Y., Wang, Z., Ning, Z., & Luo, T. (2024). Edge computing-oriented smart agricultural supply chain mechanism with auction and fuzzy neural networks. Journal of Cloud Computing: Advances, Systems and Applications, 13(1). https://doi.org/10.1186/s13677-024-00626-8

Helmi, A. M. (2023). Quantifying catchments sediment release in arid regions using GIS-based Universal soil loss equation (USLE). Ain Shams Engineering Journal, 14(8), 102038. https://doi.org/10.1016/j.asej.2022.102038

IBGE – Instituto Brasileiro de Geografia e Estatística. (2021). Banco de Dados de Informações Ambientais. From: https://bdiaweb.ibge.gov.br/. Acess 09 jun. 2023.

Kashiwar, S. R., Kundu, M. C., & Dongarwar, U. R. (2022). Soil erosion estimation of Bhandara region of Maharashtra, India, by integrated use of RUSLE, remote sensing, and GIS. Natural Hazards, 110(2), 937–959. https://doi.org/10.1007/s11069-021-04974-5

Köpen, W. (1948). Climatologia. Buenos Aires: Gráfica Panamericana. 478p.

Kulimushi, L. C., Choudhari, P., Mubalama, L. K., & Banswe, G. T. (2021). GIS and remote sensing-based assessment of soil erosion risk using RUSLE model in South-Kivu province, eastern, Democratic Republic of Congo. Geomatics Natural Hazards and Risk, 12(1), 961–987. https://doi.org/10.1080/19475705.2021.1906759

Kulimushi, L. C., Bashagaluke, J. B., Prasad, P., Heri-Kazi, A. B., Kushwaha, N. L., Masroor, M., Choudhari, P., Elbeltagi, A., Sajjad, H. & Mohammed, S. (2023). Soil erosion susceptibility mapping using ensemble machine learning models: A case study of upper Congo river sub-basin. Catena, 222, 106858. https://doi.org/10.1016/j.catena.2022.106858

Lepsch, I. F. (2010). Formação e conservação dos solos. 2. ed. São Paulo, SP: Oficina de Textos.

Maia Júnior, L. P. M. & Lourenço, R. W. (2020). Impactos das mudanças no uso e cobertura da terra sobre a variabilidade do albedo na Bacia Hidrográfica do Rio Sorocabuçu (Ibiúna-SP). Revista Brasileira de Climatologia, 27, 443-462. https://doi.org/10.5380/abclima.v27i0.72761

Nery, L. M., Sabonaro, D. Z. & Silva, D. C. C. (2023). A multicriteria analysis for decision making. Environment, Development and Sustainability, 25, 1-19. https://doi.org/10.1007/s10668-023-03261-6

Nery, L. M., Gomes, G., Nicomedes, N. P., Sabonaro, D. Z., & Silva, D. C. C. (2024). Análise socioambiental de sistemas de integração: quais seus benefícios, desafios e oportunidades? RISUS. Journal on Innovation and Sustainability, 15, 177–192. https://doi.org/10.23925/2179-3565.2024v15i2p177-192

Nery, L. M., Toniolo, B. P., Santos, A. P., Martins, A. C. G. & Silva, D. C. C (2025). Challenge of political integration in the territorial management of a protected area based on the analysis of land use and land cover change. Journal of Environmental Studies and Sciences, 15, 845–860. https://doi.org/10.1007/s13412-024-00990-6

Nguyen, K. A., Chen, W., Lin, B. S., Seeboonruang, U. & Thomas, K. (2019). Predicting Sheet and Rill Erosion of Shihmen Reservoir Watershed in Taiwan Using Machine Learning. Sustainability, 11, 1-18. https://doi.org/10.3390/su11133615

Noi, P. T. & Kappas, M. (2017). Comparison of random forest, k-nearest neighbor, and support vector machine classifiers for land cover classification using Sentinel-2 imagery. Sensors, 18(1), 18. https://doi.org/10.3390/s18010018

Pacheco, F. A. L., Fernandes, L. F. S., Valle Júnior, R. F., Valera, C. A. & Pissarra, T. C. T. (2018). Land degradation: Multiple environmental consequences and routes to neutrality. Current Opinion in Environmental Science & Health, 5, 78-86. https://doi.org/10.1016/j.coesh.2018.07.002

Panagos, P., Borrelli, P., Poesen, J., Ballabio, C., Lugato, E., Meusburger, K., Montanarella, L. & Alewell, C. (2015). The new assessment of soil loss by water erosion in Europe. Environmental Science & Policy, (54), 438-447. https://doi.org/10.1016/j.envsci.2015.08.012

Pandey, A., Chowdary, V. M. & Mal, B. C. (2007). Identification of critical erosion prone areas in the small agricultural watershed using USLE, GIS and remote sensing. Water Resources Management, 21(4), 729-746. https://doi.org/10.1007/s11269-006-9061-z

Paula, A. L., Pereira dos Santos, A., Belfort Poletti, F., & Lourenço, R. W. (2025). Adjustment of the conservation practices factor calculation in estimating soil loss. Ra’e Ga: O Espaço Geográfico em Análise, 63(1), 125–151. https://doi.org/10.5380/raega.v63i1.100335

QGIS. (2023). QGIS Geographic Information System. QGIS Association. From: http://www.qgis.org. Acess: 09 jun. 2023.

Rizzo, F. A., Santos, A. P., & Silva, D. C. C. (2024). Técnicas de geoprocessamento aplicadas para análise temporal do microclima na bacia hidrográfica do córrego do Pequiá, Maranhão. Boletim Goiano de Geografia, 44, e78032. https://doi.org/10.5216/bgg.v44i1.78032

Rossi, M. (2017). Mapa pedológico do Estado de São Paulo: Revisado e ampliado. Instituto Florestal. https://www.infraestruturameioambiente.sp.gov.br/institutoflorestal.

Rouse, J. J. R., Haas, R. H., Schell, J. A. & Deering, D. W. (1973). Monitoring the vernal advancement and retrogradation (green wave effect) of natural vegetation. Remote Sensing Center Texas A&M University College Station, Texas. 93p. From: https://core.ac.uk/download/pdf/42887948.pdf. Acess: 09 jun. 2023.

RStudio Team (2023). RStudio: Integrated Development Environment for R. RStudio, PBC, Boston. From: http://www.rstudio.com/. 09 jun. 2023.

Santos, A. P., Silva Junior, A. X., Nery, L. M., Gomes, G., Toniolo, B. P., da Cunha e Silva, D. C., & Lourenço, R. W. (2025). Random forest algorithm applied to model soil textural classification in a river basin. Environmental Monitoring and Assessment, 197, 330. https://doi.org/10.1007/s10661-025-13786-0.

Sheikh, A. H., Palria, S. & Alam, A. (2011). Integration of GIS and Universal Soil Loss Equation (USLE) for soil loss estimation in a Himalayan watershed. Recent Research in Science and Technology, 3(3), p. 51-57. https://www.researchgate.net/publication/286921198_INTEGRATION_OF_GIS_AND_UNIVERSAL_SOIL_LOSS_EQUATION_USLE_FOR_SOIL_LOSS_ESTIMATION_IN_A_HIMALAYAN_WATERSHED

Silva, D. C. C., Albuquerque Filho, J. L., Sales, J. C. A. & Lourenço, R. W. (2017). Identificação de áreas com perda de solo acima do tolerável usando NDVI para o cálculo do fator C da USLE. Ra'e Ga, 42, 72-85. http://dx.doi.org/10.5380/raega.v42i0.45524

Simonetti, V. C., Silva, D. C. C., & Rosa, A. H. (2022). Correlação espacial compartimentada dos padrões de drenagem com características morfométricas da bacia hidrográfica do rio Pirajibu-Mirim. Revista Brasileira de Geomorfologia, 23, 1134–1154. https://doi.org/10.20502/rbg.v23i1.2037

Tanyas, H., Kolat Ç. & Süzen, M. L. (2015). A new approach to estimate cover-management factor of RUSLE and validation of RUSLE model in the watershed of Kartalkaya Dam. Journal of Hydrology, 528, 583-598. https://doi.org/10.1016/j.jhydrol.2015.06.048

Tarek, Z., Elshewey, A. M., Shohieb, S. M., Elhady, A. M., El-Attar, N. E., Elseouf, S., Shams, M. Y. (2023). Soil Erosion Status Prediction Using a Novel Random Forest Model Optimized by Random Search Method. Sustainability, 15(9), 7114. https://doi.org/10.3390/su15097114

Toniolo, B. P., Nery, L. M., & Silva, D. C. C. (2024). Modelagem espacial para identificação de áreas potenciais à geração de poluição difusa na Bacia Hidrográfica do Rio Cotia - SP. URBE. Revista Brasileira de Gestão Urbana, 16, e20220207. https://doi.org/10.1590/2175-3369.016.e20220207.

UN – United Nations. The 17 goals. (2023). From: https://sdgs.un.org/goals. Acess: 30 nov. 2023.

Van Stralen, K. J., Stel, V. S., Reitsma, J. B., Dekker, F. W., Zoccali, C. & Jager, K. J. (2009). Diagnostic methods I: sensitivity, specificity, and other measures of accuracy. Kidney international, 75(12), 1257-1263. https://doi.org/10.1038/ki.2009.92

Weiss, S. M. & Zhang, T. (2003). Performance analysis and evaluation. In: The handbook of Data Mining. Lawrence Erlbaum Associates Publishers, Mahwah, NJ, 14, 425 – 440.

Wischmeier, W. H. & Smith, D. D. (1978). Predicting rainfall erosion losses – A guide to conservation planning. Washington, USDA, 1978. 58p. (USDA AH-537). file:///C:/Users/simio/Downloads/USLE.pdf

Xiao, Y. G. B., Lu, Y., Zhang, R., Zhang, D., Zhen, X., Chen, S., Wu, H., Wei, C., Yang, L. & Zhang, Y. (2021). Spatial–temporal evolution patterns of soil erosion in the Yellow River Basin from 1990 to 2015: impacts of natural factors and land use change. Geomat Nat Hazard Risk, 12(1), 103–122. https://doi.org/10.1080/19475705.2020.1861112

Yang, D., Kanae, S., Oki, T., Koike, T. & Musiake, K. (2003). Global potential soil erosion with reference to land use and climate changes. Hydrological Processes, 17, 2913-2918. https://doi.org/10.1002/hyp.1441

Yang, T., Siddique, K. H. M. & Liu, K. (2020). Cropping systems in agriculture and their impact on soil health - A review. Global Ecology and Conservation, 23, e01118. https://doi.org/10.1016/j.gecco.2020.e01118

Zhang, H. K. & Roy, D. P. (2017). Using the 500 m MODIS land cover product to derive a consistent continental scale 30 m Landsat land cover classification. Remote Sensing of Environment, 197, 15-34. https://doi.org/10.1016/j.rse.2017.05.024

Zhang, X., Song, J., Wang, Y., Deng, W. & Liu, Y. (2021). Effects of land use on slope runoff and soil loss in the Loess Plateau of China: A meta-analysis. Science of The Total Environment, 755(1), 142418. https://doi.org/10.1016/j.scitotenv.2020.142418

How to Cite

APA

Pereira dos Santos, A., Moreira Nery, L., Tondato Arantes, L., Pereira Toniolo, B., Collins da Cunha e Silva , D. & Wagner Lourenço, R. (2026). Performance of Random Forest in predicting soil loss based on values calculated by USLE. Earth Sciences Research Journal, 29(4), 379–386. https://doi.org/10.15446/esrj.v29n4.121271

ACM

[1]
Pereira dos Santos, A., Moreira Nery, L., Tondato Arantes, L., Pereira Toniolo, B., Collins da Cunha e Silva , D. and Wagner Lourenço, R. 2026. Performance of Random Forest in predicting soil loss based on values calculated by USLE. Earth Sciences Research Journal. 29, 4 (Feb. 2026), 379–386. DOI:https://doi.org/10.15446/esrj.v29n4.121271.

ACS

(1)
Pereira dos Santos, A.; Moreira Nery, L.; Tondato Arantes, L.; Pereira Toniolo, B.; Collins da Cunha e Silva , D.; Wagner Lourenço, R. Performance of Random Forest in predicting soil loss based on values calculated by USLE. Earth sci. res. j. 2026, 29, 379-386.

ABNT

PEREIRA DOS SANTOS, A.; MOREIRA NERY, L.; TONDATO ARANTES, L.; PEREIRA TONIOLO, B.; COLLINS DA CUNHA E SILVA , D.; WAGNER LOURENÇO, R. Performance of Random Forest in predicting soil loss based on values calculated by USLE. Earth Sciences Research Journal, [S. l.], v. 29, n. 4, p. 379–386, 2026. DOI: 10.15446/esrj.v29n4.121271. Disponível em: https://revistas.unal.edu.co/index.php/esrj/article/view/121271. Acesso em: 3 mar. 2026.

Chicago

Pereira dos Santos, Arthur, Liliane Moreira Nery, Leticia Tondato Arantes, Bruno Pereira Toniolo, Darllan Collins da Cunha e Silva, and Roberto Wagner Lourenço. 2026. “Performance of Random Forest in predicting soil loss based on values calculated by USLE”. Earth Sciences Research Journal 29 (4):379-86. https://doi.org/10.15446/esrj.v29n4.121271.

Harvard

Pereira dos Santos, A., Moreira Nery, L., Tondato Arantes, L., Pereira Toniolo, B., Collins da Cunha e Silva , D. and Wagner Lourenço, R. (2026) “Performance of Random Forest in predicting soil loss based on values calculated by USLE”, Earth Sciences Research Journal, 29(4), pp. 379–386. doi: 10.15446/esrj.v29n4.121271.

IEEE

[1]
A. Pereira dos Santos, L. Moreira Nery, L. Tondato Arantes, B. Pereira Toniolo, D. Collins da Cunha e Silva, and R. Wagner Lourenço, “Performance of Random Forest in predicting soil loss based on values calculated by USLE”, Earth sci. res. j., vol. 29, no. 4, pp. 379–386, Feb. 2026.

MLA

Pereira dos Santos, A., L. Moreira Nery, L. Tondato Arantes, B. Pereira Toniolo, D. Collins da Cunha e Silva, and R. Wagner Lourenço. “Performance of Random Forest in predicting soil loss based on values calculated by USLE”. Earth Sciences Research Journal, vol. 29, no. 4, Feb. 2026, pp. 379-86, doi:10.15446/esrj.v29n4.121271.

Turabian

Pereira dos Santos, Arthur, Liliane Moreira Nery, Leticia Tondato Arantes, Bruno Pereira Toniolo, Darllan Collins da Cunha e Silva, and Roberto Wagner Lourenço. “Performance of Random Forest in predicting soil loss based on values calculated by USLE”. Earth Sciences Research Journal 29, no. 4 (February 25, 2026): 379–386. Accessed March 3, 2026. https://revistas.unal.edu.co/index.php/esrj/article/view/121271.

Vancouver

1.
Pereira dos Santos A, Moreira Nery L, Tondato Arantes L, Pereira Toniolo B, Collins da Cunha e Silva D, Wagner Lourenço R. Performance of Random Forest in predicting soil loss based on values calculated by USLE. Earth sci. res. j. [Internet]. 2026 Feb. 25 [cited 2026 Mar. 3];29(4):379-86. Available from: https://revistas.unal.edu.co/index.php/esrj/article/view/121271

Download Citation

CrossRef Cited-by

CrossRef citations0

Dimensions

PlumX

Article abstract page views

32

Downloads

Download data is not yet available.