Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression

Chunlei Dai; Shangming Shi; Chao Song

doi:10.15446/esrj.v27n1.104741

Published

2023-05-23

Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression

Aplicación del método de Bosques Aleatorios en la identificación de las capas de petróleo y de agua durante el registro de pozo: caso de estudio en la Depresión Liaohe

DOI:

https://doi.org/10.15446/esrj.v27n1.104741

Keywords:

logging data. random forest. SMOTE. oil and water layer identification (en)
información de registro de pozo, método Bosques Aleatorios, método Smote, identificación de las capas de petróleo y de agua (es)

Downloads

PDF

Authors

Chunlei Dai School of Earth Science, Northeast Petroleum University, Daqing 163318, China
Shangming Shi School of Earth Science, Northeast Petroleum University, Daqing 163318, China
Chao Song School of Earth Science, Northeast Petroleum University, Daqing 163318, China

Abstract (en)
Abstract (es)

Accurate identification of oil and water layers is the basis of qualitative evaluation of reservoir fluid properties or industrial value and selection of testing layers of the well. The traditional oil and water layer identification is mainly based on the extensive use of the well’s logging and logging data, which is inefficient and easy to leak interpretation or misinterpretation for those reservoirs with complex geological conditions. In this paper, the random forest method of machine learning is used to select the lithology, porosity, permeability, movable fluid, oil saturation, S₀, S₁, S₂, T_max of rock as characteristics; smote oversampling is used to expand the sample, and the packet estimation is used to establish the oil and water layer identification model. This method is simple and easy to use, not prone to severe overfitting, and can find the potential rules in the data. The classification performance is excellent, and the accuracy rate can reach more than 89.9%, which solves the problem of low accuracy in oil-water layer identification in the past.

La identificación precisa de las capas de agua y petróleo es la base de la evaluación cualitativa de las propiedades de fluido del yacimiento o de valor industrial, y de la selección de las capas de ensayo del pozo. La identificación tradicional de las capas de petróleo y agua se basa principalmente en el uso extensivo de la información ofrecida por la adquisición de registros del pozo, la cual es ineficiente y fácil de perder información o de incurrir en malinterpretación en aquellos yacimientos con condiciones geológicas complejas. En este artículo se utilizó el método de "Bosques Aleatorios (del inglés Random Forest Method)" para seleccionar la litología, porosidad, permeabilidad, fluidos móviles, saturación de petróleo, y las características de la rocas S₀, S₁, S₂ y T_max. El sobremuestreo con el método Smote se usó para ampliar la muestra, y el paquete de estimación se uilizó para establecer el modelo de identificación de las capas de agua y petróleo. Este método es simple y fácil de usar, además de no ser propenso a un sobreajuste severo, y puede encontrar en la información las normas potenciales que lo rigen. La clasificación del desempeño es excelente, y el índice de exactitud puede alcanzar más del 89.9 %, lo que resuelve el problema de la baja exactitud que se presenta en la identificación de las capas de petróleo y de agua.

References

Bengio, Y., Courville, A., & Vincent, P. (2012). Representation Learning: A Review and New Perspectives. ArXiv. /abs/1206.5538. https://doi.org/10.48550/arXiv.1206.5538

Breiman, L. (2001). Random forest. Machine learning, 45, 5-32 DOI: https://doi.org/10.1023/A:1010933404324

Chawla, N. V., Bowyer, K. W., & Hall, L. O. (2011). SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16(1), 321-357. DOI: https://doi.org/10.1613/jair.953

Cheng, K. (2007). A Review of the Theory and Methods of Statistical Data Preprocessing. Statistics and Information Forum, 22(6), 98-103.

Cutler, A., Cutler, D. R., & Stevens, J. R. (2004). Random Forests. Machine Learning, 45(1), 157-176. DOI: https://doi.org/10.1007/978-1-4419-9326-7_5

Džeroski, S., & Ženko, B. (2004). Is Combining Classifiers with Stacking Better than Selecting the Best One? Machine Learning, 54, 255–273. https://doi.org/10.1023/B:MACH.0000015881.36452.6e

Hang, L. (2012). Statistical learning methods. Beijing: Tsinghua University Press.

Kang, Q., & Lu, L. (2020). Application of stochastic forest algorithm in lithology classification of logging. World Geology, 39(2), 398-405.

Lai, Q., Wei, B., & Wu, Y. (2021). K-Neighbor Algorithm For Igneous Lithology Based on Random Forest. Special Oil and Gas Reservoirs, 28(6), 62-69.

Liang, J., Chen, J., & Zhang, X. (2019). Anomaly Detection Based on Durtific Coding and Convolutional Neural Network. Journal of Tsinghua University (Natural Science Edition), 59(7), 523-529.

Liu, Y., Liu, S., & Ma, Q. (2019). Application of BP neural network method in slate facies identification of Lucaogou Formation in Santanghu Basin. Lithological Reservoirs, 31(4), 101-111.

Pedregosa, F., Varoquaux, G., & Gramfort, A. (2012). Scikit-learn: Machine learning in python. Journal of Machine Learning Research, 12(10), 2825-2830.

Su, G. (2006). Application of Geochemical Gas Logging Data in Oil-Water Reservoir Identification. Logging Technology, 30(6), 551-553.

Wu, Z., Zhang, X., Zhang, C., & Wang, H. (2021). Lithology Recognition Method Based on LSTM Recurrent Neural Network. Lithological Reservoirs, 33(3),120-128.

Xing, C., Zhou, C., & He, Y. (2022). Direct inversion of pore pressure in unconventional reservoir formations by Bayesian method. Lithological Reservoirs, 34(3), 1-7.

Wang, Y., Wang, M., & Tian, S. (2021). Coal Rock Identification Based on Kalman Filter and Random Forest. Coal Technology, 40(12), 208-211.

Wang, Y., Wang, R., & Wie, K. (2021). Classification of compact reservoirs based on random forests: A case study of the eastern box 8 section of Yan'an gas field. Journal of Xi'an Shiyou University (Natural Science Edition), 36(6), 1-8.

Zhao, M., Jin, Y., & Wang, Y. (2021). Application of Stochastic Forest Algorithm in Selection Decision. Computer and Network, 47(22), 56-59.

Zhong, Y., Zhang, T., & Li, P. (2022). Study on the Classification of Stochastic Forest Fusion Model in The Classification of Pressure Well Methods. Journal of Southwest Petroleum University (Natural Science Edition), 44(1), 165-173.

Zhou, Z. (2016). Machine Learning. Beijing: Tsinghua University Press.

Zhou, X., Zhang, Z., & Zhang, C. (2017). Complex lithology recognition based on rough set-random forest algorithm. Daqing Petroleum Geology and Development, 36(6), 127-133.

How to Cite

APA

Dai, C., Shi, S. & Song, C. (2023). Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression. Earth Sciences Research Journal, 27(1), 69–75. https://doi.org/10.15446/esrj.v27n1.104741

ACM

[1]

Dai, C., Shi, S. and Song, C. 2023. Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression. Earth Sciences Research Journal. 27, 1 (May 2023), 69–75. DOI:https://doi.org/10.15446/esrj.v27n1.104741.

ACS

(1)

Dai, C.; Shi, S.; Song, C. Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression. Earth sci. res. j. 2023, 27, 69-75.

ABNT

DAI, C.; SHI, S.; SONG, C. Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression. Earth Sciences Research Journal, [S. l.], v. 27, n. 1, p. 69–75, 2023. DOI: 10.15446/esrj.v27n1.104741. Disponível em: https://revistas.unal.edu.co/index.php/esrj/article/view/104741. Acesso em: 12 may. 2026.

Chicago

Dai, Chunlei, Shangming Shi, and Chao Song. 2023. “Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression”. Earth Sciences Research Journal 27 (1):69-75. https://doi.org/10.15446/esrj.v27n1.104741.

Harvard

Dai, C., Shi, S. and Song, C. (2023) “Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression”, Earth Sciences Research Journal, 27(1), pp. 69–75. doi: 10.15446/esrj.v27n1.104741.

IEEE

[1]

C. Dai, S. Shi, and C. Song, “Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression”, Earth sci. res. j., vol. 27, no. 1, pp. 69–75, May 2023.

MLA

Dai, C., S. Shi, and C. Song. “Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression”. Earth Sciences Research Journal, vol. 27, no. 1, May 2023, pp. 69-75, doi:10.15446/esrj.v27n1.104741.

Turabian

Dai, Chunlei, Shangming Shi, and Chao Song. “Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression”. Earth Sciences Research Journal 27, no. 1 (May 23, 2023): 69–75. Accessed May 12, 2026. https://revistas.unal.edu.co/index.php/esrj/article/view/104741.

Vancouver

1.

Dai C, Shi S, Song C. Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression. Earth sci. res. j. [Internet]. 2023 May 23 [cited 2026 May 12];27(1):69-75. Available from: https://revistas.unal.edu.co/index.php/esrj/article/view/104741

Download Citation

CrossRef Cited-by

2

1. Tarun Jaiswal, Sujata Dash, Ganpati Panda, Sudipta Patowary, Shanchamo Yanthan. (2025). Biologically Inspired Techniques in Many Criteria Decision-Making. Learning and Analytics in Intelligent Systems. 45, p.41. https://doi.org/10.1007/978-3-031-82706-8_5.

2. Aditya Pramada Wicaksono, Achmad Choiruddin. (2024). Candidate Selection of Water Shut-Off in Oil and Gas Industry Using Random Forest. 2024 IEEE International Symposium on Consumer Technology (ISCT). , p.464. https://doi.org/10.1109/ISCT62336.2024.10791205.

Dimensions

PlumX

Article abstract page views

248

Downloads

Download data is not yet available.

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Earth Sciences Research Journal holds a Creative Commons Attribution license.

You are free to:

Share — copy and redistribute the material in any medium or format
Adapt — remix, transform, and build upon the material for any purpose, even commercially.
The licensor cannot revoke these freedoms as long as you follow the license terms.

The Earth Sciences Research Journal is the copyright holder for these license attributes.

	IBN Publindex El Índice Bibliográfico Nacional Publindex es un sistema colombiano para la clasificación, actualización, escalafonamiento y certificación de las publicaciones científicas y tecnológicas. Es regido por COLCIENCIAS y el ICFES en Colombia.
	Directory of Open Access Journals DOAJ aumenta la visibilidad y la facilidad de uso de las revistas científicas y académicas de acceso abierto, pretende ser global y abarcar todas las revistas que utilizan un sistema de control de calidad para garantizar el contenido.
	SciELO Colombia SciELO Colombia es una librería virtual para América Latina, el Caribe, España y Portugal, fue creada por FAPESP en el año de 1997 en Sao Pablo Brasil, actualmente en Colombia es gestionada por la Universidad Nacional de Colombia.
	REDIB (Red Iberoamericana de Innovación y Conocimiento Científico) REDIB es una plataforma de agregación de contenidos científicos y académicos en formato electrónico producidos en el ámbito iberoamericano. REDIB cuenta con una clara vocación de promoción de la innovación tecnológica de las herramientas de producción editorial. Estas facilitan el acceso, la difusión y la puesta en valor de la producción científica generada en los países de su ámbito de actuación, especialmente en los diversos idiomas que les son propios. Los destinatarios de esta información son tanto la comunidad académica como la sociedad en general, así como los responsables, gestores y analistas de políticas científicas.
	Science Citation Index Expanded^TM SCI de Thomson Reuters es un prestigio sistema de indexación en línea que incorpora información bibliográfica y de citación de publicaciones científicas alrededor del mundo.
	Scopus Scopus es una base de datos bibliográfica de resúmenes y citas de artículos de revistas científicas. Cubre aproximadamente 19.500 títulos de más de 5.000 editores internacionales, incluyendo la cobertura de de 16.500 revistas.
	Latindex Latindex es producto de la cooperación de una red de instituciones latinoamericanas que funcionan de manera coordinada para reunir y diseminar información bibliográfica sobre las publicaciones científicas seriadas producidas en la región.

Earth Sciences Research Journal

Published

Application of Random Forest method in oil and water layer identification of logging data: a case study of the Liaohe depression

Aplicación del método de Bosques Aleatorios en la identificación de las capas de petróleo y de agua durante el registro de pozo: caso de estudio en la Depresión Liaohe

DOI:

Keywords:

Downloads

Authors

References

How to Cite

APA

ACM

ACS

ABNT

Chicago

Harvard

IEEE

MLA

Turabian

Vancouver

Download Citation

CrossRef Cited-by

Dimensions

PlumX

Article abstract page views

Downloads

License

Scimago Journal & Country Rank (SJR)

Indexed and registered

Keywords

IBN Publindex

Directory of Open Access Journals

SciELO Colombia

REDIB (Red Iberoamericana de Innovación y Conocimiento Científico)

Science Citation Index Expanded^TM

Scopus

Latindex