The use of machine learning for various predictive models of the occurrence of pipe defects
Abstract
The paper analyses and compares possible predictive models for predicting pipeline defects using machine learning. As an example of the models, the data from in-tube diagnostics for stress corrosion cracking are selected. Special attention was paid to the description of the learning process of models based on machine learning algorithms based on retrospective data. Extended metrics are presented for the prepared parameters of the defects themselves and related data from the survey reports. The performance metrics of the algorithms are given in comparison to the probabilities of a correct prognosis. Conclusions have been drawn on the correlation of defects of this type with data on the soil composition of the soil of the segment of the analysed pipe. The analysis of statistical data on the defects by the pipe orientation was carried out. An approach has been proposed and tested to improve the accuracy characteristics of the model based on the Random Forest algorithm using preliminary data selection, which made it possible to achieve a heuristic probability of more than 80%.
References
Asha LN, Huang Y, Yodo N, Liao H. A Quantitative Approach of Measuring Sustainability Risk in Pipeline Infrastructure Systems. Sustainability. 2023; 15(19):14229. doi: 10.3390/su151914229
Zhuchkov KN, Zavyalov AP, Lukyanov VA. On the necessity to develop approaches to the assessment of reliability parameters of equipment and pipelines of gas transport systems. Gas Ind. 2016; 11(745): 54–60.
Lu H, Xu ZD, Zang X, et al. Leveraging Machine Learning for Pipeline Condition Assessment. Journal of Pipeline Systems Engineering and Practice. 2023; 14(3). doi: 10.1061/JPSEA2.PSENG-1464
Zavyalov AP, Zhuchkov KN, Vasilchenko M. Process Pipeline Strength Calculation Methodology Enhancement Using Finite-Element Method. Journal of Pipeline Systems Engineering and Practice. 2023; 14 (2). doi: 10.1061/JPSEA2.PSENG-1401
Moubray J. Reliability-centered Maintenance. 2 nd. Industrial Press Inc.; 1997.
Kraidi L, Shah R, Matipa W, Borthwick F. An investigation of mitigating the safety and security risks allied with oil and gas pipeline projects. J. Pipeline Sci. Eng. 2021; 1: 349–359. doi: 10.1016/j.jpse.2021.08.002
Kucharj PE. MERO implementation of PIMS Science and technologies: oil and oil products pipeline transportation. Edn Ozdkzh. 2012; 2(6): 103–105.
Berle Ø, Norstad I, Asbjørnslett BE. Optimization, risk assessment and resilience in LNG transportation systems. Supply Chain. Manag. Int. J. 2013; 18: 253–264.
Wang H, Xu Y, Shi B, et al. Optimization and intelligent control for operation parameters of multiphase mixture transportation pipeline in oilfield: A case study. J. Pipeline Sci. Eng. 2021; 1: 367–378.
Yasir M, Afrin T, Huang Y, Yodo N. Sustainable Development for Oil and Gas Infrastructure from Risk, Reliability, and Resilience Perspectives. Sustainability. 2023; 15(6): 4953. doi: 10.3390/su15064953
Jan AA, Lai FW, Asif M, et al. Embedding sustainability into bank strategy: Implications for sustainable development goals reporting. Int. J. Sustain. Dev. World Ecol. 2023; 30: 229–243. doi: 10.1080/13504509.2022.2134230
Jan AA, Lai FW, Siddique J, et al. A walk of corporate sustainability towards sustainable development: a bibliometric analysis of literature from 2005 to 2021. Environ Sci Pollut Res. 2023; 30: 36521–36532. doi: 10.1007/s11356-022-24842-4
Institute of Gas and Gas Technology-Gazprom VNIIGAZ. Rules for determining the technical condition of main gas pipelines based on the results of in-pipe inspection (Russian). Available online: https://gisprofi.com/gd/documents/sto-gazprom-2-2-3-292-2009-pravila-opredeleniya-tehnicheskogo-sostoyaniya.html?ysclid=ltvorqbret20928955 (accessed on 2 June 2024).
Ovodkova K, Zhuchkov K, Zavyalov A. Revisiting the issue of preparation of initial array of information for training neural networks to determine parameters of pipeline defects. Proceedings of the Gubkin Russian State University of Oil and Gas. 2023; 2(311): 85–97.
Su H, Wang J, Zeng Y, et al. Construction and optimization of corrosion map in a broad region of acidic soil via machine learning. Corrosion 2024; 4498. doi: 10.5006/4498
Huang Y, Xu D, Huang L, et al. Responses of soil microbiome to steel corrosion. NPJ Biofilms Microbiomes. 2021; 7(6). doi: 10.1038/s41522-020-00175-3
Linton V, Gamboa E, Law M. Strategies for the repair of stress-corrosion cracked gas transmission pipelines: assessment of the potential for fatigue failure of dormant stress-corrosion cracks due to cyclic pressure service. Journal of pipeline engineering. 2007; 6(4): 207–217.
Vasilchenko M, Zavyalov A, Zhuchkov K. Increasing the stability of a spatially distributed information system using a robust algorithm for filtering anomalous measurements. IT in industry. 2020; 3(8): 1–7
Oguzie EE, Agochukwu IB, Onuchukwu AI. Monitoring the corrosion susceptibility of mild steel in varied soil textures by corrosion product count technique. Materials Chemistry and Physics. 2004; 84(1): 1–6, doi: 10.1016/j.matchemphys.2003.09.002.
Alsit A, Hamdan H, Alkhedher M, et al. Electrochemistry Numerical Study of Stress Corrosion Cracking under Near-Neutral Soil Solution. Key Engineering Materials. 2023. doi: 10.4028/p-zud4rd
Zhuchkov K, Zavyalov A. Improvement of technology of in-tube diagnostics of pipelines using algorithm of automated processing of diagnostic data. Science and technologies: oil and oil products pipeline transportation. 2022; 12(6): 540–549. doi: 10.28999/2541-9595-2022-12-6-540-549
Wang YA, Huang Q, Yao Z, Zhang Y. On a class of linear regression methods. Journal of Complexity. 2024; 82: 101826. doi: 10.1016/j.jco.2024.101826
Attaoui S, Bentata B, Bouzebda S, Laksaci A. The strong consistency and asymptotic normality of the kernel estimator type in functional single index model in presence of censored data. AIMS Mathematics. 2024; 9(3): 7340–7371. doi: 10.3934/math.2024356
Božić D, Runje B, Razumić A. Risk Assessment for Linear Regression Models in Metrology. Preprints. 2024; 2024020948. doi: 10.20944/preprints202402.0948.v1
Linear regression. Available online: https://www.geeksforgeeks.org/ml-linear-regression/ (accessed on 18 November 2024)
Mantas CJ, Castellano JG, Moral-García S, Abellán J. A comparison of random forest based algorithms: random credal random forest versus oblique random forest. Soft Computing-A Fusion of Foundations, Methodologies and Applications. 2019; 23(21): 10739–10754. doi: 10.1007/s00500-018-3628-5
Liu J, Wang H, Hang H, et al. Self-Supervised Random Forest on Transformed Distribution for Anomaly Detection. IEEE Transactions on Neural Networks and Learning Systems. 2024; 1–15. doi: 10.1109/TNNLS.2023.3348833.
Random forest. Available online: https://www.geeksforgeeks.org/random-forest-algorithm-in-machine-learning/ (accessed on 18 November 2024).
Zhuchkov K, Vasilchenko M, Zagrebneva A, Zavyalov A. Improving the accuracy of estimates of the pulse sequence period using the methodology of complete sufficient statistics. Scientific Reports. 2022; 12( 1): 19932. doi: 10.1038/s41598-022-24457-2
Bulanov V, Fomicheva O. Method of trees solutions for binary and multiclass classification problems. Engineering Physics. 2020; 3:19–26. doi: 10.25791/infizik.03.2020.1123
Copyright (c) 2024 Konstantin Zhuchkov, Alexey Zavyalov, Alexey Lopatin, Dmitry Pochikeev, Ksenya Ovodkova, Mikhail Vasilchenko
This work is licensed under a Creative Commons Attribution 4.0 International License.