Universal Data-Driven Permeability Modeling by Connecting MICP Analytics With Big Data
OR
About the Course
SPWLA members, view the course for free!
Permeability governs fluid transport in porous media and is crucial for optimal reservoir performance. Estimating permeability in tight rocks, such as organic shale or tight sandstones/carbonates, is technically challenging and often associated with significant uncertainties. Big data algorithms combined with mercury injection capillary pressure (MICP) analytics provide valuable insights into pore geometry and connectivity—key factors influencing permeability. Traditional methods proposed by Purcell (1949), Thomeer (1960), and Swanson (1981) are often constrained by limiting assumptions, hence constraining their effectiveness across heterogeneous rocks or various rock types. To address this challenge, we develop a generalized data-driven method leveraging large core data sets to estimate permeability across a diverse range of geological formations.
We analyzed 199 samples from 7 hydrocarbon fields, incorporating MICP and routine core analysis (RCA) data spanning marine mud, tight gas, carbonate, and sandstone formations. Parametric fitting was performed using Thomeer's method, Gaussian cumulative distribution and its probability distribution function (CDF-PDF), and generalized extreme-value (GEV) CDF-PDF. Optimized fitting parameters from GEV, including location (μ), scale (σgev), and weighting constraints, served as inputs for machine-learning methods to estimate permeability. Ridge regression, random forests, and artificial neural networks (ANNs) were applied to estimate logarithmic permeability from porosity and optimized parameters and benchmarked against Swanson's method.
The GEV CDF-PDF method achieved the strongest correlation with permeability, with an R2 of 95%. Incorporating Gaussian Mixture Modeling (GMM) with k-Means++ initialization enhanced the accuracy of the estimation. Ridge regression and random forests improved to 81%, while ANNs achieved 89%. In contrast, Swanson's method consistently overestimated permeability in tight rocks.
This universal method developed in this paper exhibits robust applicability and computational efficiency, yielding permeability estimations in under one minute of CPU time on standard hardware. Nonlinear machine-learning methods with regularization and ANN, combined with prior classification, are recommended for improved accuracy. Ensuring diverse training datasets is essential to maximize reliability of results as well as accurate permeability estimation across varied rock types.
Your Instructor

Chicheng Xu earned his Ph.D. in Petroleum & Geosystems Engineering from the University of Texas at Austin in 2013 and accumulated 20+ years of industry experience in SLB, BP America, BHP Billiton, Aramco Americas, and CNPC USA. His research focuses on advancing subsurface intelligence and automation through computational techniques and data analytics, particularly for interpretation, classification, and modeling using multiscale subsurface data integration. Chicheng co-founded and chaired the SPWLA PDDA SIG and is currently leading the Open Petro Data & Utilities (OPDU) project integrating Petro-LLM capabilities. He served in editorial boards for prominent journals including SEG Interpretation, SPWLA Petrophysics, SPE Reservoir Evaluation & Engineering, and SPE Journal. He is appointed as the Executive Editor of SPE Journal starting in September 2025. He has received multiple technical and service awards, including the 2018 Regional Formation Evaluation Technical Award from SPE Gulf Coast, the 2019 SPWLA Meritorious Service Award, th2020 SPE Outstanding Associate Editor Award, the 2021 SPWLA Meritorious Technical Award, and the 2022 Regional Data Science and Engineering Analytics Technical Award. In 2025, he received the Best Paper Award from the Artificial Intelligence in Geosciences Journal, the SPWLA Distinguished Technical Achievement Award, and the SPE A Peer Apart honoree.