Essential Data Science for Subsurface Geoscientists and Engineers

This course provides a practical introduction to data science and its application in the E&P domain.

Description

Interest in data science and machine learning is rapidly expanding, offering the promise of increased efficiency in E&P, and holding the potential to analyse and extract value from vast amounts of under-utilised legacy data. Combined with petroleum geoscience and engineering domain knowledge, the key elements underlying the successful application of the technology are: data, code, and algorithms. This course builds on public datasets, code examples written in Python, statistical graphics, and algorithms from popular data science packages to provide a practical introduction to the subject and its application in the E&P domain.

Duration and Logistics

Classroom version: 5 days consisting of lectures and computer-based exercises and practicals.

Virtual version: Ten, 3-hour online sessions presented over 5 days. The course is at an introductory level and all subject matter will be taught from scratch. No prior experience of statistics, Python coding or machine learning is required, although some basic college level knowledge of maths and statistics is useful. Hands-on computer workshops form a significant part of this course, and participants must come equipped with a laptop computer running Windows (8, 10, 11) or MacOS (10.10 or above) with sufficient free storage (4 Gb). Detailed installation instructions are provided in advance so that participants can set up their computer with the data science toolkit and course materials before the course starts.

Level and Audience

Fundamental. This is an introductory course for reservoir geologists, reservoir geophysicists, reservoir engineers, data management, and technical staff who want to learn the key concepts of data science.

Objectives

You will learn to:

  1. Analyse project data using the data science toolkit; notebooks, visualization, and communication.
  2. Perform data import and manipulation, data visualization, exploratory data analysis, and building predictive models from data.
  3. Have a working knowledge of coding in Python.
  4. Coordinate reference systems including geographic and projected coordinate systems.
  5. Use the fundamentals of machine learning including background concepts, the different types of machine learning, and the basic workflow to build and evaluate models from data.

Tutor(s)

David Psaila: Analytic Signal Limited

Course Content

The course comprises a mix of lectures and hands-on computer workshops. You’ll gain a working knowledge of coding in Python. You’ll learn the tradecraft of data import and manipulation, data visualization, exploratory data analysis, and building predictive models from data. You’ll also gain a powerful working environment for data science on your own computer, which together with code examples provided by the course will give you a jump start to applying the techniques you’ll learn to your own projects. For a flavour of what you’ll learn, check out this gallery of visualization samples https://www.analyticsignal.com/visualization/index.html drawn from the course workshops.

What data sources are used?
Using real E&P data sources is an important element of the hands-on computer workshops. This course makes extensive use of open data provided the UK Oil and Gas Authority and the UK National Data Repository. These data sources are not only typical of the challenges and complexity presented by E&P datasets, but also contain sufficient data quality issues to make them ideal for teaching the all important skills of data cleaning and manipulation. The course makes use of well logs, tops, seismic, and production data from these sources. The data are released in the public domain and you can continue to use these sources as you gain in experience after the course.

What data science tools are used?
The course introduces a data science toolkit based on Visual Studio Code from Microsoft. This free product is rapidly growing in popularity as an environment for Python coding and data science. We think this toolkit provides a best-in-class environment for learning data science and subsequently moving to work on real projects, and we provide a free extension to further enhance its data science capabilities. The toolkit components will be installed on your computer - the advantage of this approach over cloud-based platforms is that your data is never uploaded to the cloud (if security is an issue), and you will be able to continue working when offline (if internet access is an issue).

Day 1
Module 1. Overview

  • What is Data Science - Overview of the course, and an outline of the scope of data science.
  • Data Science for E&P - Addressing the role of data science in E&P and an example application to log data quality control and reconstruction using machine learning.

Module 2. Data Science Toolkit - Notebooks, Visualization, and Communication

  • Overview of the data science toolkit.
  • Hands-on workshop introducing the toolkit and getting started with Python scripts and notebooks.
  • Overview of how to manage and use Python packages.
  • Hands-on workshop on Python packages covering how to install and manage packages, and how to use packages from your Python notebooks.
  • Introduction to data visualization with SandDance.
  • Hands-on workshop introducing SandDance for interactive data visualization using a dataset of offshore wells from the UK Continental Shelf.
  • Overview of Markdown, a lightweight markup language for adding simple formatting to plain text documents, and documenting Python notebooks.
  • Hands-on workshop on Markdown for formatting text documents and annotating Python notebooks.

Day 2
Module 3. Python Fundamentals

  • Python 101 - Introduction to Python fundamentals including variables, types, statements, expressions, control flow, and functions.
  • Hands-on workshop on Python 101.
  • Python 102 - More Python fundamentals including modules, files and folders, data structures, and data frames.
  • Hands-on workshop on Python 102.

Day 3
Module 4. Computational Thinking

  • Introduction to Computational Thinking - the analytical and logical processes of decomposing a complex task and expressing it in a form that can be performed by a computer.
  • Hands-on workshop on Computational Thinking applied to the design and implementation an interactive base map for UK E&P data.

Module 5. Exploratory Data Analysis

  • Exploratory Data Analysis - Introduction to the Exploratory Data Analysis process and key Python packages for data analysis and statistical graphics.
  • Hands-on workshop on exploratory data analysis of daily production data from the Vulcan gas field in the UK Southern North Sea - reading data, handling dates, cleaning values, resampling, merging datasets, creating statistical graphics, exporting results.
  • Statistical Graphics - Why visualization is so important. Introduction to the Plotly package for statistical graphics. A classification of statistical graphics. Demonstration of a gallery of statistical graphics samples.
  • Hands-on workshop on statistical graphics - using the Plotly Express package to create a gallery of statistical graphics samples. Code snippets (small blocks of reusable code) help make exploratory data analysis more fun by accelerating the journey from raw data files to working graphics.
  • Descriptive Statistics - Introduction to univariate and multivariate statistics.

Day 4
Module 6. Exploring E&P Data

  • Well header data - Introduction to handling well header data (surface location and attributes) using the pandas and plotly packages.
  • Hands-on workshop on well header data - including import, data cleaning, date handling, posting well data on cultural/satellite base map and visualizing historical trends.
  • Production data - Introduction to handling field production data using the pandas and plotly packages.
  • Hands-on workshop on field production data - including import, data cleaning, date handling, queries, visualizing hierarchical and time series data.
  • Well log data - Introduction to handling wireline logs from LAS files using the lasio, pandas, and plotly packages.
  • Hands-on workshop on well log and tops data - including LAS file import, merging tops, and data visualization.
  • Seismic data - Introduction to handling seismic SEG-Y data using the segyio, and plotly packages.
  • Hands-on workshop on seismic data - including SEG-Y file import, extracting binary and trace headers, visualizing seismic trace data, and calculating seismic attributes.

Day 5
Module 7. Geospatial Data

  • Coordinate reference systems - Introduction to geographic and projected coordinate systems, defining a coordinate reference system from EPSG codes, offsets between coordinate reference systems, and transforming positions between reference systems.
  • Hands-on workshop on coordinate reference systems - how to define a coordinate reference system and transform positions using the pyproj package.

Module 8. Machine Learning Fundamentals

  • Machine Learning - introduction to the fundamentals of machine learning including background concepts, the different types of machine learning, and the basic workflow to build and evaluate models from data.
  • Supervised learning with regression - introduction to regression including random forest regression and performance evaluation.
  • Hands-on workshop on regression for reconstructing wireline logs.
  • Unsupervised Learning - introduction to unsupervised learning for dimensionality reduction, clustering and outlier detection.
  • Hands-on workshop on dimensionality reduction for wireline logs.
  • Explainable Machine Learning - introduction to explainable machine learning: techniques for looking inside the so-called black box models of machine learning to understand why particular predictions are made and which variables are important.

Similar events

Learn workflows to apply sequence stratigraphic concepts to the interpretation of core, well log, seismic and outcrop data.

More Information

Learn how to interpret faults and critically assess their impact on exploration traps and complex reservoir development.

More Information

A comprehensive examination of seismic stratigraphy to aid exploration and development of plays in mixed carbonate-siliciclastic systems, using examples from the Permian Basin.

More Information

Become a more effective member of a multi-disciplinary team by developing a solid understanding of engineering concepts and terminology.

More Information

Good seismic structural interpretation and analysis is required in everything from exploration risking to reservoir management. These examples and elements are discussed during the course.

More Information

This course focuses on large scale rules, risks, uncertainties, strategies and workflows to aid in de-risking carbonate exploration.

More Information

Learn core handling and core description techniques and how to integrate core-based facies analysis into reservoir characterization of siliceous and calcareous mudstones, muddy sandstones, and sandstones.

More Information

Study world-class outcrops displaying sandbody architecture in a variety of stratigraphic and structural settings to improve understanding of reservoir geometry and fluid flow characteristics.

More Information

Examine superb outcrops to develop better understanding of clastic shoreline and shelf systems and apply sedimentology and sequence stratigraphic concepts to build depositional models and predict facies distributions. 

More Information

Develop the skills necessary to identify and predict new prospects and better subdivide reservoirs by applying the concepts of sequence stratigraphy.

More Information

This course examines the structural geology of salt basins, the interactions between salt and surrounding strata, and includes interpretation of seismic data in salt settings.

More Information

Superb outcrops illustrate the range of depositional facies and tectonics that influence the style and intensity of faulting, folding and fracture development.

More Information

Learn the language of reservoir engineers, as well as what you should expect of them and how you can help them.

More Information

This software-independent course examines the reasons why reservoir models often disappoint and offers solutions for building more efficient, fit-for-purpose models.

More Information

Establishes the value of seismic data and provides a solid grounding in seismic interpretation techniques for early-career geoscientists, technical support staff, engineers and managers.

More Information

An overview of seismic technologies that might be used effectively along with geological and engineering data to solve exploration and production problems.

More Information

Creativity and innovation are learnable skills: lectures, discussions and exercises develop creative thinking techniques, models and frameworks that can be applied to oil and gas industry situations.

More Information

Application of the techniques described has consistently provided new interpretations that have led to new field discoveries and/or identification of stratigraphic compartments within existing fields.

More Information

Learn to use play fairway mapping and petroleum system analysis to identify and high-grade potential plays and prospects.

More Information

Learn to use petroleum systems analysis (regional geology, geochemistry and petroleum systems modeling) to evaluate unconventional/resource play reservoirs.

More Information

Of particular interest to individuals evaluating the pre-salt of Brazil and West Africa but will appeal to all geoscientists who wish to expand their knowledge of non-marine  carbonate reservoirs.

More Information

A course that will guide participants through the lifecycle of a CCUS project with emphasis on key concepts, processes, and workflows of the CCUS industry.

More Information

This course provides an analysis-level treatment of fault geometry, characterization of seal effectiveness, and assessment of rupture hazard with application to hydrocarbon exploration, reservoir development and management, fluid pressure containment analysis for CCS, and induced seismicity hazard assessment. 

More Information

The course investigates world-class outcrops to introduce engineers to a wide spectrum of stratigraphic and structural features commonly found in exploration and production.

More Information

A field course focusing on the analysis of deepwater lithofacies, stratal geometries and key stratigraphic surfaces.

More Information

The course aims to improve the understanding of uncertainties in the mapping of complex fault zones and the processes that create potential seals and compartmentalisation in reservoirs in the subsurface for oil and gas as well as CO2.

More Information

A course aimed at women working in the energy industry as geoscientists, with a technical focus on salt tectonics.

More Information

This course will offer geoscientists an understanding of how they can use and adapt their expertise gained in the oil and gas industry to the growing geothermal industry.

More Information

Learn the theory and techniques required to assess trap and seal integrity in hydrocarbon subsurface reservoirs and carbon storage complexes.

More Information

Gain critical insights into the nature and behavior of thick evaporite sequences and learn to interpret salt structures in the subsurface.

More Information

Explore natural deformation at outcrop and tie to subsurface data to aid in fracture prediction in unconventional and conventional reservoirs.

More Information

Understand depth migrated seismic data and connect these data to geological settings for prospect evaluation and generation.

More Information

A comprehensive approach to fractures, teaching how to describe, measure and analyze them and to evaluate their impact on permeability in conventional and unconventional hydrocarbon reservoirs, and for EOR, CO2 sequestration and geothermal energy applications.

More Information

An in-depth introduction to clastic reservoirs, with a focus on stratigraphic and structural heterogeneities that impact reservoir prediction and production.

More Information

Learn to integrate all available data to develop an understanding of reservoir architecture and create fit-for-purpose reservoir maps for exploration, development and production projects.

More Information

Learn how to plan successful onshore seismic acquisition projects and how to work with contractors to execute these projects.

More Information

Understand how to plan successful offshore seismic acquisition projects and how to work with contractors to execute these projects. 

More Information

Explains the factors that control mudrock reservoir quality and presents practical methods to evaluate reservoir heterogeneity

More Information

A concise overview of the basics of sedimentology, stratigraphy and structure of clastic units; and of the environments of deposition of clastic sediments.

More Information

A course that introduces the fundamentals of geomechanics followed by an in-depth look at the relevance to unconventionals especially how geomechanics relates to the effect of fabric and heterogeneity.

More Information

This course classifies unconventional reservoirs from a petroleum systems perspective and reinforces the concepts of depositional controls on reservoir architecture and fractures.

More Information

This course will introduce the fundamentals of land seismic acquisition including receiver types and their spectrum indication.

More Information

Carbonate systems present different seismic responses and geobody geometries to clastic systems; this course examines the characteristics of seismic imaging of carbonate systems.

More Information

This course provides an introduction to siliciclastic facies in all aqueous settings focusing on sand deposition for application to conventional reservoirs.

More Information

This class provides an overview of seismic wave propagation, discusses important issues related to seismic data acquisition and imaging, and introduces students to practical seismic interpretation workflows including mapping techniques.

More Information

The course will examine the depositional facies and stratal geometries developed in deepwater slope and channel environments.

More Information

This field course will use outcrops from the Cretaceous and Jurassic of Utah to analyze some of the major challenges facing the storage of CO2 in subsurface formations.

More Information