3D lithological mapping of borehole descriptions using word embeddings

dc.contributor.authorFuentes I.
dc.contributor.authorPadarian J.
dc.contributor.authorIwanaga T.
dc.contributor.authorVervoort R.W.
dc.date.accessioned2023-07-23T01:58:17Z
dc.date.available2023-07-23T01:58:17Z
dc.date.issued2020
dc.description.abstractIn recent years the exponential growth in digital data and the expansion of machine learning have fostered the development of new applications in geosciences. Natural Language Processing (NLP) tackles various issues that arise from using human language data. In this study, NLP is applied to classify and map lithological descriptions in a three dimensional space. The data originates from the Australian Groundwater Explorer dataset of the Bureau of Meteorology, which contains the description and geolocation of bores drilled in New South Wales (NSW), Australia. A GloVe model trained with scientific journal articles and Wikipedia contents related to geosciences was used to obtain embeddings (vectors) from borehole descriptions. In parallel, and as a baseline, the descriptions were classified combining regular expressions and expert criterion. The description embeddings were subsequently classified using a multilayer perceptron neural network (MLP). The performance was evaluated using different accuracy metrics. The embeddings were triangulated and the resulting embeddings were classified using the trained MLP and compared against a nearest neighbour (NN) interpolation of lithological classes. The mapping of the descriptions was carried out by using 3D voxels. Coupling NLP with supervised classification alternatives and interpolation methods resulted in reasonable 3D representation of lithologies. This methodology is a first step in demonstrating the applicability of NLP to the geosciences, which also allows for an uncertainty quantification in the different steps of the process, such as classification and interpolation. Interpolation techniques, although acceptable, might be replaced by machine learning techniques to improve the performance of 3D models.ru_RU
dc.identifier.citationComputers & Geosciences, 2020, 141, 104516ru_RU
dc.identifier.doi10.1016/j.cageo.2020.104516
dc.identifier.urihttps://repository.geologyscience.ru/handle/123456789/41598
dc.language.isoenru_RU
dc.subjectNatural language processingru_RU
dc.subjectgeoscienceru_RU
dc.subjectGloVe modelru_RU
dc.subjectword embeddingsru_RU
dc.subjectNew South Walesru_RU
dc.subjectAustraliaru_RU
dc.title3D lithological mapping of borehole descriptions using word embeddingsru_RU
dc.typeArticleru_RU

Файлы

Оригинальный пакет

Показано 1 - 1 из 1
Загрузка...
Изображение-миниатюра
Имя:
Fuen_20.pdf
Размер:
3.5 MB
Формат:
Adobe Portable Document Format
Описание:

Пакет лицензий

Показано 1 - 1 из 1
Загрузка...
Изображение-миниатюра
Имя:
license.txt
Размер:
1.71 KB
Формат:
Item-specific license agreed upon to submission
Описание: