3D lithological mapping of borehole descriptions using word embeddings

Show simple item record

dc.contributor.author Fuentes I.
dc.contributor.author Padarian J.
dc.contributor.author Iwanaga T.
dc.contributor.author Vervoort R.W.
dc.date.accessioned 2023-07-23T01:58:17Z
dc.date.available 2023-07-23T01:58:17Z
dc.date.issued 2020
dc.identifier.citation Computers & Geosciences, 2020, 141, 104516 ru_RU
dc.identifier.uri https://repository.geologyscience.ru/handle/123456789/41598
dc.description.abstract In recent years the exponential growth in digital data and the expansion of machine learning have fostered the development of new applications in geosciences. Natural Language Processing (NLP) tackles various issues that arise from using human language data. In this study, NLP is applied to classify and map lithological descriptions in a three dimensional space. The data originates from the Australian Groundwater Explorer dataset of the Bureau of Meteorology, which contains the description and geolocation of bores drilled in New South Wales (NSW), Australia. A GloVe model trained with scientific journal articles and Wikipedia contents related to geosciences was used to obtain embeddings (vectors) from borehole descriptions. In parallel, and as a baseline, the descriptions were classified combining regular expressions and expert criterion. The description embeddings were subsequently classified using a multilayer perceptron neural network (MLP). The performance was evaluated using different accuracy metrics. The embeddings were triangulated and the resulting embeddings were classified using the trained MLP and compared against a nearest neighbour (NN) interpolation of lithological classes. The mapping of the descriptions was carried out by using 3D voxels. Coupling NLP with supervised classification alternatives and interpolation methods resulted in reasonable 3D representation of lithologies. This methodology is a first step in demonstrating the applicability of NLP to the geosciences, which also allows for an uncertainty quantification in the different steps of the process, such as classification and interpolation. Interpolation techniques, although acceptable, might be replaced by machine learning techniques to improve the performance of 3D models. ru_RU
dc.language.iso en ru_RU
dc.subject Natural language processing ru_RU
dc.subject geoscience ru_RU
dc.subject GloVe model ru_RU
dc.subject word embeddings ru_RU
dc.subject New South Wales ru_RU
dc.subject Australia ru_RU
dc.title 3D lithological mapping of borehole descriptions using word embeddings ru_RU
dc.type Article ru_RU
dc.identifier.doi 10.1016/j.cageo.2020.104516


Files in this item

This item appears in the following Collection(s)

Show simple item record