
Encoding lithological drilling descriptions with GEOBERTje
VITO experts in the (sub)surface, together with IT experts, developed a domain-specific large language model, 'GEOBERTje'. This model was trained on a large number of Dutch lithological drill descriptions from the Flanders Subsurface Database (DOV) and can automatically code drill descriptions into primary and secondary lithologies.
GeoBERTje opens up new possibilities to utilise large datasets in (3D) models and analyses. VITO makes the code freely available, so that it can be used and potentially further developed for performing other specific tasks.


Want to know more?
Would you like more information about GEOBERTje or other innovative projects related to the (shallow) subsurface? Contact Katrijn Dirix.
Homegrown Innovation: IT in Geological Research
This research is a result of VITO's own innovation budget in the realm of the (sub)surface. It is an example of how VITO invests in its data-driven research capabilities with advanced hardware and data systems. These form the basis for the development of innovative digital tools, thanks to the close collaboration between geologists and IT experts. VITO develops the tools to efficiently unlock complex data for partners, such as the Flemish government. The tools can be used in projects, but can also inspire new lines of research.