
Initial 19th August 2025 Post:
Yesterday I created an open-source geological AI app that lists reference literature and, where relevant to the question, visualises plate reconstructions. It was just to illustrate how we can use Large Language Models (LLMs) to enable natural language querying of multiple sources, both unstructured and structured, to help us gather and synthesize disparate information.
It is very easy to use off-the-shelf LLMs that already understand much geological terminology. I used the open Gemma model here as an example; you can either download an LLM and run it locally for total privacy, or use a free Hugging Face inference endpoint. The references are from Crossref, arXiv and others. The plate reconstruction uses the excellent GPlates API from AuScope, and you can choose which reconstruction model to use. The Lithostratigraphic Units table is generated using the also excellent Macrostrat API from xDD University of Wisconsin-Madison.
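As an illustration of how little code one of these open sources needs, here is a minimal sketch of a reference lookup against the public Crossref REST API. It is not the app's actual code: the function names are hypothetical, and only the `works` endpoint with its `query` and `rows` parameters and the `message.items` response layout are taken from Crossref itself.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

CROSSREF_WORKS = "https://api.crossref.org/works"


def crossref_url(question, rows=5):
    """Build a Crossref search URL for a free-text question."""
    return CROSSREF_WORKS + "?" + urlencode({"query": question, "rows": rows})


def summarise_works(payload):
    """Pull (DOI, title) pairs out of a Crossref JSON response."""
    items = payload.get("message", {}).get("items", [])
    # Crossref returns "title" as a list; some records have none.
    return [
        (item.get("DOI", ""), (item.get("title") or ["(untitled)"])[0])
        for item in items
    ]


def search_crossref(question, rows=5):
    """Run the query over the network and return (DOI, title) pairs."""
    with urlopen(crossref_url(question, rows), timeout=30) as resp:
        return summarise_works(json.load(resp))
```

Calling `search_crossref("Jurassic of Egypt")` needs an Internet connection. The same pattern (build a URL, fetch JSON, pick out a few fields) would apply to the other sources, each with its own parameters.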
There is a lot of open-source data out there and it can be quite easy to develop your own AI apps in Python that run locally. You can use your own private data as well and keep everything local and secure.
I’ll add a few more things next weekend, clean up the code and share shortly. You can watch a video of the app and plate reconstructions here: https://www.linkedin.com/posts/paulhcleverley_gplates-geology-geoscience-activity-7363126587233370113-uw5Q?utm_source=share&utm_medium=member_desktop&rcm=ACoAAAJdxjgBSnNfsxghi8atnlNooAgz4mP6AtE
Update: 21st August 2025:
GEOAssist: Open-source Geological AI App V1.0 released! I added an autonomous agent that uses DuckDuckGo for Internet search, downloads PDFs and machine-reads them for additional terms to drive ‘adaptive search’.
All PDFs are saved to your downloads folder automatically, so you can easily collect hundreds of files or more for a given query. PDF filenames are not typically helpful, so files are also automatically renamed to author_date_title using text from inside the PDF, to aid geoscience literature management and browsing.
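The renaming step could look something like the sketch below. It assumes the author, year and title have already been extracted from the PDF text (that extraction is not shown here), and the function names, length limits and placeholder strings are my own illustrative choices rather than what GEOAssist does.

```python
import re
import unicodedata


def _slug(text, max_len):
    """Reduce text to ASCII letters and digits joined by hyphens."""
    ascii_text = (
        unicodedata.normalize("NFKD", text)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    words = re.findall(r"[A-Za-z0-9]+", ascii_text)
    # Truncate, then drop any hyphen left dangling at the cut.
    return "-".join(words)[:max_len].rstrip("-")


def pdf_filename(author, year, title, max_title_len=60):
    """Return an author_date_title filename, with placeholders for gaps."""
    parts = [
        _slug(author, 30) or "unknown-author",
        _slug(str(year), 4) or "n.d.",
        _slug(title, max_title_len) or "untitled",
    ]
    return "_".join(parts) + ".pdf"
```

Stripping punctuation and accents keeps the names safe on any file system, and the underscore is reserved for separating the three fields so that the files sort and split cleanly.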
You can control the number of adaptive rounds. For example, for the query “Discuss the Jurassic of Egypt”, GEOAssist autonomously found and downloaded over 200 relevant PDF papers and several hundred DOI references, and produced a Gen AI summary, as well as checking Macrostrat and GPlates for relevant data. I posted about some of this last weekend.
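For anyone wondering what an adaptive round might involve, here is a rough sketch of the idea: search, read the results for new terms, then search again with those terms added. The `search` and `extract_terms` callables are hypothetical stand-ins supplied by the caller (for example a web search and a PDF term extractor), and the loop is an assumption about the general approach, not a copy of the GEOAssist code.

```python
def adaptive_search(query, search, extract_terms, rounds=2, max_new_terms=3):
    """Run up to `rounds` searches, expanding the query between rounds.

    `search(q)` returns an iterable of document IDs (for example URLs).
    `extract_terms(doc_id)` returns candidate terms read from that document.
    Both are hypothetical callables provided by the caller.
    """
    seen_docs = []
    used_terms = {word.lower() for word in query.split()}
    current_query = query
    for _ in range(rounds):
        # Keep only documents not found in this or an earlier round.
        new_docs = []
        for doc in search(current_query):
            if doc not in seen_docs and doc not in new_docs:
                new_docs.append(doc)
        if not new_docs:
            break  # nothing new turned up, so stop early
        seen_docs.extend(new_docs)
        # Collect terms we have not used yet, in first-seen order.
        fresh = []
        for doc in new_docs:
            for term in extract_terms(doc):
                if term.lower() not in used_terms:
                    used_terms.add(term.lower())
                    fresh.append(term)
        if not fresh:
            break  # no new vocabulary to search with
        current_query = query + " " + " ".join(fresh[:max_new_terms])
    return seen_docs
```

The `rounds` argument plays the role of the adaptive-rounds setting: each extra round trades more search and download time for wider coverage.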
The purpose is to raise awareness and capabilities in the geological sciences around AI: how easy it is to build your own AI apps, the sorts of things you can do, and the open-source data that exists. It is easy to add your own private local sources and keep things secure for confidential content.
Code to run the app V1.0 is on GitHub. I will be updating it periodically. Link to the GitHub code below:
