Large language models possess some ecological knowledge, but how much?
Filip Dorm, Joseph W. Millard, Drew Purves, Michael Harfoot, Oisin Mac Aodha
Published 2026 in bioRxiv

ABSTRACT
Large Language Models (LLMs) have shown remarkable capabilities in question answering across various domains, yet their effectiveness in ecological knowledge remains underexplored. Understanding their potential to recall and synthesize ecological information is crucial as AI tools become increasingly integrated into scientific workflows. Here, we assess the ecological knowledge of two LLMs, Gemini 1.5 Pro and GPT-4o, across a suite of ecologically focused tasks. These tasks evaluate an LLM's ability to predict species presence, generate range maps, list critically endangered species, classify threats, and estimate species traits. We introduce a new benchmark dataset to quantify LLM performance against expert-derived data. While the LLMs tested outperform naive baselines, achieving around 20 percentage points higher accuracy in species presence prediction, they reach only around a third of the maximum achievable mean F1 score for range map generation and improve threat classification by only around 10 percentage points over random guessing. These results highlight both the promise and challenges of applying LLMs in ecology. Our findings suggest that domain-specific fine-tuning is necessary to improve ecological knowledge in LLMs. By providing a repeatable evaluation framework, our benchmark dataset will facilitate future research in this area, helping to refine AI applications for ecological science.
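The abstract reports range-map performance as a mean F1 score against expert-derived maps. A minimal sketch of how such a comparison could be scored, assuming both the LLM-generated and expert maps are flattened binary presence/absence grids; the `f1_score` helper and the toy grids below are illustrative, not the authors' actual evaluation code:

```python
# Score a predicted range map against an expert-derived one, treating both
# as flattened binary presence/absence grids (1 = species present in cell).

def f1_score(predicted, expert):
    """F1 between two equal-length binary presence/absence vectors."""
    tp = sum(p and e for p, e in zip(predicted, expert))          # correct presences
    fp = sum(p and not e for p, e in zip(predicted, expert))      # spurious presences
    fn = sum(e and not p for p, e in zip(predicted, expert))      # missed presences
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Hypothetical 4-cell grids: one true positive, one false positive, one miss.
predicted = [1, 1, 0, 0]
expert = [1, 0, 1, 0]
print(round(f1_score(predicted, expert), 2))  # -> 0.5
```

A benchmark's mean F1 would then average this per-species score over all evaluated species, which is why a low mean F1 can coexist with above-baseline presence-prediction accuracy.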
PUBLICATION RECORD
- Publication year
2026
- Venue
bioRxiv
- Publication date
2026-02-04
- Fields of study
Biology, Computer Science, Environmental Science
- Source metadata
Semantic Scholar