The integration of artificial intelligence (AI) into biodiversity research and conservation is growing rapidly, demonstrating great potential to reduce the intensive human labour required for data preprocessing, thereby facilitating larger data collections that offer ecological insights at unprecedented scales. However, most of these AI applications for biodiversity are still in the early stages of development, hindered by challenges inherent in real‐world datasets and the limited accessibility of these technologies to practitioners without extensive programming knowledge. The recent advent of multimodal language models, which can process and generate multiple data modalities, has significantly expanded the realm of possible AI applications in biodiversity research. These models have demonstrated the ability to classify species and recognize more complex concepts, such as animal postures and orientations, without prior exposure during training. Multimodal language models can also provide explanations for their predictions and interact with humans in natural language, making them more transparent, intuitive and accessible to non‐specialists. Despite these advancements, the use of multimodal language models for biodiversity still faces unique barriers to application, including high computational and financial demands, reliance on prompt engineering for consistent model performance on large datasets and insufficient open‐source sharing of state‐of‐the‐art methods. This paper explores the transformative potential of multimodal language models for biodiversity research and discusses several possible applications. We also discuss the challenges of implementing these models in real‐world conservation scenarios and propose directions for future research to overcome these hurdles.
Our goal is to encourage robust discussions and research into the integration of multimodal language models to advance AI for biodiversity research and conservation.
New frontiers in artificial intelligence for biodiversity research and conservation with multimodal language models
Zhongqi Miao, Yuanhan Zhang, Zalan Fabian, Andres Hernandez Celis, Sara Beery, Chunyuan Li, Ziwei Liu, Amrita Gupta, Md Nasir, Wanhua Li, Jason Holmberg, Meredith S. Palmer, Kaitlyn M Gaynor, Pablo Arbeláez, Pengce Wang, R. Dodhia, J. Ferres
Published 2025 in Methods in Ecology and Evolution
PUBLICATION RECORD
- Publication year: 2025
- Venue: Methods in Ecology and Evolution
- Publication date: 2025-08-27
- Fields of study: Not labeled
- Source metadata: Semantic Scholar