• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer

Center for Artificial Intelligence and Cybersecurity – AIRI

  • Home
  • About Us
    • Center Activities
    • Vision, Mission and Goals
    • Center Faculty
    • Steering Committee
    • Press
  • Research
    • Scientific Projects
    • Research Papers
  • Laboratories
    • Machine Learning
    • Natural Speech & Language Processing
    • Blockchain Technology
    • Information Processing & Pattern Recognition
    • AI in Medicine
    • Data Mining
    • Computer Vision
    • Complex Networks
    • Human-Computer Interaction
    • Maritime Cybersecurity
    • Autonomous Navigation
    • AI in Mechatronics
    • AI in Education
    • Hybrid Computational Methods
    • Drug Design
    • Legal Aspects of AI
    • Ethically Aligned AI
    • Cultural Complexity
    • Trustworthy and Explainable AI
  • Collaboration
    • Industry Collaboration
    • Industry Projects
    • International Collaboration
  • News
  • Contact
  • Login

Pretraining and evaluation of BERT models for climate research

03.11.2025

Motivated by the pressing issue of climate change and the growing volume of data, we pretrain three new language models using climate change research papers published in top-tier journals. Adaptation of existing domain-specific models based on Bidirectional Encoder Representations from Transformers (BERT) architecture is utilized for CliSciBERT (domain adaptation of SciBERT) and SciClimateBERT (domain adaptation of ClimateBERT) and pretraining from scratch resulted in CliReBERT (Climate Research BERT). The performance assessment is performed on the climate change NLP text classification benchmark ClimaBench. We evaluate SciBERT, ClimateBERT, BERT, RoBERTa and DistilRoBERTa – along with our new models – CliReBERT, CliSciBERT and SciClimateBERT – using five different random seeds on all seven ClimaBench datasets. CliReBERT achieves the highest overall performance with a macro-averaged F1 score of 65.45%, and performs better than other models on three out of seven tasks. Additionally, CliReBERT demonstrates the most stable fine-tuning behavior, yielding the lowest average standard deviation across seeds (0.0118). The 5-fold stratified cross-validation on the SciDCC dataset showed that CliReBERT achieved the highest overall macro-average F1 score (53.75%), performing slightly better than RoBERTa and DistilRoBERTa, while the domain-adapted models underperformed their base counterparts. The results show the usefulness of the new pretrained models for text classification in the climate change domain and underline the positive influence of domain-specific vocabulary.

Authors:
Andrija Poleksić, Sanda Martinčić-Ipšić
Journal:
Discover Applied Sciences
Publishing date:
24.10.2025
View original article

Primary Sidebar

Latest Projects

Advanced Data Analysis Using Digital Signal Processing and Machine Learning Techniques

Compound Flooding in Coastal Rivers in Present and Future Climate

Data Processing on Graphs

North Adriatic Hydrogen Valley

Data Governance and Intellectual Property Governance in Common European Data Spaces – DGIP-CEDS

Latest Research Papers

Pretraining and evaluation of BERT models for climate research

Digital Twin-Driven Federated Learning and Reinforcement Learning-Based Offloading for Energy-Efficient Distributed Intelligence in IoT Networks

Forecasting the Trajectory of Personal Watercrafts Using Models Based on Recurrent Neural Networks

A System for Real-Time Detection of Abandoned Luggage

Enhancing Biophysical Muscle Fatigue Model in the Dynamic Context of Soccer

Latest News

Pretraining and evaluation of BERT models for climate research

Invited lecture: “About the first GPS receiver on the Moon, and the other NASA space PNT stories” by James J. Miller (NASA)

Agreement on collaboration between the Faculty of Engineering in Rijeka and the Shanghai Artificial Intelligence Research Institute

Arian Skoki defended his doctoral thesis “Data-Driven Assessment of Player Performance and Recovery in Soccer”

Anna Maria Mihel defended her PhD dissertation topic

We provide the expertise for solving real world problems using AI

If your company wants to implement artificial intelligence in your products or services, or increase your level of cybersecurity, our multidisciplinary team of scientists is your ideal partner.

Contact us

Footer

Center for Artificial Intelligence and Cybersecurity
  • jlerga@airi.uniri.hr
  • +385 51 406 500

University of Rijeka

University of Rijeka

About the Center

  • About Us
  • News
  • Privacy Policy
  • Contact

Center Activities

  • Laboratories
  • Scientific Projects
  • Industry Projects
  • Research Papers
  • Industry Collaboration
  • International Collaboration

Footer bottom left

© 2020 Center for Artificial Intelligence and Cybersecurity, all rights reserved.

Designed & developed by Nela Dunato Art & Design