Skip to main navigation Skip to search Skip to main content

Addressing Hallucination in Causal Q&A: The Efficacy of Fine-tuning over Prompting in LLMs

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

This paper presents our approach and findings for participating in the FinCausal 2025 competition (Moreno-Sandoval et al., 2025), which addresses causal question answering derived from financial documents, specifically English and Spanish annual reports. We investigate the effectiveness of generative models, such as Llama, in contrast to common extractive methods like BERT-based token classification. While prompt optimization and few-shot learning offer some improvements, they were insufficient for consistently outperforming extractive methods in FinCausal, suffering from hallucinations. In contrast, fine-tuning generative models was shown to be essential for minimizing hallucinations and achieving superior performance. Using our fine-tuned multilingual model for both tasks, we outperform our extractive and monolingual approaches, achieving top results for Spanish and second-best for English in the competition. Our findings indicate that fine-tuned large language models are well-suited for causal Q&A from complex financial narratives, offering robust multilingual capabilities and effectively mitigating hallucinations.

Original languageEnglish
Title of host publicationJoint Workshop of the 9th Financial Technology and Natural Language Processing, FinNLP 2025, the 6th Financial Narrative Processing, FNP 2025, and the 1st Workshop on Large Language Models for Finance and Legal, LLMFinLegal 2025
EditorsChung-Chi Chen, Antonio Moreno-Sandoval, Jimin Huang, Qianqian Xie, Sophia Ananiadou, Hsin-Hsi Chen
PublisherAssociation for Computational Linguistics (ACL)
Pages253-258
Number of pages6
ISBN (Electronic)9798891762091
Publication statusPublished - 2025
EventJoint Workshop of the 9th Financial Technology and Natural Language Processing, FinNLP 2025, the 6th Financial Narrative Processing, FNP 2025, and the 1st Workshop on Large Language Models for Finance and Legal, LLMFinLegal 2025, co-located with the 31st International Conference on Computational Linguistics, COLING 2025 - Abu Dhabi, United Arab Emirates
Duration: 19 Jan 202520 Jan 2025

Publication series

NameProceedings - International Conference on Computational Linguistics, COLING
ISSN (Print)2951-2093

Conference

ConferenceJoint Workshop of the 9th Financial Technology and Natural Language Processing, FinNLP 2025, the 6th Financial Narrative Processing, FNP 2025, and the 1st Workshop on Large Language Models for Finance and Legal, LLMFinLegal 2025, co-located with the 31st International Conference on Computational Linguistics, COLING 2025
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period19/01/2520/01/25

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Theoretical Computer Science

Fields of Expertise

  • Information, Communication & Computing

Fingerprint

Dive into the research topics of 'Addressing Hallucination in Causal Q&A: The Efficacy of Fine-tuning over Prompting in LLMs'. Together they form a unique fingerprint.

Cite this