Auto-summarization of the texts of construction dispute precedents

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Advancements in text analysis are driving the adoption of document automation in the construction industry. Despite significant financial losses from construction disputes, efforts to automate document processes in this domain remain limited. Effective dispute management requires the rapid identification of relevant precedent cases to help practitioners respond appropriately. However, the complexity and length of such texts pose challenges to quick comprehension. This study presents a natural language processing (NLP) model for automatically summarizing construction dispute case texts. The model was tested on 300 U.S. construction dispute cases sourced from the Westlaw database. Various NLP models, including large language models (LLMs) such as OpenAI's models and BERT, were evaluated, achieving an F-score of approximately 0.39 based on the ROUGE-L metric. To accomplish the domain-specific objective of summarizing construction precedent cases, this study explored multiple approaches, including data preprocessing, fine-tuning, and model engineering using LangChain. Furthermore, this study aims to develop models for summarizing legal precedent texts and investigates methods to capture the distinctive characteristics of construction dispute data compared to general legal texts. The models were validated through domain experts who recognize the unique nature of construction disputes, enhancing the reliability of the evaluation process. The findings contribute significantly to the automation of construction dispute document summarization, enabling practitioners to manage such cases more efficiently.

Original languageEnglish
Article number103381
JournalAdvanced Engineering Informatics
Volume65
DOIs
StatePublished - May 2025

Keywords

  • Construction dispute
  • Dispute precedent
  • Large language model
  • Natural language processing
  • Text summarization

Fingerprint

Dive into the research topics of 'Auto-summarization of the texts of construction dispute precedents'. Together they form a unique fingerprint.

Cite this