From AIR to (AI)R : The use of LLM for interpreting archaeological excavation data

Nurra, Federico; Dell'Unto, Nicolo; Derudas, Paola

From AIR to (AI)R : The use of LLM for interpreting archaeological excavation data

Mark

Nurra, Federico ; Dell'Unto, Nicolo ^LU

and Derudas, Paola ^LU

(2024) Thematic Semester Digital Humanities and Artificial intelligence.

Abstract: Since 2021, the DarkLab at Lund University (LU) and the Digital Research Service at the French National Institute of Art History (INHA) have collaborated to develop state-of-the-art digital systems and tools for the management and publication of archaeological data, including information from fieldwork and artefact collections. Drawing on the combined expertise of both institutions, this partnership has led to the successful creation and launch of AIR (Archaeological Interactive Report). Following the international workshop on ‘Advanced 3D Archaeological Documentation and Linked Open Data’, held in Lund, Sweden, 17-19 April 2024, we began testing large language models (LLMs) for processing, transforming and interpreting archaeological... (More); Since 2021, the DarkLab at Lund University (LU) and the Digital Research Service at the French National Institute of Art History (INHA) have collaborated to develop state-of-the-art digital systems and tools for the management and publication of archaeological data, including information from fieldwork and artefact collections. Drawing on the combined expertise of both institutions, this partnership has led to the successful creation and launch of AIR (Archaeological Interactive Report). Following the international workshop on ‘Advanced 3D Archaeological Documentation and Linked Open Data’, held in Lund, Sweden, 17-19 April 2024, we began testing large language models (LLMs) for processing, transforming and interpreting archaeological data, with very promising results. The source data, accessible via the AIR API, is structured in JSON-LD and formalized according to the most widely used ontologies in the field, such as CIDOC CRM and CRM-Archaeo. We have tested two prominent LLMs, GPT-4 by OpenAI and the Mistral Large model by Mistral AI, to analyze this data. In this talk, we will present the results of this experiment: we will focus on data structure, standardized models, and the nuanced challenges of integrating semantics and ontologies into archaeological descriptions and narratives. The presentation will illustrate our approach to improving the interpretation of archaeological data using Large Language Models. (Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/0712fd28-19bf-42e6-98b5-15bda5c56524

author

Nurra, Federico ; Dell'Unto, Nicolo ^LU

and Derudas, Paola ^LU

organization

publishing date

2024

type

Contribution to conference

publication status

published

subject

Archaeology

keywords

Digital archaeology, LLMs, open data, archaeological interpretations

conference name

Thematic Semester Digital Humanities and Artificial intelligence.

conference location

Paris, France

conference dates

2024-12-10 - 2024-12-12

language

English

LU publication?

yes

id

0712fd28-19bf-42e6-98b5-15bda5c56524

alternative location

https://semtemiahn.hypotheses.org/final-conference

date added to LUP

2024-12-13 09:31:51

date last changed

2025-04-04 14:19:15

@misc{0712fd28-19bf-42e6-98b5-15bda5c56524,
  abstract     = {{Since 2021, the DarkLab at Lund University (LU) and the Digital Research Service at the French National Institute of Art History (INHA) have collaborated to develop state-of-the-art digital systems and tools for the management and publication of archaeological data, including information from fieldwork and artefact collections. Drawing on the combined expertise of both institutions, this partnership has led to the successful creation and launch of AIR (Archaeological Interactive Report). Following the international workshop on ‘Advanced 3D Archaeological Documentation and Linked Open Data’, held in Lund, Sweden, 17-19 April 2024, we began testing large language models (LLMs) for processing, transforming and interpreting archaeological data, with very promising results. The source data, accessible via the AIR API, is structured in JSON-LD and formalized according to the most widely used ontologies in the field, such as CIDOC CRM and CRM-Archaeo. We have tested two prominent LLMs, GPT-4 by OpenAI and the Mistral Large model by Mistral AI, to analyze this data. In this talk, we will present the results of this experiment: we will focus on data structure, standardized models, and the nuanced challenges of integrating semantics and ontologies into archaeological descriptions and narratives. The presentation will illustrate our approach to improving the interpretation of archaeological data using Large Language Models.}},
  author       = {{Nurra, Federico and Dell'Unto, Nicolo and Derudas, Paola}},
  keywords     = {{Digital archaeology; LLMs; open data; archaeological interpretations}},
  language     = {{eng}},
  title        = {{From AIR to (AI)R : The use of LLM for interpreting archaeological excavation data}},
  url          = {{https://semtemiahn.hypotheses.org/final-conference}},
  year         = {{2024}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

From AIR to (AI)R : The use of LLM for interpreting archaeological excavation data