Linked Pasts 11 – Epigraphic Data in Wikidata: From Ontology to Practice
Dates: 2–3 December 2025
Duration: 1h30 each day
Convenors: Anna Clara Maniero Azzolini, Maxime Guénette, Anne Chen, Emily Helm
Format: Online
How participants joined: SAS event (LP11 programme)
Type of audience: Independent Scholars, Researchers, Professors, students (Historians)
Requirements: Wikidata account
Objectives: This session aimed to introduce and explore the use of Wikidata, a collaborative knowledge base, for epigraphy, providing both theoretical knowledge on linked open data (LOD) and ontologies, and practical skills for entering, querying, and managing epigraphic inscriptions.
Report
Day 1
Introduction, Theoretical Framework and Hands-On Practice
The first session opened with a short survey designed to understand participants’ academic backgrounds, their previous experience with Wikidata, and their motivations for attending the workshop. Although all participants had created a Wikidata account in advance, as requested, most had not yet engaged in structured editing within the platform. Some were particularly interested because of their work with Linked Open Data (LOD), while others had prior experience editing Wikipedia, the sister project of Wikidata.
Theoretical Framework: Linked Open Data and Wikidata
Before discussing Wikidata itself, we examined the concept of Linked Open Data (LOD) as a model for publishing structured data online in a way that is open, interoperable, and machine-readable. Rather than allowing data to remain in isolated silos, LOD promotes standardisation and connectivity, enabling datasets to be linked across institutions, disciplines, and national boundaries.
Within epigraphy, several highly valuable databases already exist (such as EDR, EDH, EDCS, EDB, and RIB) offering extensive documentation of inscriptions, including texts, editions, metadata, and findspots. While these resources are indispensable, they also present certain limitations: uneven geographical or chronological coverage, inconsistent multilingual support, restricted possibilities for user contributions, and querying systems that, although useful, can only operate within the boundaries of their individual databases.
What is often lacking is a more centralised, multilingual, and collaborative space in which inscriptions can be compared, corrected, enriched, and queried flexibly across datasets. In this context, Wikidata becomes particularly significant: as a collaborative knowledge base within the Wikimedia ecosystem, Wikidata stores structured data (rather than narrative articles as Wikipedia). Its infrastructure is based on Wikibase, an open-source software that both institutions and individuals can install.
An example discussed during the session was the way Wikipedia infoboxes automatically retrieve information from Wikidata: a single correction to a data item can propagate across multiple language versions of Wikipedia, ensuring consistency and broad dissemination: this demonstrates the scale and impact of structured data within a global knowledge network.
From a research perspective, Wikidata’s appeal lies in its openness, scale (with nearly 120 million entities), and full compatibility with the Linked Open Data model. In the epigraphic domain, inscriptions can be linked to related datasets such as Trismegistos, Pleiades, RIB, or museum catalogues, thereby situating them within wider archaeological, prosopographical, geographical, and material contexts. Given the inherently interdisciplinary nature of epigraphy, this interconnected model is particularly advantageous.
Data Structure and Modelling
A key component of the introduction involved explaining how Wikidata structures information. All data are organised in the form of triples (subject, predicate, object) comparable to concise declarative statements. Each element is assigned a unique identifier: items are labelled with Q-numbers and properties with P-numbers. In this way we can ensure clarity and disambiguation, allowing both humans and machines to interpret statements precisely.
Participants were also introduced to qualifiers and references. Qualifiers allow statements to be contextualised (for instance, specifying dates, certainty levels, or types), while references ensure verifiability (which is crucial for scientific data).
Practical Session
Following the theoretical overview, participants moved into breakout groups for a practical exercise. Anna Clara led the editing group, while Maxime and Emily guided querying activities.
The editing exercise focused on manual editing through the Wikidata interface, enabling participants to become familiar with adding statements, identifiers, references, and qualifiers. This slower, hands-on approach allowed for careful reflection on modelling choices and provided space for real-time feedback and discussion.
Meanwhile, the querying group explored how structured data can be retrieved and analysed using SPARQL. Participants were introduced to the Query Builder as an accessible entry point, followed by a brief demonstration of the more advanced Query Service for complex searches. The aim was to show how modelling decisions directly affect the kinds of questions that can be asked of the data.
Day 2
Advanced Modelling, Validation, and Case Studies
The second session began with a recap of the editing principles introduced in the first session, followed by more advanced demonstrations of query construction and exploration of additional Wikidata services in break-out rooms.
Epigraphic WikiProjects and Data Models
The session highlighted two epigraphy-related initiatives within Wikidata: the IDEA project and the Altinum project, both led by the authors. These projects provide concrete models for representing inscriptions and serve as collaborative spaces in which contributors discuss standards and best practices, thereby offering participants in the working groups the opportunity to engage directly with established data models.
The IDEA Project reunites material from Dura-Europos in Wikidata and by modelling inscriptions, papyri, coins, buildings, and other artefacts, the project demonstrates how LOD can reconstruct archaeological and historical contexts through highly queryable relationships. The Altinum Project was presented as a case study of integrating data from EDR, EDCS, and unpublished theses into Wikidata. It illustrated how consistent modelling enables inscriptions and associated individuals to be embedded within a shared, multilingual ecosystem, enhancing visibility and interoperability.
Anne and Emily also presented current epigraphic data models in Wikidata, explaining how inscriptions, metadata, and relationships to places, people, and objects are encoded. They discussed strategies for verifying whether inscriptions already exist in the database and introduced ShEx schemas as a method for validating data structures and ensuring modelling consistency.
Breakout Sessions
After this theoretical introduction, participants joined the break-out rooms according to which topic they wanted to explore. Anna Clara led the editing group, Anne the ShEx Schemas and Emily the querying group. Participants engaged with practical tasks tailored to each thematic focus, deepening their understanding of the topics.
Outcomes and Reflections
By the end of the two-day workshop, participants had:
- Developed a clear understanding of Wikidata as a collaborative, structured knowledge base aligned with the principles of Linked Open Data.
- Gained practical experience in manual editing, including the use of properties, items, values, qualifiers, and references.
- Acquired foundational skills in querying data using SPARQL, Query Builder, and, to a basic level, the Query Service.
- Developed awareness of current epigraphic data models and the importance of consistency and validation (including ShEx schemas).
- Reflected on the broader challenges and opportunities involved in modelling inscriptions within a global, open ecosystem.
Overall, the workshop successfully combined discussions with practical applications. It demonstrated the potential of Wikidata to support digital epigraphy by fostering collaboration, enhancing interoperability, and increasing the discoverability and reusability of inscriptional data across projects and disciplines.
The number of questions raised throughout the sessions, together with the consistently positive feedback, reflected participants’ engagement and willingness to contribute actively; moreover, several inscriptions were modelled with notable accuracy and coherence, demonstrating a solid understanding of the principles discussed.






























