Marie Puren

Open Access to Data about Silk Heritage: A Case Study in Digital Information Sustainability

Abstract

This article builds on work conducted and lessons learned within SILKNOW, a research project that aimed at enhancing the preservation and digital dissemination of silk heritage. Taking the project and this heritage typology as a case study in the digital transformation of cultural heritage institutions, it illustrates specific challenges that these institutions must face and demonstrates a few innovative answers to meet those challenges. The methodology combines approaches typical of the humanities and others usual in ICT, being inductive regarding materials and methods (consisting of a detailed review of existing online repositories and research projects devoted to textile heritage) and descriptive for the results and discussion (which explain at length the development of some tools and resources that responded to the needs detected in the previous analysis). The article reports on the state of the art and recent developments in the field of textile heritage, the tools implemented to allow the semantic access and text analysis of descriptive records associated with silk fabrics, and the spatiotemporal visualization of that information. Finally, it argues that institutional policies, namely the creation and free dissemination of open data related to cultural heritage are just as important as technical developments, showing why any future effort in these areas should take data sustainability, both in its technical and in institutional aspects, into account, since it is the most responsible and reasonable approach in terms of efficient resource allocation.

Continue reading

Explorer les débats parlementaires français de la troisième république par leurs sujets

By Marie Puren, Aurélien Pellet

2023-06-01

In Humanistica 2023

Abstract

Cet article compare trois méthodes pour explorer de grands corpus de documents historiques par leurs sujets. Nous travaillons ici sur les débats parlementaires franais de la Troisième République, qui se prêtent particulièrement bien à ce type d’analyse. Après avoir présenté le contexte de cette étude, nous exposons les résultats obtenus avec trois méthodes issues du traitement automatique des langues et appliquées sur des textes publiés entre 1876 et 1914 : l’allocation de Dirichlet latente, les plongements de mots et le Transfer Learning.

Continue reading

Conceptual modeling of european silk heritage with the SILKNOW data model and extension

By Marie Puren, Pierre Vernus

1970-01-01

In Digital Humanities Quarterly

Abstract

Silk holds significant historical importance in European history, fostering economic growth, innovation in weaving techniques, and the creation of exceptional artifacts. Despite the conservation efforts of numerous institutions, silk heritage remains at risk due to its fragile nature. This paper outlines the methodology employed by the Horizon 2020 SilkNow project aimed at enhancing the promotion and preservation of silk-related cultural heritage collections. We present the development of a CIDOC CRM-based data model for the creation of a comprehensive knowledge graph. We also introduce the SilkNow extension, designed to encapsulate the intricate semantics associated with the production processes of silk fabrics. Our results demonstrate the potential of Semantic Web technologies in safeguarding and enriching the visibility of silk heritage through improved data interoperability and accessibility.

Continue reading

InTEIrviews: An ODD for qualitative interviews in the humanities

By Marie Puren, Florian Cafiero

1970-01-01

In Journal of the Text Encoding Inititative

Abstract

Qualitative interviews are a crucial research method for many disciplines in the humanities and social sciences. Yet apart from rare initiatives, interview transcriptions are seldom shared with other researchers. Even when they are, annotations are mostly absent, and rarely standardized or designed to be read by anyone but the initial research team. In this paper, we advocate for a more open management of these resources, and present a proposition for an XML-TEI–conformant standard that allows for their accurate transcription and annotation. The ODD we present is aimed at facilitating systematic analyses of corpora of interview transcriptions, as well as ensuring wider dissemination and reusability of these resources. Relying entirely on existing TEI elements to create this ODD, we combine primarily the elements and attributes declared by the modules “Transcription of Speech” and “Language Corpora.” This ODD is designed to help describe the actual development of the interview, transcribe the interpretations made by the researcher(s), and tackle ethical or anonymity issues.

Continue reading