Abstract
This article presents a novel approach for the citation network construction from Jewish Responsa literature based on automatic extraction of references from texts. Jewish Responsa literature contains thousands of answers to questions related to Jewish law (Halachah), spanning over 1,300 years by authors from all over the world. This literature is abundant with references, but because of their high lexical and format variability their automatic identification and extraction is very challenging. In this article we present a novel, multi-layered approach that splits the reference extraction task into two main subtasks: (i) reference boundaries' identification; (ii) reference internal components' identification. We experimented with several different machine learning models: Conditional Random Field (CRF) model, Bidirectional Encoder Representations from Transformers (BERT) model, and a combined approach, BERT-CRF. Additionally, we examined the influence of the training corpus on the model's accuracy by comparing the performance of the models trained on modern Hebrew vs. Rabbinic Hebrew. We found that the best results were achieved by a BERT-CRF model trained on Rabbinic Hebrew. The constructed network can be utilized to build various tools for analyzing trends and influences in the Jewish Halachic corpus, such as the most influencing authors, the authors' sources of authority, and their evolution over time and place.
| Original language | English |
|---|---|
| Journal | Journal on Computing and Cultural Heritage |
| Volume | 18 |
| Issue number | 2 |
| DOIs | |
| State | Published - 19 Apr 2025 |
Bibliographical note
Publisher Copyright:© 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.
Keywords
- Bidirectional Encoder Representations from Transformers
- Conditional Random Fields
- Jewish Rabbinic literature
- digital humanities
Fingerprint
Dive into the research topics of 'Automatic Construction of the Citation Network from the Medieval Jewish Responsa Literature'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver