17 Tantras SIG Proposal

SUMMARY
Version [e.g., 1.0]
Mission [17 Tantra common-spell]
Champion(s): [@Mikko] [@Jordhen] [@Gade]
Communication: - Discord Channel - GitHub Team - Google Calender
Documentation: - PRD(Requirements) - Progress Board - Visual - Meeting Minutes

Proposal Details

1. Problem Statement / Motivation

The 17 Tantra texts exist in multiple versions, each containing numerous orthographic and phonetic variations. This lack of standardisation poses challenges for scholars. To facilitate accurate textual analysis, digital preservation, and collaborative research, there is a pressing need to create a machine-readable lexicon, compile a critically collated e-text from different versions, document variant spellings and phonetic forms, and produce a consensus-based critical edition.

2. Previous Work

  1. Resource Identification & OCR; Located three different versions -tharpa ling parma (ཐར་པ་གླིང་པར་མ།), Azom parma (ཨ་འཛོམ་པར་མ), and Sichuan Merig Publisher House (སི་ཁྲོན་མི་རིགས་དཔེ་སྐྲུན་ཁང་།) and applied OCR for Tibetan text digitisation.
  2. Text Cleaning & Normalisation; Post-processing of OCR output for version-specific accuracy.
  3. Critical Collation & Footnotes; Compared variants and documented differences with detailed footnotes using Pecha tools.
  4. Quality Assurance; Conducted thorough proofreading and validation for accuracy.

3. Scope

  • In Scope:

    • Creation of a machine-readable lexicon.

    • Creation of collated e-text from three different versions.

    • Compilation of variant spellings and phonetic forms used across versions.

    • Critical edition of 17 tantra for consensus on common-spell decisions.

  • Out of Scope:

    • Does not include full translations, historical analysis, full root text standardisation, or spelling changes to unrelated Tibetan texts.

4. Potential Deliverables

  1. A machine-readable lexicon and collated e-text combining three versions of the 17 Tantras
  2. A compilation of spelling and phonetic variants, leading to a draft critical edition based on common-spell consensus.
  3. Structured E-Text: Upload and format the full text of the 17 Tantras in digital form, ensuring clean, accurate, and standardised layout.
  4. Source Citations: Provide properly formatted references and citations for the text based on recognised scholarly sources.
  5. Source Image Inclusion: include one version image of a manuscript or pecha version per text, clearly linked as part of the source section.
  6. Pedurma Notes Integration: Add textual notes based on Pedurma (comparative edition) to highlight variant readings or important editorial observations in the source area.
  7. Interactive Page Referencing: Implement left-side margin page numbering linked to scanned source images, with corresponding text highlights for improved navigation and visual reference.
  8. Stanza Numbering with Internal Links: Display stanza numbers on the right margin, each linked to its corresponding section.

5. Proposed Approaches

  1. Use Pecha tools to compare different versions of the text
  2. Document spelling variations in a structured and consistent format
  3. Engage experts to support the preparation of a critical edition
  4. Enhance a prototyp@MikkoMikkoKotil tool for detecting and applying co@Username1lated@Username2spellings.

Members:

  • [@MikkoMikkoKotil] —@Username1Cham@Username2ion

  • [@kaldan] — Developer

  • @Username1

    [@lhakpa_wangyal]@Username2— Coordinator

Contributors / Annotators:

  • @Username1
  • @Username2
  • @Username3

SIG Meeting Calendar


Related SIGs and Groups


Useful Links