ℹ️ BEC Homepage

BDRC Etext Corpus Project

Mission: To publish a foundational dataset of Tibetan Buddhist Literature for AI and Research by collecting, OCRing, cataloging, cleaning, and aligning etexts.

:high_voltage: Quick Actions

:speech_balloon: Chat: Join our Discord channel
:building_construction: GitHub Board: Active Code Sprint / Active Data Sprint
:world_map: Flowchart: Excalidraw+
:date: Meeting Minutes: All minutes

WG Meeting Calendar

  • Annual: Strategic Planning (Board & Leads) to set the organizational vision and objectives.
  • Bi-monthly: OKR & Epic Drafting Workshop; OKR and Epic Setting; Strategy & Roadmap Review
  • Sprint (Bi-Weekly): A 14-day cycle comprising Community Hubs Review, Sprint Planning, Contributor Syncs, and Demo & Vote.
  • Daily: Standup meetings

:world_map: Strategic Roadmap (Active Epics)

Major initiatives we are prioritizing this quarter.

| Status | Epic Title | :open_book: Read (PRD) | :hammer_and_wrench: Do (Work) |
| --- | --- | --- | --- |
| :yellow_circle: In Progress | Establish Gold Standard Catalog *Collection of the most accurate (manually transcribed) digital versions of Buddhist texts.* | [View Spec] | |
| :green_circle: Planning | Develop OCR Evaluation Framework *description* | View Spec | View Label |
| :green_circle: Planning | Refine OCR Models & Training Data Framework *description* | View Spec | View Label |
| :green_circle: Planning | Launch Modern Text Acquisition *description* | View Spec | View Label |
| :yellow_circle: In Progress | Build Cataloging & Outlining Tools *description* | View Spec | View Label |
| :green_circle: Planning | Initiate Text Boundary Annotation *description* | View Spec | View Label |

:busts_in_silhouette: Members and Roles

Voters:

Non-voting advisors:

Contributors:

  • @Elie_Roux - Tech Lead
    • Gabor
    • @Tashi_Tsering - OPS / AI engineer @tash
      • Arihant - OPS / AI
    • @Kaldan - data / AI engineer
    • @Ganga_Gyatso - data / AI engineer
      • Tsethar (Cataloger)
      • Sonam_Gyaltso (Cataloger)
      • Tenzin_Norbu (Cataloger)
      • lhujam_tashi789 (Cataloger)
    • @Tashi_Dhondup - Data Collection
      • Text Alignment
      • Transcription

:hammer_and_wrench: Contribution Zone

Ready to help? Pick a task based on your time availability.


:books: Library & Governance