Contributor Sync [BEC] - 2025/12/26 18:04 CST - Notes by Gemini

:link: Minutes :locked:

Summary

NT updated the team on testing a paid Google Meet account for transcription and improving project setup with milestones and OKRs, while Tashi Tsering reported that the back-end for the new alignment tool is deployed, and Karma Tsering is working on the front-end production. NT and Élie Roux discussed using AI to write and criticize clear project requirements before starting coding, and Karma Tsering demonstrated the MVP for both annotator and reviewer dashboards. ཨེ་མ (Ema) presented the finalized annotator board for tracking progress across different catalogs, Tenzin Kaldan reported on data uploads and developer onboarding, and Ganga Gyatso updated on the completion of the DOC to RTF transformation.

Suggested next steps

  • NT will share the link to the uploaded data from various websites and groups later today.
  • Élie Roux will make a first pass on Monday to pre-process some images to create a clean batch for transcription, and will send Tashi Tsering the link to the command line script for the BDRC app to provide OCR output as initial transcription.
  • Élie Roux will synchronize with Tashi Tsering early next week (Monday or Tuesday) on the first batch of images for transcription.
  • Tashi Tsering will upload the images for the transcription task to the S3 bucket to use the URLs for display in the UI.
  • Tenzin Kaldan will keep the completely original files for the data from Kurt in the source folder and send Élie Roux the folder information for the deleted redundant folder via Discord.
  • Élie Roux will look into a specific scan that Tenzin Kaldan found to be missing illustration compared to the BDRC scan.
  • Ganga Gyatso will upload the completed RTF to XML transformation files for QC on Google Drive when ready.
Transcription

A: dlskdjsdlksjd

B: dslkdjsldsjds