data-cleaning
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Domain Tagging With Supervised Classification for Retrieval Augmented Translation |   | 0 | 44 | January 17, 2025 | 
| Using Topic Modeling to Cluster User Translation (BERTopic) |   | 0 | 109 | November 28, 2024 | 
| Sentence Length Proportions As Data Cleaning Heuristic |       | 3 | 56 | January 7, 2025 | 
| Validating Data Cleaning for Translation Model Training |   | 0 | 45 | December 7, 2024 | 
| Creating openpecha/cleaned_MT_v1.0.3 |   | 0 | 34 | December 5, 2024 | 
| Toward a Cleaner Translation Dataset |   | 0 | 76 | November 3, 2024 | 
| Topic Modeling Buddhist Material in the Translation Dataset |     | 2 | 47 | November 28, 2024 |