| Domain Tagging With Supervised Classification for Retrieval Augmented Translation | 0 | 52 | January 17, 2025 |
| Using Topic Modeling to Cluster User Translation (BERTopic) | 0 | 130 | November 28, 2024 |
| Sentence Length Proportions As Data Cleaning Heuristic | 3 | 65 | January 7, 2025 |
| Validating Data Cleaning for Translation Model Training | 0 | 45 | December 7, 2024 |
| Creating openpecha/cleaned_MT_v1.0.3 | 0 | 40 | December 5, 2024 |
| Toward a Cleaner Translation Dataset | 0 | 83 | November 3, 2024 |
| Topic Modeling Buddhist Material in the Translation Dataset | 2 | 50 | November 28, 2024 |