|
Domain Tagging With Supervised Classification for Retrieval Augmented Translation
|
|
0
|
47
|
January 17, 2025
|
|
Using Topic Modeling to Cluster User Translation (BERTopic)
|
|
0
|
110
|
November 28, 2024
|
|
Sentence Length Proportions As Data Cleaning Heuristic
|
|
3
|
60
|
January 7, 2025
|
|
Validating Data Cleaning for Translation Model Training
|
|
0
|
45
|
December 7, 2024
|
|
Creating openpecha/cleaned_MT_v1.0.3
|
|
0
|
35
|
December 5, 2024
|
|
Toward a Cleaner Translation Dataset
|
|
0
|
78
|
November 3, 2024
|
|
Topic Modeling Buddhist Material in the Translation Dataset
|
|
2
|
49
|
November 28, 2024
|