Aletheia-ng/amharic-pretraining-corpus
• Updated • 600k • 41
• Updated • 690M • 217
• Updated • 11M • 1k
• Updated • 12.2M • 8
Aletheia-ng/processed_data
• Updated • 2.81M • 8
• Updated • 94.8M • 43
• Updated • 158M • 881
• Updated • 200M • 25
Aletheia-ng/pidgin-corpus-synth
• Updated • 57.1k • 27
Aletheia-ng/yoruba-corpus-synth
• Updated • 20.2k • 12
Aletheia-ng/nigerian-pidgin-corpus-synth
• Updated • 15
Aletheia-ng/pretrain_data10
• Updated • 40.9M • 23
Aletheia-ng/low_resource_languages_pretrain_data4
• Updated • 469M • 295
Aletheia-ng/low_resource_languages_pretrain_data5
• Updated • 212M • 133
Aletheia-ng/pretrain_data11
Aletheia-ng/pretrain_data9
• Updated • 79.1M • 189
Aletheia-ng/pretrain_test
• Updated • 112M • 1
Aletheia-ng/pretrain_data5
• Updated • 9.43M • 8
Aletheia-ng/pretrain_data4
• Updated • 124M • 54
Aletheia-ng/pretrain_data7
• Updated • 13M • 26
Aletheia-ng/pretrain_data3
• Updated • 143M • 131
Aletheia-ng/low_resource_languages_pretrain_data2
• Updated • 587M • 307
Aletheia-ng/low_resource_languages_pretrain_data
• Updated • 734M • 180
Aletheia-ng/pretrain_data6
• Updated • 205M • 114
• Updated • 136 • 9
Aletheia-ng/pretrain_data
• Updated • 109M • 15
Aletheia-ng/pretrain_data2
• Updated • 18.2M • 20
Aletheia-ng/low_resource_languages_pretrain
• Updated • 202M • 609