A family of bilingual JA/EN LLMs. https://shisa.ai/posts/shisa-v2.1/
- Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
  Paper • 2406.08464 • Published • 71
- Scaling Synthetic Data Creation with 1,000,000,000 Personas
  Paper • 2406.20094 • Published • 104
- argilla/magpie-ultra-v1.0
  Viewer • Updated • 3.22M • 1.01k • 50
- simplescaling/s1K-1.1
  Viewer • Updated • 1k • 3.27k • 142
JA/EN Bilingual LLMs
A family of bilingual JA/EN LLMs
Comparing Efficiency and Quality of various formats
- cyberagent/Mistral-Nemo-Japanese-Instruct-2408
  Text Generation • 12B • Updated • 590 • 46
- shisa-ai/Mistral-Nemo-Japanese-Instruct-FP8-Dynamic
  12B • Updated • 4
- shisa-ai/Mistral-Nemo-Japanese-Instruct-2408-SQ-GPTQ-W8A8-INT8
  12B • Updated • 6
- shisa-ai/Mistral-Nemo-Japanese-Instruct-2408-GPTQ-W4A16-gs32
  12B • Updated • 6