Abiray
/

Sutra-Instruct-v2-350M

code-generation

continuous-pre-training

custom-architecture

Model card Files Files and versions

ray commited on Mar 20

Commit

7c032f9

·

verified ·

1 Parent(s): 8adaff3

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -31,7 +31,7 @@ Despite its lightweight footprint, Sutra-Instruct-v2 acts as a highly capable "W
 - **Developer:** Abhiray / Jay
 ## 📚 Training Data Recipe (Continuous Pre-Training)
-To transition the model from a general-purpose base to a reasoning and coding specialist, we executed a Continuous Pre-Training phase using approximately **3.3 Billion high-quality tokens**.
 The dataset mixture was meticulously balanced to prevent overfitting while maximizing logical deduction:

 - **Developer:** Abhiray / Jay
 ## 📚 Training Data Recipe (Continuous Pre-Training)
+To transition the model from a general-purpose base to a reasoning and bit of coding , we executed a Continuous Pre-Training phase using approximately **3.3 Billion high-quality tokens**.
 The dataset mixture was meticulously balanced to prevent overfitting while maximizing logical deduction: