ray commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -31,7 +31,7 @@ Despite its lightweight footprint, Sutra-Instruct-v2 acts as a highly capable "W
|
|
| 31 |
- **Developer:** Abhiray / Jay
|
| 32 |
|
| 33 |
## 📚 Training Data Recipe (Continuous Pre-Training)
|
| 34 |
-
To transition the model from a general-purpose base to a reasoning and coding
|
| 35 |
|
| 36 |
The dataset mixture was meticulously balanced to prevent overfitting while maximizing logical deduction:
|
| 37 |
|
|
|
|
| 31 |
- **Developer:** Abhiray / Jay
|
| 32 |
|
| 33 |
## 📚 Training Data Recipe (Continuous Pre-Training)
|
| 34 |
+
To transition the model from a general-purpose base to a reasoning and bit of coding , we executed a Continuous Pre-Training phase using approximately **3.3 Billion high-quality tokens**.
|
| 35 |
|
| 36 |
The dataset mixture was meticulously balanced to prevent overfitting while maximizing logical deduction:
|
| 37 |
|