Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
20
5
18
Void
Parveshiiii
Follow
mathias8765's profile picture
noah245's profile picture
isael234567's profile picture
150 followers
·
34 following
parveshiiii
AI & ML interests
I love deep neural nets.
Recent Activity
reacted
to
their
post
with 🔥
3 days ago
🚀 Wanna train your own AI Model or Tokenizer from scratch? Building models isn’t just for big labs anymore — with the right data, compute, and workflow, you can create **custom AI models** and **tokenizers** tailored to any domain. Whether it’s NLP, domain‑specific datasets, or experimental architectures, training from scratch gives you full control over vocabulary, embeddings, and performance. ✨ Why train your own? - Full control over vocabulary & tokenization - Domain‑specific optimization (medical, legal, technical, etc.) - Better performance on niche datasets - Freedom to experiment with architectures ⚡ The best part? - Tokenizer training (TikToken / BPE) can be done in **just 3 lines of code**. - Model training runs smoothly on **Google Colab notebooks** — no expensive hardware required. 📂 Try out my work: - 🔗 https://github.com/OE-Void/Tokenizer-from_scratch - 🔗 https://github.com/OE-Void/GPT
posted
an
update
3 days ago
🚀 Wanna train your own AI Model or Tokenizer from scratch? Building models isn’t just for big labs anymore — with the right data, compute, and workflow, you can create **custom AI models** and **tokenizers** tailored to any domain. Whether it’s NLP, domain‑specific datasets, or experimental architectures, training from scratch gives you full control over vocabulary, embeddings, and performance. ✨ Why train your own? - Full control over vocabulary & tokenization - Domain‑specific optimization (medical, legal, technical, etc.) - Better performance on niche datasets - Freedom to experiment with architectures ⚡ The best part? - Tokenizer training (TikToken / BPE) can be done in **just 3 lines of code**. - Model training runs smoothly on **Google Colab notebooks** — no expensive hardware required. 📂 Try out my work: - 🔗 https://github.com/OE-Void/Tokenizer-from_scratch - 🔗 https://github.com/OE-Void/GPT
liked
a dataset
3 days ago
Modotte/MathX-20M
View all activity
Organizations
Parveshiiii
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
3 days ago
Modotte/MathX-20M
Viewer
•
Updated
4 days ago
•
20.3M
•
83
•
1
liked
a model
about 1 month ago
Org-Exp/M1-MathX
Text Generation
•
1.0B
•
Updated
Dec 27, 2025
•
12
•
3
liked
a Space
about 2 months ago
Sleeping
1
AIRealNet
🏆
1
Demo for XenArcAI-AIRealNet
liked
2 datasets
2 months ago
Modotte/CodeX-2M-Thinking
Viewer
•
Updated
15 days ago
•
2.19M
•
639
•
8
Modotte/CodeX-7M-Non-Thinking
Viewer
•
Updated
15 days ago
•
7.36M
•
912
•
7
liked
a model
3 months ago
Modotte/SparkEmbedding-300m
Sentence Similarity
•
0.3B
•
Updated
15 days ago
•
18
•
11
liked
a model
4 months ago
Parveshiiii/Classifier
0.2B
•
Updated
Nov 4, 2025
•
3
•
1
liked
4 datasets
4 months ago
Parveshiiii/Embedder
Viewer
•
Updated
Sep 22, 2025
•
990k
•
14
•
2
Parveshiiii/AI-vs-Real
Viewer
•
Updated
Sep 25, 2025
•
14k
•
363
•
5
Parveshiiii/Complete-it
Viewer
•
Updated
Oct 2, 2025
•
190k
•
31
•
2
Modotte/Bhagwat-Gita-Infinity
Preview
•
Updated
15 days ago
•
427
•
9
liked
a model
4 months ago
Modotte/AIRealNet
Image Classification
•
0.2B
•
Updated
15 days ago
•
17.2k
•
•
6
liked
3 models
5 months ago
Parveshiiii/Auto-Completer-0.1
Text Generation
•
0.4B
•
Updated
Sep 9, 2025
•
2
•
1
Parveshiiii/Auto-Completer-0.2
Text Generation
•
0.4B
•
Updated
Sep 9, 2025
•
3
•
2
Threatthriver/qussar-alpha
0.4B
•
Updated
Sep 9, 2025
•
6
•
2
liked
a model
7 months ago
Parveshiiii/mistral-small-int8
Text Generation
•
7B
•
Updated
Jul 8, 2025
•
3
•
1
liked
a dataset
7 months ago
Modotte/MathX-5M
Viewer
•
Updated
5 days ago
•
5.05M
•
2.58k
•
68
liked
a dataset
9 months ago
Parveshiiii/opencode_reasoning_filtered
Viewer
•
Updated
Jul 8, 2025
•
568k
•
78
•
4