liuganghuggingface
/

DemoDiff-0.7B

Graph Machine Learning

Model card Files Files and versions

liuganghuggingface commited on Oct 9

Commit

32d216f

·

verified ·

1 Parent(s): 0fcc30e

Update README.md

Files changed (1) hide show

README.md +14 -10

README.md CHANGED Viewed

@@ -7,13 +7,17 @@ tags:
 - biology
 ---
-context_length: 150
-depth: 24
-diffusion_steps: 500
-hidden_size: 1280
-mlp_ratio: 4
-num_heads: 16
-task_name: pretrainv6reset
-tokenizer_name: pretrainv6reset
-vocab_ring_len: 300
-vocab_size: 3000

 - biology
 ---
+### Model Configuration
+| Parameter | Value | Description |
+|------------|--------|-------------|
+| **context_length** | 150 | Maximum sequence length for the input context. |
+| **depth** | 24 | Number of transformer layers. |
+| **diffusion_steps** | 500 | Number of diffusion steps during training. |
+| **hidden_size** | 1280 | Hidden dimension size in the transformer. |
+| **mlp_ratio** | 4 | Expansion ratio in the MLP block. |
+| **num_heads** | 16 | Number of attention heads. |
+| **task_name** | `pretrain` | Task type for model training. |
+| **tokenizer_name** | `pretrain` | Tokenizer used for model input. |
+| **vocab_ring_len** | 300 | Length of the circular vocabulary window. |
+| **vocab_size** | 3000 | Total vocabulary size. |