Upload RL-trained question generation model

Files changed (3) hide show

README.md CHANGED Viewed

@@ -1,13 +1,13 @@
 ---
 license: apache-2.0
 base_model: Qwen/Qwen3-4B-Instruct-2507
 tags:
-- question-generation
-- rl
-- grpo
-- lora
 pipeline_tag: text-generation
-library_name: transformers
 ---
 # qwen3-4b-question-gen
@@ -41,4 +41,4 @@ from vllm import LLM, SamplingParams
 llm = LLM(model="ash256/qwen3-4b-question-gen")
 outputs = llm.generate(["Generate a technical screening question for a senior backend engineer:"], SamplingParams(max_tokens=256))
 print(outputs[0].outputs[0].text)
-```

 ---
 license: apache-2.0
+library_name: transformers
 base_model: Qwen/Qwen3-4B-Instruct-2507
 tags:
+  - question-generation
+  - rl
+  - grpo
+  - lora
 pipeline_tag: text-generation
 ---
 # qwen3-4b-question-gen
 llm = LLM(model="ash256/qwen3-4b-question-gen")
 outputs = llm.generate(["Generate a technical screening question for a senior backend engineer:"], SamplingParams(max_tokens=256))
 print(outputs[0].outputs[0].text)
+```

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:782f2b7794eb91ac319f0234bb3899c29e46d6ea74f7ca03ad1ff6821fe02607
 size 4967215360

 version https://git-lfs.github.com/spec/v1
+oid sha256:3320e2e0bc614646cef2b27cde2eea518da80f31e63e0f346ad61c7750c9bf49
 size 4967215360

model-00002-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:40b2c71e5d756671bf45f13107512d3ff17b2cf36d5bc7522ab4714b5de2fc8e
 size 3077766632

 version https://git-lfs.github.com/spec/v1
+oid sha256:fb50315a4622cc96217c77044dad2c33ead85bd281c77e4427997a991f01ce15
 size 3077766632