ZeroGPU Explorers

community

AI & ML interests

None defined yet.

Recent Activity

caizhongang authored a paper 5 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

limingcv authored a paper 14 days ago

Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization

limingcv authored a paper 14 days ago

ViPO: Visual Preference Optimization at Scale

View all activity

authored a paper 7 days ago

Aryabhata: An exam-focused language model for JEE Math

Paper • 2508.08665 • Published Aug 12, 2025 • 16

posted an update 20 days ago

Post

185

Reinforcement learning can sometimes lead to emergent behavior through much simpler training setups compared to large scale pre-training.

I explored this idea by running a small GRPO experiment on Qwen3.5 4B, and the results were pretty exciting.

Hypothesis: improving visual mathematical reasoning may also improve the model’s ability to transcribe LaTeX from images.

I wrote a short breakdown of the experiment here:
https://hanzlajavaid.github.io/blog/grpo-experiment-exploring-emergent-properties/

authored a paper 26 days ago

Selectivity and Shape in the Design of Forward-Forward Goodness Functions

Paper • 2604.13081 • Published Apr 16

in zero-gpu-explorers/README 29 days ago

Why doesn't anyone host llms in zerogpu spaces?

#172 opened about 1 month ago by

in zero-gpu-explorers/README about 1 month ago

Why doesn't anyone host llms in zerogpu spaces?

#172 opened about 1 month ago by

submitted a paper to Daily Papers about 2 months ago

Diffutron: A Masked Diffusion Language Model for Turkish Language

Paper • 2603.20466 • Published Mar 20 • 9

submitted a paper to Daily Papers about 2 months ago

LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis

Paper • 2603.20176 • Published Mar 20 • 11

authored a paper about 2 months ago

LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis

Paper • 2603.20176 • Published Mar 20 • 11

authored a paper about 2 months ago

Diffutron: A Masked Diffusion Language Model for Turkish Language

Paper • 2603.20466 • Published Mar 20 • 9

submitted a paper to Daily Papers 2 months ago

Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey

Paper • 2603.04445 • Published 27 days ago • 5

authored a paper 2 months ago

Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey

Paper • 2603.04445 • Published 27 days ago • 5

authored a paper 3 months ago

Recursive Think-Answer Process for LLMs and VLMs

Paper • 2603.02099 • Published Mar 2 • 7

submitted a paper to Daily Papers 3 months ago

Recursive Think-Answer Process for LLMs and VLMs

Paper • 2603.02099 • Published Mar 2 • 7

authored a paper 3 months ago

AfriNLLB: Efficient Translation Models for African Languages

Paper • 2602.09373 • Published Feb 10 • 3

submitted a paper to Daily Papers 4 months ago

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Paper • 2601.17124 • Published Jan 23 • 33

authored a paper 5 months ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 30

submitted a paper to Daily Papers 5 months ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 30

authored a paper 5 months ago

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Paper • 2512.21338 • Published Dec 24, 2025 • 23

submitted a paper to Daily Papers 5 months ago

In Pursuit of Pixel Supervision for Visual Pre-training

Paper • 2512.15715 • Published Dec 17, 2025 • 11

submitted a paper to Daily Papers 5 months ago

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44