In a Training Loop 🔄

Gyanateet Dutta

Ryukijano

muhammadzeeshan007's profile picture

bennybearlover's profile picture

Pacosp's profile picture

https://ryukijano.github.io

gyanateet
Ryukijano
gyanateet-dutta-386215192

AI & ML interests

Computer Vision, Robotics, Generative modelling, AI for Sciences.

Recent Activity

upvoted a paper 1 day ago

World Action Models: The Next Frontier in Embodied AI

updated a Space 4 days ago

Ryukijano/CatCon-One-Shot-Controlnet-SD-1-5-b2

updated a model 4 days ago

Ryukijano/parameter-golf-models

View all activity

Organizations

Ryukijano 's collections 22

AI-For-Quantum Computing

nvidia/Ising-Calibration-1-35B-A3B

Image-Text-to-Text • 665k • Updated about 1 month ago • 2.3k • 50
nvidia/Ising-Decoder-SurfaceCode-1-Fast

Updated 18 days ago • 148 • 13
nvidia/Ising-Decoder-SurfaceCode-1-Accurate

Updated 18 days ago • 39 • 9
nvidia/QCalEval

Viewer • Updated Apr 13 • 243 • 1.2k • 16

Learning

Running

80

TorchCode

🔥

80

Run and edit interactive notebooks in your browser

Vision_transformer_robotics

lerobot/pi0_old

Robotics • 4B • Updated Sep 19, 2025 • 1.63k • 307
nvidia/GR00T-N1.5-3B

Robotics • 3B • Updated Sep 17, 2025 • 3.7k • 188
nvidia/PhysicalAI-Autonomous-Vehicles

Updated 9 days ago • 222k • 873
facebook/tribev2

Updated Mar 27 • 174k • 517

Midi-composer

Running on Zero

Agents

Featured

582

Midi Music Generator

🎼

582

Generate MIDI music from prompts

Neural Rendering

This collection focuses on using neural networks for photorealistic rendering and image synthesis. It features models capable to text-to-image gen.

NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

Paper • 2307.14620 • Published Jul 27, 2023 • 15
LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs

Paper • 2306.05410 • Published Jun 8, 2023 • 4
ashawkey/nerf2mesh

Updated Aug 23, 2023 • 14
Build error

Featured

25

NeRF

🔮

25

Own Work

Solving The Travelling Salesmen Problem using HNN and HNN-SA algorithms

Paper • 2202.13746 • Published Feb 8, 2022 • 1
Improved Pothole Detection Using YOLOv7 and ESRGAN

Paper • 2401.08588 • Published Nov 10, 2023 • 1

LLMs

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 264
3D-LFM: Lifting Foundation Model

Paper • 2312.11894 • Published Dec 19, 2023 • 15
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 61
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 31

Audio

EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks

Paper • 2402.00892 • Published Jan 31, 2024 • 13
Running on Zero

MCP

Featured

294

MusicGen Streaming

🔥

294

Generate music from text descriptions in real-time
Runtime error

Agents

145

Whisper JAX

👀

145

Transcribe or translate audio from microphone, file, or YouTube
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning

Paper • 2406.03344 • Published Jun 5, 2024 • 22

Text_to_video diffusion

Runtime error

Agents

Featured

400

AnimateDiff-Lightning

⚡

400

Generate animated videos from text prompts
tencent/HunyuanVideo

Text-to-Video • Updated Mar 6, 2025 • 794 • • 2.17k

Text-3D

Running on L4

Agents

Featured

1.17k

Stable Fast 3D

🎮

1.17k

Generate a 3D mesh from a single image
Runtime error

Agents

Featured

184

Roblox 3D Assets Generator v1

🪄

184

Create a 3D model from an image in 10 seconds!
Running on Zero

Agents

Featured

148

LLaMA Mesh

👀

148

Create 3D mesh by chatting.
stabilityai/stable-point-aware-3d

Image-to-3D • 2B • Updated Apr 8, 2025 • 1.62k • 346

Audio->3D

fudan-generative-ai/hallo

Updated Jul 11, 2024 • 97

AI-4-Sciences

Running on CPU Upgrade

Agents

65

FAIR Chem UMA Demo

⚛

65

Run molecular dynamics simulations on uploaded structures

STEM

nvidia/AMPLIFY_350M

Fill-Mask • 0.4B • Updated Sep 26, 2025 • 33 • 8

VILA

Efficient-Large-Model/VILA1.5-3b

Text Generation • Updated Jul 18, 2024 • 997 • 34

Diffusion models

Explore the capabilities of diffusion models for natural language processing. This collection features a diverse set of models trained using diffusion

PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models

Paper • 2309.05793 • Published Sep 11, 2023 • 51
3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 202
stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 2.03M • • 7.71k
Ryukijano/lora-trained-xl-kaggle-p100

Text-to-Image • Updated Sep 28, 2024 • 4 • 1

Deep Reinforcement Learning

Features implementations and paces of popular RL algorithms and new paradigms on a variety of environments.

Ryukijano/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Mar 21, 2023
Ryukijano/Mujoco_rl_halfcheetah_Decision_Trasformer

Reinforcement Learning • Updated Jul 22, 2023 • 7
Ryukijano/poca-SoccerTwos

Reinforcement Learning • Updated Jul 18, 2023 • 28
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Paper • 2308.03526 • Published Aug 7, 2023 • 29

Deep learning

NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation

Paper • 2311.12229 • Published Nov 20, 2023 • 25
Running on Zero

Agents

Featured

1.01k

IP-Adapter-FaceID

🧑

1.01k

Generate AI images that blend your face with any prompt
Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5, 2024 • 98

Computer vision

Unsupervised Universal Image Segmentation

Paper • 2312.17243 • Published Dec 28, 2023 • 20
Denoising Vision Transformers

Paper • 2401.02957 • Published Jan 5, 2024 • 31
timm/ViT-B-16-SigLIP

Zero-Shot Image Classification • Updated Oct 25, 2023 • 72.7k • 37
Running on Zero

Agents

19

Slimsam

🌖

19

Small yet powerful mask generation application ⚡️

Multi modal foundational models

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7, 2024 • 65

Vision_language_models

Running

81

Experimental Moondream WebGPU

🌕

81

Render 3D graphics using WebGPU
meta-llama/Llama-3.2-90B-Vision-Instruct

Image-Text-to-Text • 89B • Updated Mar 4, 2025 • 4.42k • 357
Hcompany/Holo1-3B

Image-Text-to-Text • 4B • Updated Jun 10, 2025 • 651 • 83

2D->3D

Paused

Agents

68

MeshAnythingV2

🚀

68

Generate artist-style 3D mesh from your input model
Runtime error

Agents

10

En3D

🏃

10
Runtime error

Agents

55

MASt3R

📉

55

Generate 3D models from images
naver/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric

Image-to-3D • 0.7B • Updated Jul 18, 2024 • 55.3k • 18

Segmentation

Runtime error

Agents

11

Image

📚

11

Generate and save object segmentation masks from images

AI-For-Quantum Computing

nvidia/Ising-Calibration-1-35B-A3B

Image-Text-to-Text • 665k • Updated about 1 month ago • 2.3k • 50
nvidia/Ising-Decoder-SurfaceCode-1-Fast

Updated 18 days ago • 148 • 13
nvidia/Ising-Decoder-SurfaceCode-1-Accurate

Updated 18 days ago • 39 • 9
nvidia/QCalEval

Viewer • Updated Apr 13 • 243 • 1.2k • 16

AI-4-Sciences

Running on CPU Upgrade

Agents

65

FAIR Chem UMA Demo

⚛

65

Run molecular dynamics simulations on uploaded structures

Learning

Running

80

TorchCode

🔥

80

Run and edit interactive notebooks in your browser

STEM

nvidia/AMPLIFY_350M

Fill-Mask • 0.4B • Updated Sep 26, 2025 • 33 • 8

Vision_transformer_robotics

lerobot/pi0_old

Robotics • 4B • Updated Sep 19, 2025 • 1.63k • 307
nvidia/GR00T-N1.5-3B

Robotics • 3B • Updated Sep 17, 2025 • 3.7k • 188
nvidia/PhysicalAI-Autonomous-Vehicles

Updated 9 days ago • 222k • 873
facebook/tribev2

Updated Mar 27 • 174k • 517

VILA

Efficient-Large-Model/VILA1.5-3b

Text Generation • Updated Jul 18, 2024 • 997 • 34

Midi-composer

Running on Zero

Agents

Featured

582

Midi Music Generator

🎼

582

Generate MIDI music from prompts

Diffusion models

Explore the capabilities of diffusion models for natural language processing. This collection features a diverse set of models trained using diffusion

PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models

Paper • 2309.05793 • Published Sep 11, 2023 • 51
3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 202
stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 2.03M • • 7.71k
Ryukijano/lora-trained-xl-kaggle-p100

Text-to-Image • Updated Sep 28, 2024 • 4 • 1

Neural Rendering

This collection focuses on using neural networks for photorealistic rendering and image synthesis. It features models capable to text-to-image gen.

NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

Paper • 2307.14620 • Published Jul 27, 2023 • 15
LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs

Paper • 2306.05410 • Published Jun 8, 2023 • 4
ashawkey/nerf2mesh

Updated Aug 23, 2023 • 14
Build error

Featured

25

NeRF

🔮

25

Deep Reinforcement Learning

Features implementations and paces of popular RL algorithms and new paradigms on a variety of environments.

Ryukijano/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Mar 21, 2023
Ryukijano/Mujoco_rl_halfcheetah_Decision_Trasformer

Reinforcement Learning • Updated Jul 22, 2023 • 7
Ryukijano/poca-SoccerTwos

Reinforcement Learning • Updated Jul 18, 2023 • 28
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Paper • 2308.03526 • Published Aug 7, 2023 • 29

Own Work

Solving The Travelling Salesmen Problem using HNN and HNN-SA algorithms

Paper • 2202.13746 • Published Feb 8, 2022 • 1
Improved Pothole Detection Using YOLOv7 and ESRGAN

Paper • 2401.08588 • Published Nov 10, 2023 • 1

Deep learning

NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation

Paper • 2311.12229 • Published Nov 20, 2023 • 25
Running on Zero

Agents

Featured

1.01k

IP-Adapter-FaceID

🧑

1.01k

Generate AI images that blend your face with any prompt
Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5, 2024 • 98

LLMs

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 264
3D-LFM: Lifting Foundation Model

Paper • 2312.11894 • Published Dec 19, 2023 • 15
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 61
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 31

Computer vision

Unsupervised Universal Image Segmentation

Paper • 2312.17243 • Published Dec 28, 2023 • 20
Denoising Vision Transformers

Paper • 2401.02957 • Published Jan 5, 2024 • 31
timm/ViT-B-16-SigLIP

Zero-Shot Image Classification • Updated Oct 25, 2023 • 72.7k • 37
Running on Zero

Agents

19

Slimsam

🌖

19

Small yet powerful mask generation application ⚡️

Audio

EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks

Paper • 2402.00892 • Published Jan 31, 2024 • 13
Running on Zero

MCP

Featured

294

MusicGen Streaming

🔥

294

Generate music from text descriptions in real-time
Runtime error

Agents

145

Whisper JAX

👀

145

Transcribe or translate audio from microphone, file, or YouTube
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning

Paper • 2406.03344 • Published Jun 5, 2024 • 22

Multi modal foundational models

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7, 2024 • 65

Text_to_video diffusion

Runtime error

Agents

Featured

400

AnimateDiff-Lightning

⚡

400

Generate animated videos from text prompts
tencent/HunyuanVideo

Text-to-Video • Updated Mar 6, 2025 • 794 • • 2.17k

Vision_language_models

Running

81

Experimental Moondream WebGPU

🌕

81

Render 3D graphics using WebGPU
meta-llama/Llama-3.2-90B-Vision-Instruct

Image-Text-to-Text • 89B • Updated Mar 4, 2025 • 4.42k • 357
Hcompany/Holo1-3B

Image-Text-to-Text • 4B • Updated Jun 10, 2025 • 651 • 83

Text-3D

Running on L4

Agents

Featured

1.17k

Stable Fast 3D

🎮

1.17k

Generate a 3D mesh from a single image
Runtime error

Agents

Featured

184

Roblox 3D Assets Generator v1

🪄

184

Create a 3D model from an image in 10 seconds!
Running on Zero

Agents

Featured

148

LLaMA Mesh

👀

148

Create 3D mesh by chatting.
stabilityai/stable-point-aware-3d

Image-to-3D • 2B • Updated Apr 8, 2025 • 1.62k • 346

2D->3D

Paused

Agents

68

MeshAnythingV2

🚀

68

Generate artist-style 3D mesh from your input model
Runtime error

Agents

10

En3D

🏃

10
Runtime error

Agents

55

MASt3R

📉

55

Generate 3D models from images
naver/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric

Image-to-3D • 0.7B • Updated Jul 18, 2024 • 55.3k • 18

Audio->3D

fudan-generative-ai/hallo

Updated Jul 11, 2024 • 97

Segmentation

Runtime error

Agents

11

Image

📚

11

Generate and save object segmentation masks from images

Gyanateet Dutta

AI & ML interests

Recent Activity

Organizations

Ryukijano 's collections 22

TorchCode

Midi Music Generator

NeRF

MusicGen Streaming

Whisper JAX

AnimateDiff-Lightning

Stable Fast 3D

Roblox 3D Assets Generator v1

LLaMA Mesh

FAIR Chem UMA Demo

IP-Adapter-FaceID

Slimsam

Experimental Moondream WebGPU

MeshAnythingV2

En3D

MASt3R

Image

FAIR Chem UMA Demo

TorchCode

Midi Music Generator

NeRF

IP-Adapter-FaceID

Slimsam

MusicGen Streaming

Whisper JAX

AnimateDiff-Lightning

Experimental Moondream WebGPU

Stable Fast 3D

Roblox 3D Assets Generator v1

LLaMA Mesh

MeshAnythingV2

En3D

MASt3R

Image