Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Vithursan Thangarasa's picture
8 2 20

Vithursan Thangarasa

vithursant
tobiasgoecke's profile picture 21world's profile picture
·
https://vithursant.com/
  • vithursant19
  • vithursant

AI & ML interests

Large language models, Sparse Neural Network training, Generative Computer Vision

Recent Activity

liked a model 16 days ago
OpenMOSE/Qwen3-VL-REAP-145B-A22B-GGUF
liked a model 16 days ago
OpenMOSE/Qwen3-VL-REAP-145B-A22B
liked a model 21 days ago
cyankiwi/MiniMax-M2-REAP-162B-A10B-AWQ-4bit
View all activity

Organizations

Cerebras's profile picture MLX Community's profile picture wut?'s profile picture yofo's profile picture yofo-ironwood's profile picture

authored 4 papers over 1 year ago

MediSwift: Efficient Sparse Pre-trained Biomedical Language Models

Paper • 2403.00952 • Published Mar 1, 2024

Memory Efficient 3D U-Net with Reversible Mobile Inverted Bottlenecks for Brain Tumor Segmentation

Paper • 2104.09648 • Published Apr 19, 2021

RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Paper • 2206.14098 • Published Jun 28, 2022

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18, 2024 • 13
authored 2 papers about 2 years ago

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Paper • 2303.11525 • Published Mar 21, 2023 • 1

SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models

Paper • 2303.10464 • Published Mar 18, 2023 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs