Instructions to use ThomasYn/GenomeOcean-100M-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use ThomasYn/GenomeOcean-100M-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="ThomasYn/GenomeOcean-100M-GGUF", filename="GenomeOcean-100M-FP16.gguf", )
output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use ThomasYn/GenomeOcean-100M-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf ThomasYn/GenomeOcean-100M-GGUF # Run inference directly in the terminal: llama-cli -hf ThomasYn/GenomeOcean-100M-GGUF
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf ThomasYn/GenomeOcean-100M-GGUF # Run inference directly in the terminal: llama-cli -hf ThomasYn/GenomeOcean-100M-GGUF
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf ThomasYn/GenomeOcean-100M-GGUF # Run inference directly in the terminal: ./llama-cli -hf ThomasYn/GenomeOcean-100M-GGUF
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf ThomasYn/GenomeOcean-100M-GGUF # Run inference directly in the terminal: ./build/bin/llama-cli -hf ThomasYn/GenomeOcean-100M-GGUF
Use Docker
docker model run hf.co/ThomasYn/GenomeOcean-100M-GGUF
- LM Studio
- Jan
- Ollama
How to use ThomasYn/GenomeOcean-100M-GGUF with Ollama:
ollama run hf.co/ThomasYn/GenomeOcean-100M-GGUF
- Unsloth Studio new
How to use ThomasYn/GenomeOcean-100M-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ThomasYn/GenomeOcean-100M-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ThomasYn/GenomeOcean-100M-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for ThomasYn/GenomeOcean-100M-GGUF to start chatting
- Docker Model Runner
How to use ThomasYn/GenomeOcean-100M-GGUF with Docker Model Runner:
docker model run hf.co/ThomasYn/GenomeOcean-100M-GGUF
- Lemonade
How to use ThomasYn/GenomeOcean-100M-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull ThomasYn/GenomeOcean-100M-GGUF
Run and chat with the model
lemonade run user.GenomeOcean-100M-GGUF-{{QUANT_TAG}}List all available models
lemonade list
GenomeOcean-100M-GGUF
Model Overview
This is a GGUF quantized version of GenomeOcean-100M, designed for high-efficiency DNA sequence modeling.
- Architecture: Mistral-based Genomic LLM
- Quantization: GGUF (4-bit)
- Primary Use: DNA sequence scoring, generation, and genomic feature analysis.
Benchmark Results (Local Evaluation)
Evaluation conducted on genomic sequences (max length 512) using TP=2.
| Metric | FP16 (Original) | GGUF (4-bit) | Change |
|---|---|---|---|
| VRAM Footprint | ~2x Model Size | 2.2 GB | Optimized |
| Model Size | 228.3 MB | 69.5 MB | -69.6% |
| NLL Loss | 6.2110 | 6.2483 | +3.80% Drift |
| Perplexity (PPL) | 498.1917 | 517.1174 | +3.80% Drift |
| Generation Time | 92.2s | 54.6s | Optimized |
Usage
Using vLLM
from vllm import LLM, SamplingParams
# Load the model
llm = LLM(model="ThomasYn/GenomeOcean-100M-GGUF")
# Generate sequences
prompts = ["ATG", "GCA"]
sampling_params = SamplingParams(temperature=0.7, top_p=0.95, max_tokens=100)
outputs = llm.generate(prompts, sampling_params)
for output in outputs:
print(f"Generated: {output.outputs[0].text}")
Using go-infer (GenomeOcean CLI)
# Scoring sequences
python -m genomeocean.cli score --model_dir ThomasYn/GenomeOcean-100M-GGUF --sequence_file data.txt
# Generating sequences
python -m genomeocean.cli generate --model_dir ThomasYn/GenomeOcean-100M-GGUF --num 10 --max_seq_len 512
Model Repository Structure
This repository contains the necessary configuration files and model weights for GGUF inference.
model.gguf: Quantized weightsconfig.json: Model configurationmodeling_mistral.py: Architecture implementationtokenizer.json&tokenizer_config.json: Genomic tokenizer files
Citation
If you use this model in your research, please cite:
@article{genomeocean2026,
title={GenomeOcean: A Large-scale Foundation Model for Ocean Genomics},
author={Thomas Yn, et al.},
journal={bioRxiv},
year={2026}
}
- Downloads last month
- 34
Hardware compatibility
Log In to add your hardware
We're not able to determine the quantization variants.
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support