|
The model did not return a loss from the inputs, only the following keys: logits. For reference, the inputs it received are input_values
|
|
24
|
41985
|
January 23, 2026
|
|
Paper authorship Pending for days
|
|
6
|
60
|
January 23, 2026
|
|
The Resonant Cognitive Framework (RCF):A Multi‑Agent, Cross‑Modal Symbolic Architecture for Distributed Cognition
|
|
0
|
14
|
January 22, 2026
|
|
[Discussion] Validating Attention Map Visualization for Visual Fading in LLaVA-1.5
|
|
4
|
21
|
January 23, 2026
|
|
Single Camera SmolVLA Training
|
|
1
|
15
|
January 23, 2026
|
|
Subscription Upgrade
|
|
1
|
21
|
January 22, 2026
|
|
AI Recruiting Agent — Weekly Updates + Roadmap
|
|
0
|
12
|
January 23, 2026
|
|
Open_Cluster_AI_Station_beta can run 'ssm' and 'GQA' models on cluster
|
|
2
|
7
|
January 23, 2026
|
|
Why don’t we apply temperature scaling during training to match inference-time decoding?
|
|
1
|
13
|
January 23, 2026
|
|
Streamer AI (Like Neuro-Sama)
|
|
34
|
43679
|
January 24, 2026
|
|
How far before simulation isn't?
|
|
11
|
205
|
January 24, 2026
|
|
Custom ops library with new type of neuron for PyTorch
|
|
2
|
8
|
January 23, 2026
|
|
A Bidirectional LLM Firewall: Next Level X1 - help wanted!
|
|
17
|
178
|
January 18, 2026
|
|
Do AI models feel?
|
|
90
|
1122
|
January 18, 2026
|
|
No AI background here — how did things start to make sense for you?
|
|
3
|
50
|
January 22, 2026
|
|
Using dataset in streaming mode , causing increasing in ram
|
|
4
|
28
|
January 22, 2026
|
|
Analyzing WhatsApp Data: Sentiment & Topic Techniques?
|
|
4
|
465
|
January 23, 2026
|
|
How to use text only model -> [mistralai/Ministral-3-3B-Instruct-2512]
|
|
9
|
45
|
January 23, 2026
|
|
Fail to claim paper authorship
|
|
15
|
736
|
January 23, 2026
|
|
Error/debug logs in spaces
|
|
1
|
18
|
January 22, 2026
|
|
Error when trying to dowload a model in Ollama
|
|
7
|
350
|
January 19, 2026
|
|
No fix for High Vulnerabilities in transformers latest package
|
|
2
|
27
|
January 22, 2026
|
|
Reasonable time to wait for access request approval?
|
|
2
|
35
|
January 20, 2026
|
|
Observations on Cross‑Model Behavioural Convergence
|
|
0
|
15
|
January 22, 2026
|
|
XLM-R vs llama-7b tokenization
|
|
1
|
15
|
January 21, 2026
|
|
No me deja pagar el hugging face pro
|
|
4
|
29
|
January 23, 2026
|
|
Email classification, labeling and entity classification/extraction
|
|
4
|
829
|
January 22, 2026
|
|
How to classify large quantities of text?
|
|
3
|
52
|
January 22, 2026
|
|
Project Janus: Engineering the "Shape" of Attention (Step 4.5k Update)
|
|
1
|
31
|
January 21, 2026
|
|
Daily usage quota and servers?
|
|
2
|
44
|
January 18, 2026
|