So, what are you going to do with these findings? Will you need to get Apple involved? Will you submit pull requests to the PyTorch or MLX folks? Any chance of further improvements with more processing time? Can the regressions be mitigated with some kind of switch between the standard kernels and your optimized ones?
Aaron Reitz
Adreitz