FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
(arxiv.org)
20 points | by PaulHoule | 16 hours ago | 1 comment
Reubend | 10 hours ago
Paper looks great. No GitHub link that I can find though. Maybe I'll take a crack at an implementation if I've got some extra free time.