llama.cpp 支持 llama-4 了！

在 MacStudio M3 Ultra 512G 上运行 Llama-4-Scout-17B-16E-Instruct-Q8_0-GGUF 大概是 27 token/s

但是，raspberry 有几个 r 还是数不对

You must log in or register to comment.