NVIDIA just released the Llama-3.3-Nemotron-Super-49B-v1 model

NVIDIA has just released a model called Llama-3.3-Nemotron-Super-49B-v1.

It is a reasoning model based on Llama-3.3, trained on distillation data that NVIDIA curated (generated by models including Llama-3.3-70B-Instruct, DeepSeek-R1, Qwen-2.5-Math-7B-Instruct, Qwen-2.5-Coder-32B-Instruct, and others).

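NVIDIA also emphasizes that the model is suited to RAG and can be used commercially. All scores below are with reasoning mode on: AIME25 is about 58.4 (QwQ-32B scores 60) and GPQA is 66.67 (QwQ-32B scores 65.2), so on these benchmarks it looks roughly on par with QwQ-32B.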

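Anyway, I'm already putting together an arena for mid-size models, so stay tuned for a side-by-side comparison of mid-size models.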

Model link: huggingface.co/nvidia/Llama-3_3-Nemotron-Super-49B-v1

jingfelix
1 year ago
I searched on Ollama and it turns out there were already Llama 3.1-based Nemotron models half a year ago: https://ollama.com/library/nemotron