Nvidia’s Nemotron Cascade 2 30B Model Achieves 97.6% on HumanEval Benchmark

By megaone_admin, Mar 22, 2026
Engine Score 8/10 — Important

News of strong results from an Nvidia Nemotron model has significant industry impact and is highly actionable for developers working with local LLMs. The information is novel and timely, but its reliability is somewhat limited because it is sourced from a community forum and is hard to verify.

A Reddit user has highlighted strong performance results for Nvidia’s Nemotron Cascade 2 30B-A3B model, which achieved 97.6% on the HumanEval coding benchmark and 88% on ClassEval. The results were posted by user ilintar on the LocalLLaMA subreddit, who tested mradermacher’s IQ4_XS quantized version of the model.

According to the post, the Nemotron Cascade 2 30B-A3B “is *not* based on the Qwen architecture despite a similar size, it’s a properly hybrid model based on Nemotron’s own arch.” The user noted that despite discussions around Nvidia’s Nemotron Super family of models, this particular model “has largely flown under the radar.”

The evaluation used HumanEval and ClassEval benchmarks, which the tester described as “quick to run and complicated enough for most small models to still have noticeable differences.” On HumanEval, the model’s 97.6% score reportedly left “both medium Qwen3.5 models in the rear window,” though specific comparison scores were not provided.
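For readers who want to sanity-check the headline number, here is a minimal Python sketch of how a pass rate of this kind is typically computed. It assumes the score is a simple single-attempt pass rate over the standard 164-problem HumanEval set; the post does not state the problem count or the sampling setup, and the function names are hypothetical.

```python
# Assumption: the standard HumanEval set has 164 problems; the post does not state this.
HUMANEVAL_PROBLEMS = 164


def pass_rate(solved: int, total: int) -> float:
    """Percentage of problems whose generated solution passed all unit tests."""
    if total <= 0 or not 0 <= solved <= total:
        raise ValueError("total must be positive and solved must lie in [0, total]")
    return 100.0 * solved / total


def solved_counts_matching(reported: float, total: int, digits: int = 1) -> list[int]:
    """All solved-problem counts whose pass rate rounds to the reported percentage."""
    return [
        solved
        for solved in range(total + 1)
        if round(pass_rate(solved, total), digits) == reported
    ]


if __name__ == "__main__":
    # Under the 164-problem assumption, only one count rounds to 97.6%.
    print(solved_counts_matching(97.6, HUMANEVAL_PROBLEMS))  # [160]
```

Under those assumptions, 97.6% corresponds to 160 of 164 problems solved, so each additional failure would move the score by roughly 0.6 percentage points.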

The Reddit user said the tests were motivated by a move away from subjective evaluation methods, stating: “I’ve been running some evals on local models lately since I’m kind of tired of the ‘vibe feels’ method of judging them.”

The poster indicated plans for additional testing, writing “I’m going to run some more tests on this model, but I feel it deserves a bit more attention.” No timeline was provided for when additional benchmark results might be available.

MegaOne AI Editorial Team

MegaOne AI monitors 200+ sources daily to identify and score the most important AI developments. Our editorial team reviews this coverage with rigorous oversight to keep it accurate. Every story is fact-checked, linked to primary sources, and rated using our six-factor Engine Score methodology.
