
gpt-oss-puzzle-88B

A deployment-optimized large language model by NVIDIA for efficient reasoning and long-context inference.

Price Free / Open Source
Category Open Source Model
Model Open Source
Company NVIDIA
Team Size 42,000
Total Funding $20M

gpt-oss-puzzle-88B is a deployment-optimized large language model that NVIDIA derived from OpenAI's gpt-oss-120b. It uses the Puzzle framework for neural architecture search to improve inference efficiency on reasoning-heavy workloads while maintaining or improving accuracy. The model is optimized for NVIDIA H100-class hardware and supports long-context inference of up to 128K tokens.

5/10

The gpt-oss-puzzle-88B model shows technical merit, with a reported 2.82x throughput improvement on H100 GPUs, but it remains a niche deployment optimization rather than a breakthrough model. Despite NVIDIA's backing, it lacks broad market adoption and primarily serves specialized inference-efficiency use cases.
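To put the 2.82x throughput figure in practical terms, the sketch below converts a throughput ratio into GPU-hours for a fixed generation workload. Only the 2.82x ratio comes from the review above; the baseline throughput and the workload size are hypothetical placeholders chosen for illustration, not measured values for either model. Under the assumption that throughput stays steady, a 2.82x speedup cuts GPU time by 1 - 1/2.82, or roughly 64.5%, regardless of the baseline chosen.

```python
def gpu_hours(total_tokens: int, tokens_per_second: float) -> float:
    """GPU-hours needed to generate total_tokens at a steady throughput."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return total_tokens / tokens_per_second / 3600.0


# Throughput ratio quoted in the review above.
SPEEDUP = 2.82

# Hypothetical values for illustration only; substitute measured numbers.
BASELINE_TOKENS_PER_SECOND = 1000.0
TOTAL_TOKENS = 1_000_000_000

baseline_hours = gpu_hours(TOTAL_TOKENS, BASELINE_TOKENS_PER_SECOND)
optimized_hours = gpu_hours(TOTAL_TOKENS, BASELINE_TOKENS_PER_SECOND * SPEEDUP)

# Fraction of GPU time saved; depends only on the speedup ratio.
savings_fraction = 1.0 - optimized_hours / baseline_hours

print(f"baseline:  {baseline_hours:.1f} GPU-hours")
print(f"optimized: {optimized_hours:.1f} GPU-hours")
print(f"saved:     {savings_fraction:.1%}")
```

Because the saving depends only on the ratio, the same calculation applies to any baseline, but it says nothing about latency, accuracy, or behavior on hardware other than the H100-class GPUs the listing mentions.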
