AI Observatory / Model Radar Nvidia / nvidia/llama-3.3-nemotron-super-49b-v1.5

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Back To Model Radar Return To Index

131,072 Context

16,384 Max output

$0.10 / 1M Prompt price

2025-10-10 Created

01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider	Nvidia	Input modalities	text
Output modalities	text	Prompt price	$0.10 / 1M
Completion price	$0.40 / 1M	Request price	N/A
Context length	131,072	Max completion tokens	16,384
Supported parameters	frequency_penalty, include_reasoning, logit_bias, max_tokens, min_p, presence_penalty, reasoning, repetition_penalty, response_format, seed, stop, temperature, tool_choice, tools, top_k, top_p

Best for nvidia/llama-3.3-nemotron-super-49b-v1.5

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Deep analysis Coding workflows High-volume usage tool-capable low-cost

02 / Related

Related models in nearby categories.

Related models are derived from overlapping use-case categories so the detail page stays navigable.

Aion Labs $0.70 / 1M

AionLabs: Aion-1.0-Mini

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...

Deep analysis Coding workflows High-volume usage

Amazon $0.30 / 1M

Amazon: Nova 2 Lite

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...

Deep analysis Cross-modal work Long context High-volume usage

Arcee Ai $0.22 / 1M

Arcee AI: Trinity Large Thinking

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7

Deep analysis Long context High-volume usage Coding workflows

Arcee Ai $0.04 / 1M

Arcee AI: Trinity Mini

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...

Deep analysis High-volume usage Coding workflows

Arcee Ai $0.75 / 1M

Arcee AI: Virtuoso Large

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k...

Deep analysis High-volume usage Coding workflows

Baidu $0.07 / 1M

Baidu: ERNIE 4.5 21B A3B Thinking

ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.

Deep analysis Coding workflows High-volume usage

03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.