llama-3.3-nemotron-super-49b-v1

Name: llama-3.3-nemotron-super-49b-v1
Brand: NVIDIA

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Text128KReasoningTools131.1K

入力無料

出力無料

コンテキスト128K

エンドポイントopenai

機能

推論ツール構造化

モダリティ

入力

text

出力

text

クイック統計

コンテキストウィンドウ131.1K

最大出力131.1K

モードchat

トークナイザーLlama3

学習データ更新2024

量子化fp8

Hugging Facenvidia/Llama-3_3-Nemotron-Super-49B-v1_5

パフォーマンス

パフォーマンスデータを読み込み中...

対応パラメータ

パラメータ	常時	デフォルト
frequency_penalty		(送信しない)
include_reasoning		-
logit_bias		-
max_tokens		-
min_p		-
presence_penalty		(送信しない)
reasoning		-
repetition_penalty		(送信しない)
response_format		-
seed		-
stop		-
temperature		0.6
tool_choice		-
tools		-
top_k		(送信しない)
top_p		0.95

§ 01

料金

使った分だけのお支払い。従量課金利用時は月額最低料金なし。

入力料金	$0.00 · 100万トークン
出力料金	$0.00 · 100万トークン
コンテキストウィンドウ	128K トークン
対応エンドポイント	openai
ベンダー	NVIDIA

§ 02

コードから llama-3.3-nemotron-super-49b-v1 を呼び出す

OpenAI 互換の SDK を UnoRouter に向け、モデル名を指定するだけです。YOUR_API_KEY はダッシュボードで取得した実際のキーに置き換えてください。

bash

curl https://api.unorouter.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.3-nemotron-super-49b-v1",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

サインインするとAPIキーが自動入力されます

§ 03

よくあるご質問

llama-3.3-nemotron-super-49b-v1 の100万トークンあたりの料金はいくらですか？

入力は100万トークンあたり $0.00、出力は100万トークンあたり $0.00 で課金されます。トークン単位の課金で、バッチサイズへの切り上げはありません。

API で llama-3.3-nemotron-super-49b-v1 を使うにはどうすればよいですか？

UnoRouter の /v1/chat/completions エンドポイントへ model=llama-3.3-nemotron-super-49b-v1 を指定してリクエストを送信してください。OpenAI 互換のクライアントライブラリであれば動作します。認証は標準の Bearer トークンを使用します。