NVIDIA: Llama Nemotron Embed VL 1B V2 (free)
nvidia/llama-nemotron-embed-vl-1b-v2:free
Released Feb 25, 2026131,072 context
$0/M input tokens$0/M output tokens
Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.