Skip to content
Go back

Hermes connects Nvidia Nemotron Model via OpenRouter for Free

Updated:
Edit page

Hermes connects Nvidia Nemotron Model via OpenRouter for Free

Background

Takeaways

hermes-tui

Register OpenRouter

A quick comparison chart is attached to show differences between Nvidia Nemotron, Google Gemma, and Claude’s current flagship model, Opus 4.7.

modelCompare

There is also a short explanation of reasoning vs completion at the end of this article.

In Hermes

Backup your current config.yaml

cp ~/.hermes/config.yaml ~/.hermes/config.yaml.backup

Show current model configuration

hermes config show

You should see any other configured model similar to this You should see a configured model similar to this:

â—† Model
  Model:        {'default': 'elephant-alpha', 'provider': 'openrouter', 'base_url': 'https://openrouter.ai/api/v1', 'api_mode': 'chat_completions'}
  Max turns:    90

Set the primary model

hermes config set model.provider openrouter
hermes config set model "nvidia/nemotron-3-super-120b-a12b:free"

✓ Set model = nvidia/nemotron-3-super-120b-a12b:free in /home/jeff/.hermes/config.yaml

Configure fallback models

hermes config edit

Then edit it manually:

model:
  default: nvidia/nemotron-3-super-120b-a12b:free
  provider: openrouter
  base_url: https://openrouter.ai/api/v1
  api_mode: chat_completions
providers: {}
fallback_providers:
- provider: openrouter
  model: openrouter/free
- provider: openrouter
  model: openrouter/elephant-alpha

Note: openrouter/free provides best available free models from OpenRouter

Restart gateway

hermes gateway restart

Other free available models

You can also browse free and paid models from OpenRouter and NousResearch here:

https://openrouter.ai/models?q=free https://portal.nousresearch.com/models

Free usage limit

When using free model via OpenRouter, you may receive a warning:

API call failed after 3 retries: HTTP 429: Rate limit exceeded: free-models-per-day. Add 10 credits to unlock 1000 free model requests per day

This message comes from OpenRouter.ai. You can add 10 credits to avoid this message without being charged.

Difference between Reasoning and Completion

The main difference is how the model gets to the answer. A completion model predicts the most likely next word from learned patterns, while a reasoning model works through steps before giving the final response.

Core Definitions


Edit page
Share this post on:

Next Post
OpenClaw Security Risks