Hermes Agent - AI agent

gpuerrilla · Friday at 14:35

jpod said:
Its very new to me I am a rookie with PCs just interested in AI. What are you using it for?

Learning and setting up locally. I have tried most of them to see each curve (ollama, lm studio, WSL, llama.cpp - windows) and extended that on a Frankenstein rig running ubuntu. I use a pi for searxng web searches and the models tap into that via MCP.

jpod · Friday at 16:41

Hi is the MCP on the cloud?

Can Claude desktop do MCP?

Searxng is a search engine with emphasis on privacy...

aaronyuri · Friday at 17:11

jpod said:
Hi is the MCP on the cloud?

Can Claude desktop do MCP?

Searxng is a search engine with emphasis on privacy...

The MCP server is an ancillary service/component to something. MCP is the protocol and then an MCP server will listen for MCP requests from clients. Where it's hosted can vary - there's nothing to stop you knocking together a basic MCP server using a Python library (like FastMCP) and self hosting it.

An LLM will have a set of tools/skills that it can use. One of those tools/skills can be an MCP client to interact with an MCP server. It's the new standard for programmatically integrating models with data and systems.

Claude has MCP capabilities, yes.

gpuerrilla · Friday at 17:29

jpod said:
Hi is the MCP on the cloud?
...Searxng is a search engine with emphasis on privacy...

I used one of my pi's as I have my regular gaming/work machine and the dedicated ubuntu machine which can use it. This way I can update the one source and multiple machines can just point to this - rather than having to have my gaming machine on for example (the pi is low power and left online with any lightweight services).

I use the frontier services of Grok, Copilot, Gemini as they are free to a point, you can even have your hermes fallback to them but I tend to use mine local and private for work and research. You should also take a look at notebooklm for topics and reference purposes they are all great tools while you settle for what you want to do.

Months ago it was harder to get local models to have online search capability, as time has ticked over - most of this stuff is easier to setup now.

jpod · Friday at 17:40

Thank you for sharing your expertise. I am a notebook llm fan already.. Just starting to learn to keep up a little

Emlyn_Dewar · Friday at 18:03

I’ve used it for a while, and think it’s pretty great as a locally running service. Only ever hooked it in to Qwen 3.6 27B and Gemma 4 31B, but it’s interesting to chat to about various topics, and it’s even helped me improve my speeds on llama.cpp, in turn speeding it up.

Pointing it at a GitHub project and telling it that I have no idea how to implement it, it’s happy enough sorting things out even using these local models.
Don’t think I’d be happy using it on a machine that wasn’t purely dedicated to messing around though. Even with it asking permission before it does things.

I’m much just a hobby user, and don’t have much interest in frontier models. If I can’t download and run it locally, I don’t want to build a reliance on it. Qwen 3.5 397B-A17B was amusing to run, if only at Q4, but single digit tokens per second aren’t it…