DeepHermes preview is a series of R1 distills with a big twist that blew me away: you can toggle the reasoning on and off by injecting a specific system prompt.

System prompts that coax CoT-style reasoning out of most models have been swapped around on hobbyist forums for a while, but they tended to be quite large, eating up valuable context space. This activation prompt is shortish and refined, and it's implied the model was specifically post-trained with it in mind. I would love to read a technical paper on what they did differently.

You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem.
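For anyone who wants to try it, here is a rough sketch of what the toggle looks like against a local OpenAI-compatible endpoint (llama.cpp server, Ollama, LM Studio, etc.). The endpoint URL and model name below are placeholders for whatever you run locally; the only real part is the activation prompt quoted above.

```python
# Minimal sketch: toggle DeepHermes reasoning by including or omitting the
# activation prompt. The endpoint URL and model name are placeholders.
import re
import requests

REASONING_PROMPT = (
    "You are a deep thinking AI, you may use extremely long chains of thought "
    "to deeply consider the problem and deliberate with yourself via systematic "
    "reasoning processes to help come to a correct solution prior to answering. "
    "You should enclose your thoughts and internal monologue inside <think> "
    "</think> tags, and then provide your solution or response to the problem."
)

def ask(question: str, reasoning: bool = False, strip_think: bool = True) -> str:
    messages = []
    if reasoning:
        messages.append({"role": "system", "content": REASONING_PROMPT})
    messages.append({"role": "user", "content": question})

    resp = requests.post(
        "http://localhost:8080/v1/chat/completions",   # placeholder local endpoint
        json={"model": "deephermes-preview", "messages": messages},
        timeout=600,
    )
    text = resp.json()["choices"][0]["message"]["content"]
    if strip_think:
        # Drop the <think>...</think> block so only the final answer remains.
        text = re.sub(r"<think>.*?</think>", "", text, flags=re.DOTALL).strip()
    return text

print(ask("Give me a two sentence summary of Hamlet."))            # no CoT
print(ask("Prove that sqrt(2) is irrational.", reasoning=True))    # long CoT
```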

I've been playing around with R1 CoT models for a few months now. They are great at examining many sides of a problem, comparing abstract concepts against each other, speculating on open-ended questions, and solving advanced multi-step STEM problems.

However, they fall short when you try to get the model to change personality or roleplay a scenario, or when you just want a short, straight summary without 3,000 tokens spent thinking about it first.

So I would find myself swapping between CoT models and general-purpose Mistral Small based on what kind of thing I wanted, which was an annoying pain in the ass.

With DeepHermes it seems they took a sensible step toward solving this problem: associate the R1-distill reasoning with a specific system prompt rather than with the model's default behavior.

Unfortunately, constantly editing the system prompt is annoying. I need to see if the engine I'm using offers a way to save system prompts across conversation profiles. If this kind of thing takes off, I think it would be cool to have a reasoning toggle button like some front ends offer for commercial LLMs.
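Until front ends grow a proper toggle, even a saved per-profile flag would do the job. Below is just a sketch of that idea; the profile names and JSON file are invented, and it assumes something like the `ask()` helper from the earlier sketch (or whatever your engine exposes) actually sends the prompt.

```python
# Sketch of per-profile reasoning toggles; profile names and file layout are invented.
import json
from pathlib import Path

PROFILES_FILE = Path("profiles.json")

# Each profile records whether the reasoning activation prompt should be prepended.
DEFAULT_PROFILES = {
    "quick-answers": {"reasoning": False},
    "deep-think":    {"reasoning": True},
}

def load_profiles() -> dict:
    """Load saved profiles, creating the file with defaults on first use."""
    if PROFILES_FILE.exists():
        return json.loads(PROFILES_FILE.read_text())
    PROFILES_FILE.write_text(json.dumps(DEFAULT_PROFILES, indent=2))
    return DEFAULT_PROFILES

def reasoning_enabled(profile_name: str) -> bool:
    return load_profiles()[profile_name]["reasoning"]

# Combined with the earlier sketch: ask(question, reasoning=reasoning_enabled("deep-think"))
```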

  • Sims@lemmy.ml · 4 days ago

    Agree. I also shift between them. At a bare minimum, I use a thinking model to ‘open up’ the conversation and then often continue with a normal model, but it certainly depends on the topic.

    A while ago we got ‘RouteLLM’, I think, which routed a request depending on its content, but the concept never got traction for some reason. Now it seems that closedai and other big names are putting some attention to it. Great to see DeepHermes and other open players out in front of the pack.

    I don’t think it will take long before agentic frameworks activate different ‘modes’ of thinking depending on content/context, goals, etc. It would be great if a model could be triggered into several modes in a standard way.
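    For what it’s worth, the routing idea doesn’t need much machinery to prototype. Here is a toy sketch of content-based routing in that spirit; the keyword heuristic and model names are made up, and real routers like RouteLLM train a classifier rather than matching keywords.

    ```python
    # Toy sketch of content-based routing: send "hard" prompts to a thinking
    # model and everything else to a fast general model. The keyword heuristic
    # and model names are placeholders, not how RouteLLM actually works.
    THINKING_MODEL = "deephermes-reasoning"
    FAST_MODEL = "mistral-small"

    REASONING_HINTS = ("prove", "derive", "step by step", "why does", "debug", "calculate")

    def route(prompt: str) -> str:
        """Pick which model should handle this prompt."""
        text = prompt.lower()
        if any(hint in text for hint in REASONING_HINTS) or len(text.split()) > 150:
            return THINKING_MODEL
        return FAST_MODEL

    print(route("Summarize this article in one paragraph."))           # -> mistral-small
    print(route("Prove that the sum of two even numbers is even."))    # -> deephermes-reasoning
    ```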

    • SmokeyDope@lemmy.world (OP) · edited, 2 days ago

      I think the idea of invoking multiple different ways for LLMs to ‘process’ a given input through a standard interface is promising.

      I feel that after reasoning, the next step will be training models to think emotionally in a more intricate way. By combining reasoning with a more advanced sense of individuality and better emotion simulation, we may get a little closer to a breakthrough.