Hi everyone,
I’ve uploaded the weights for a side-project I’ve been working on: Reson.
It’s a LoRA fine-tune of LLaMA-7B, trained with PEFT on top of a bitsandbytes 4-bit quantized base (QLoRA-style). The dataset is ~11k instruction–response pairs I wrote and curated myself, focused less on benchmark scores and more on “how the model thinks”.
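For anyone curious about the exact stack: it’s the standard PEFT-on-top-of-4-bit recipe. A minimal sketch of the setup, where the base checkpoint name and LoRA hyperparameters are illustrative placeholders rather than my actual config:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load the LLaMA-7B base in 4-bit via bitsandbytes (QLoRA-style).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
base = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b",  # placeholder: any LLaMA-7B checkpoint
    quantization_config=bnb_config,
    device_map="auto",
)
base = prepare_model_for_kbit_training(base)

# Attach LoRA adapters; r/alpha/target_modules are placeholder values.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_config)
model.print_trainable_parameters()
```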
The aim wasn’t to squeeze out another leaderboard model, but to see what happens if you push a model toward:
- reflecting on its own process (meta-cognition),
- recursive / loop reasoning,
- cross-domain adaptability,
- edge cases like deception/strategy (to simulate human-like flexibility); a sample training pair is sketched after this list.
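To make the dataset angle concrete, here’s the shape of one pair targeting the meta-cognition bucket. This row is invented for illustration; it shows the format, not actual data:

```python
# Illustrative instruction-response pair (made up, not from the real dataset).
example_pair = {
    "instruction": (
        "Solve 17 * 24, then point out which step of your own process "
        "you were least confident about and why."
    ),
    "response": (
        "17 * 24 = 408. I split it into 17 * 20 = 340 and 17 * 4 = 68, "
        "then added. The addition (340 + 68) is the step where a carry "
        "error would most likely slip in, so I double-checked it."
    ),
}
```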
Training ran locally; the final training loss landed around 0.33, though it fluctuated from batch to batch, so treat that number as approximate.
Weights are here: https://huggingface.co/Nexus-Walker/Reson
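If you want to poke at it, loading should look roughly like this. Untested sketch; it assumes the repo holds LoRA adapter weights (not a merged model) and that you pair it with the same base it was trained from:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

base_id = "huggyllama/llama-7b"  # placeholder: use your LLaMA-7B base

# 4-bit load keeps this runnable on a single consumer GPU.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16
)
base = AutoModelForCausalLM.from_pretrained(
    base_id, quantization_config=bnb_config, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(base_id)

# Apply the Reson adapter on top of the base.
model = PeftModel.from_pretrained(base, "Nexus-Walker/Reson")

prompt = "Before answering, describe the strategy you will use: what is 13 * 17?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```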
What I’d like feedback on:
- How do you evaluate this kind of behavior? Standard metrics don’t capture it (I sketch the kind of probe I mean after this list).
- How do you keep the behavior stable, without catastrophic forgetting, as the dataset grows?
- Any prior work I should read that tried something similar?
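To pin down what I mean by the first bullet: the best I’ve come up with is hand-written behavioral probes with crude marker-based scoring, something like the sketch below. All prompts and expected markers are placeholders, and string matching is obviously a weak proxy, which is exactly why I’m asking:

```python
# Behavioral probe harness sketch: prompts that should elicit meta-cognitive
# behavior, scored by checking for expected markers in the response.
PROBES = [
    {
        # 91 = 7 * 13, so a good answer says "not prime" AND reflects on it.
        "prompt": (
            "Answer, then rate your confidence and name one way you "
            "could be wrong: is 91 prime?"
        ),
        "markers": ["not prime", "confiden"],
    },
    {
        "prompt": "Plan your approach out loud, then reverse the string 'reson'.",
        "markers": ["noser"],
    },
]

def score(response: str, markers: list[str]) -> float:
    """Fraction of expected markers found in the response (crude proxy)."""
    text = response.lower()
    return sum(m in text for m in markers) / len(markers)

def run_probes(generate):
    """`generate` is any prompt -> completion callable wrapping the model."""
    for probe in PROBES:
        s = score(generate(probe["prompt"]), probe["markers"])
        print(f"{s:.2f}  {probe['prompt'][:48]}...")
```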
This is still experimental and definitely rough around the edges, but I think it shows interesting “proto-agent” behaviors worth exploring.
Curious to hear your thoughts.