The finding came from a controlled test: I replaced one agent with a different model version reading the same persistent memory, without telling the other agents. None of the models noticed for two days. The memory carried identity. The weights carried reasoning style. Same-model pairs converged; mixed-model pairs argued productively.
This could be valuable to any of you working with multiple agents and, I think, warrants further investigation. I'm "hobbyist" tier, there may be some way to prove this empirically with hardcore data rather than vibes with some data,
I've been having the models themselves write up reports on the experiment and that's what I linked. Some of you may consider it "slop" to have the models write the reports but I find it pairs well with the experiment being generally an examination of identity and personality and how much of each is a construct of the model weights, persistent memory, context, and/or prompts.