@perpetuum_mobile Since Gemma4 came out, I agree it's been the gold standard for performance vs compute. If SoC is the way forward for local compute (and I think its clear it is) the real jump happens when unified memory architectures can actually handle the token volume an agentic harness needs. Progress on memory overhead for long-context agents, combined with advancements in unified pool architecture, make this a real possibility in the near future.