Choosing a Mac Mini for local LLMs — what would YOU actually buy?

Got three options on my radar and genuinely can't decide. Not looking for spec sheets - want to hear from people actually running this stuff daily:

- M4 (32GB) - newest, but apparently the slowest of the three for inference?
- M2 Pro (32GB) - heard it actually beats the base M4 on tok/s
- M1 Max (64GB) - oldest chip, but highest memory bandwidth

Running Ollama, coding assistants (Qwen/Kimi), maybe some RAG pipelines. Budget is $2-3k, so I'm not totally screwed on options. And yeah, obviously OpenClaw, to stop spending on closed models.
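For anyone wanting to sanity-check the bandwidth point: a rough rule of thumb is that generating each token streams all the model weights from memory once, so memory bandwidth divided by model size gives a ceiling on decode tok/s. Below is a minimal sketch of that estimate. The bandwidth figures are Apple's published peak numbers for each chip; the 18 GB model size is purely an assumed example (roughly a 30B-class model at 4-bit), and real-world speeds will land below these ceilings.

```python
# Back-of-envelope decode-speed ceiling: bandwidth / bytes read per token.
# Assumes a dense model whose full weights are read once per generated token.

# Apple's published peak memory bandwidth, in GB/s.
BANDWIDTH_GB_S = {
    "M4": 120.0,
    "M2 Pro": 200.0,
    "M1 Max": 400.0,
}

# Assumption for illustration only: quantized weights occupying about 18 GB.
MODEL_SIZE_GB = 18.0


def max_decode_tok_s(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper bound on tokens/second when decoding is memory-bandwidth bound."""
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return bandwidth_gb_s / model_size_gb


if __name__ == "__main__":
    for chip, bandwidth in BANDWIDTH_GB_S.items():
        ceiling = max_decode_tok_s(bandwidth, MODEL_SIZE_GB)
        print(f"{chip}: at most ~{ceiling:.1f} tok/s for a {MODEL_SIZE_GB:.0f} GB model")
```

This only models token generation; prompt processing is compute-bound and tends to favor newer chips, and mixture-of-experts models read only a fraction of their weights per token, so they will not follow this estimate.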