Verdict
If your data cannot leave the machine and you need 70B-class capability at production speed, this is the answer. Llama 3.3 70B Q4 at 22 tok/s through ZeroClaw with network egress denied. The €4,500-7,000 price tag rules out hobbyist use, but for regulated-industry self-hosting it's competitive with managed alternatives.
Setup notes
macOS host. Ollama with Llama 3.3 70B Q4. ZeroClaw install. Air-gap mode if your threat model demands it. Check for Asahi Linux's server readiness if Linux is required.
Performance
70B Q4: 22 tok/s. 192 GB unified memory means you can keep multiple models warm simultaneously without swapping.
What breaks
- Cost-sensitive setups
- Linux-only production stacks
Want to know more
See the full ZeroClaw review and the Mac Studio M3 Ultra buyer's notes.