Animated — primary data flow
Dashed — remote access path
Remote devices
Network — LAN (192.168.x.x) or internet via ngrok / SSH tunnel
macOS host · Apple Silicon
macOS services
System Settings → Sharing
Ollama
0.0.0.0:11434 via LaunchAgent
llama.cpp / MLX inference engine
Apple Silicon Metal · unified memory · automatic GPU offload
e4b · 12b · 27b — GGUF 4-bit · 128K–256K context
~/.ollama/models/blobs — SHA256-named GGUF files on disk
↑ Select any node to view details and commands