Running LLM Inference on Kubernetes: What It Actually Takes Security Boulevard