Accelerating LLM Inference with Prompt Caching for Open‑Source Models on Databricks Databricks