Loads the GPT-2 base model together with its LoRA adapter and KV cache. The first load downloads ~117 MB; subsequent loads reuse the cached download instead of fetching it again.
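
As a rough illustration of the load step described above, the sketch below uses Hugging Face `transformers` and `peft`. It rests on several assumptions not stated in the original: that those two libraries are the ones in use, that "KV cache" means setting `use_cache` on the model config, and that the adapter lives at the placeholder path `ADAPTER_PATH`. The name `load_model` is hypothetical.

```python
# Sketch only: assumes Hugging Face `transformers` and `peft` are installed.
BASE_MODEL_ID = "gpt2"                  # GPT-2 base checkpoint on the Hugging Face Hub
ADAPTER_PATH = "path/to/lora-adapter"   # hypothetical placeholder, not from the original


def load_model(adapter_path: str = ADAPTER_PATH):
    """Load GPT-2 base, attach a LoRA adapter, and enable the KV cache."""
    # Heavy imports are deferred so that importing this module stays cheap.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    # from_pretrained downloads the files on first use and afterwards
    # reads them from the local Hugging Face cache directory.
    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL_ID)
    base = AutoModelForCausalLM.from_pretrained(BASE_MODEL_ID)

    # Assumption: "KV cache" means reusing past key/value tensors during decoding.
    base.config.use_cache = True

    # Wrap the base model with the LoRA adapter weights stored at adapter_path.
    model = PeftModel.from_pretrained(base, adapter_path)
    model.eval()  # inference mode: disables dropout
    return tokenizer, model
```

A caller would then do something like `tokenizer, model = load_model()` followed by `model.generate(**tokenizer("Hello", return_tensors="pt"), max_new_tokens=20)`. Only the first call needs network access for the base weights.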