Optimize content endpoint: cache OCI client and extracted SKILL.md

## Context

The `GET /api/v1/skills/{ns}/{name}/versions/{ver}/content` endpoint (PR #11) creates a brand-new OCI client and temp store directory for every request. It pulls the full layer blob, extracts SKILL.md, serves it, then cleans up.

## Problem

This is extremely inefficient for repeated requests to the same skill version. Each request:
- Creates a new OCI layout store in a temp directory
- Pulls the entire layer from the registry
- Decompresses gzip + extracts tar to find SKILL.md
- Deletes everything on response

## Proposed solution

Options (in order of complexity):

1. **In-memory LRU cache** — cache extracted SKILL.md content keyed by `(repository, tag, digest)`. Invalidate on digest change during sync. Simplest, covers the 90% case.

2. **Shared OCI client** — reuse a single client instance across requests so pulled layers are cached in the local OCI store. Add periodic cleanup of old layers.

3. **Streaming tar extraction** — instead of unpacking to disk, stream through the gzip+tar to extract only SKILL.md without writing temp files.

## Current state

Documented as a known limitation in PR #11. Acceptable for Phase 2a since content requests are infrequent (individual skill detail pages), but should be optimized before production use.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize content endpoint: cache OCI client and extracted SKILL.md #12

Context

Problem

Proposed solution

Current state

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Optimize content endpoint: cache OCI client and extracted SKILL.md #12

Description

Context

Problem

Proposed solution

Current state

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions