Skip to content

Understanding Connexion’s intended use cases and throughput characteristics #2107

Description

@mehdihasan

Hi team,

We’re currently using the spec-first / Connexion framework for one of our observability pipeline ingestion APIs. The service runs on Kubernetes with HPA enabled.

During peak traffic, we’re observing that each pod can reliably handle roughly 9–10 parallel requests, but does not seem able to process more concurrent requests beyond that. As a result, handling peak load relies heavily on scaling out via HPA rather than increasing per-pod concurrency.

This made us pause and reflect, and we wanted to sanity-check our understanding with the maintainers directly.

A few questions we were hoping to get your perspective on:

Is this kind of per-pod concurrency/throughput characteristic typical or expected for a Connexion-based service?

Are there recommended configurations, deployment patterns, or tuning suggestions to increase per-pod throughput with this framework?

In your view, is Connexion a good fit for high-throughput ingestion-style workloads? If not, could you help explain why from a design or architectural standpoint?

More broadly, what kinds of use cases do you see Connexion being best suited for, and where do its strengths really shine? Conversely, what are the main trade-offs or limitations teams should be aware of?

We’re mainly trying to better understand the intended use cases and design goals of the framework, and to learn from the development team’s experience rather than making assumptions on our side.

Thanks in advance - really appreciate any insights you’re willing to share.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions