Privileged issue
Issue Content
https://blog.google/innovation-and-ai/technology/developers-tools/introducing-flex-and-priority-inference/
Possibly just the AI Studio API, but this allows cost savings or priority access for a fee.
Mostly configuration parameter changes (so likely in the discovery doc), but it does have a header in the reply with the service tier actually used.
Is there a standard way to pass this along so LangSmith can handle the billing estimate accordingly?
(Hunter Lovell (@hntrl) Christian Bromann (@christian-bromann))
Privileged issue
Issue Content
https://blog.google/innovation-and-ai/technology/developers-tools/introducing-flex-and-priority-inference/
Possibly just the AI Studio API, but this allows cost savings or priority access for a fee.
Mostly configuration parameter changes (so likely in the discovery doc), but it does have a header in the reply with the service tier actually used.
Is there a standard way to pass this along so LangSmith can handle the billing estimate accordingly?
(Hunter Lovell (@hntrl) Christian Bromann (@christian-bromann))