Add instrumentation of AWS Bedrock to use gen_ai conventions #13355

anuraaga · 2025-02-20T05:30:31Z

Hello - I'm working on getting genai conventions implemented in OTel instrumentation. This adds genai conventions for AWS bedrock calls. It is a Java version of what's in Python and being worked on for JS

This is an initial implementation to include the minimum support, just Converse API with span attributes. Will followup with metrics, log events, other APIs, etc.

One point for the gen ai instrumentations is we're generally using an API recording mechanism rather than crafted mocks - this has become the industry standard for genai instrumentations due to the complexity and non-determinism in LLM responses. i.e., Python is also using a similar mechanism. While I've kept the implementation local to aws-sdk for now, it is with the intent that it could be moved to testing to be used in a future openai sdk instrumentation, or others.

/cc @codefromthecrypt

…nstrumentation into bedrock-genai

anuraaga · 2025-02-20T05:31:44Z

instrumentation/aws-sdk/aws-sdk-2.2/library/build.gradle.kts

@@ -72,6 +89,7 @@ tasks {
  }

  check {
+    dependsOn(testing.suites)


Note this causes other suites defined above to be run that I think previously weren't

anuraaga · 2025-02-20T05:32:43Z

...rc/main/java/io/opentelemetry/instrumentation/awssdk/v2_2/internal/BedrockRuntimeAccess.java

+      isEnabled = false;
+    } else {
+      try {
+        Class.forName("software.amazon.awssdk.services.bedrockruntime.model.ConverseRequest");


I think unlike SQS / SNS, because there is no entrypoint added for wrapping, I need this for safety when bedrock isn't included (like in :test target)

anuraaga · 2025-02-20T05:36:32Z

.../java/io/opentelemetry/instrumentation/awssdk/v2_2/internal/TracingExecutionInterceptor.java

@@ -128,6 +131,7 @@ public TracingExecutionInterceptor(
  }

  @Override
+  @SuppressWarnings("deprecation") // need to access deprecated signer


Even with compileOnly, the compile versions of the other artifacts also get bumped causing some new deprecations

codefromthecrypt

Thanks for bringing this to bare and also introducing infrastructure with a minimal change before doing too much, @anuraaga! I'm clicking approve but I'm not an approver, but anyway I approve ;)

Those looking, if you scroll bottom-up you'll see the recorded request, which is also how other genai SDKs have been working due to norms in other SDKs and also routine edge cases of responses which change more frequently than some might expect (not specific about bedrock)

RECORD_WITH_REAL_API is the key where it allows someone with credentials to record, without manual copy/paste, which allows the tests to run and survive refactoring etc vs needing credentials always.

cc'ing a few folks who I know are interested in java side like @karthikscale3 @ThomasVitale and @salaboy

salaboy · 2025-02-20T09:10:22Z

Good stuff! I am happy to see progress in these areas! @JonasKunz :)

xrmx

LGTM, asserts in tests looks familiar to what we have in the python implementation :)

anuraaga · 2025-02-21T00:42:05Z

@trask I noticed it says "requested a review from a team", not the team (on mobile it says "unknown") and there aren't any reviewers, so maybe there is something wrong with the assign script. Would you be able to help find a reviewer? Thanks!

anuraaga · 2025-02-21T03:59:56Z

@trask Sorry might be a red herring. @codefromthecrypt showed that he sees a reviewer listed, it is just not showing up for me. Perhaps there is a visibility restriction on the team. If there are eyes on the PR already, no worries

Cirilla-zmh · 2025-02-21T04:25:12Z

Great work! 👍

However, I have some concerns. Similar instrumentation also appears in frameworks like Spring AI for GenAI, which serves as a higher-level API. Clearly, Spring AI also references the AWS SDK. I think this may cause some confusion for new users, as they have to deal with the existence of two similar spans and add some configuration to suppress one of them.

Of course, this is definitely a common situation, similar to the relationship between Kafka and Spring-Kafka. The difference is that we can automatically suppress one of them. However, given that Spring AI provides library instrumentation, addressing this issue may become more challenging. cc @trask @ThomasVitale

I think we need to consider these issues, but I certainly don’t believe this will become a barrier for merging this PR. :)

Cirilla-zmh · 2025-02-21T04:27:00Z

BTW, do you plan to log the GenAI prompts and completions in events? I didn't see those elements in this PR. If you would be willing to share your approach, that would be incredibly helpful!

anuraaga · 2025-02-21T04:29:48Z

Hi @Cirilla-zmh - yes, this is just an initial PR to add the minimum level of instrumentation, while setting up the test infrastructure that will allow more easily iterating on features with smaller PRs. Will be following up with events (we'll probably use the stable log API directly), metrics, etc.

jaydeluca · 2025-02-21T10:48:22Z

...brary/src/main/java/io/opentelemetry/instrumentation/awssdk/v2_2/internal/AwsSdkRequest.java

+  ConverseRequest(BEDROCK_RUNTIME, "ConverseRequest", request("gen_ai.request.model", "modelId")),
+  ;


Suggested change

ConverseRequest(BEDROCK_RUNTIME, "ConverseRequest", request("gen_ai.request.model", "modelId")),

;

ConverseRequest(BEDROCK_RUNTIME, "ConverseRequest", request("gen_ai.request.model", "modelId"));

jaydeluca · 2025-02-21T10:57:05Z

...c/main/java/io/opentelemetry/instrumentation/awssdk/v2_2/AbstractAws2BedrockRuntimeTest.java

+            trace ->
+                trace.hasSpansSatisfyingExactly(
+                    span ->
+                        span.hasName("chat amazon.titan-text-lite-v1")


We could also assert the spanKind for all of these

Suggested change

span.hasName("chat amazon.titan-text-lite-v1")

span.hasName("chat amazon.titan-text-lite-v1")

.hasKind(SpanKind.INTERNAL)

jaydeluca · 2025-02-21T10:57:56Z

...c/main/java/io/opentelemetry/instrumentation/awssdk/v2_2/AbstractAws2BedrockRuntimeTest.java

+                                equalTo(GEN_AI_USAGE_INPUT_TOKENS, 8),
+                                equalTo(GEN_AI_USAGE_OUTPUT_TOKENS, 14),
+                                equalTo(
+                                    GEN_AI_RESPONSE_FINISH_REASONS, Arrays.asList("end_turn")))));


Arrays.asList is used several times, could do static import

Suggested change

GEN_AI_RESPONSE_FINISH_REASONS, Arrays.asList("end_turn")))));

GEN_AI_RESPONSE_FINISH_REASONS, asList("end_turn")))));

anuraaga added 3 commits February 20, 2025 14:24

Add instrumentation of AWS Bedrock to use gen_ai conventions

ce57073

Merge branch 'main' of github.com:open-telemetry/opentelemetry-java-i…

1fd295d

…nstrumentation into bedrock-genai

Drift

e8a70b6

anuraaga requested a review from a team as a code owner February 20, 2025 05:30

anuraaga commented Feb 20, 2025

View reviewed changes

Update muzzle

1a6cda3

codefromthecrypt approved these changes Feb 20, 2025

View reviewed changes

xrmx approved these changes Feb 20, 2025

View reviewed changes

Cirilla-zmh approved these changes Feb 21, 2025

View reviewed changes

jaydeluca reviewed Feb 21, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add instrumentation of AWS Bedrock to use gen_ai conventions #13355

Add instrumentation of AWS Bedrock to use gen_ai conventions #13355

anuraaga commented Feb 20, 2025 •

edited

Loading

anuraaga Feb 20, 2025

anuraaga Feb 20, 2025

anuraaga Feb 20, 2025

codefromthecrypt left a comment

salaboy commented Feb 20, 2025

xrmx left a comment

anuraaga commented Feb 21, 2025

anuraaga commented Feb 21, 2025

Cirilla-zmh commented Feb 21, 2025

Cirilla-zmh commented Feb 21, 2025

anuraaga commented Feb 21, 2025

jaydeluca Feb 21, 2025

jaydeluca Feb 21, 2025

jaydeluca Feb 21, 2025

		ConverseRequest(BEDROCK_RUNTIME, "ConverseRequest", request("gen_ai.request.model", "modelId")),
		;

	span.hasName("chat amazon.titan-text-lite-v1")
	span.hasName("chat amazon.titan-text-lite-v1")
	.hasKind(SpanKind.INTERNAL)

	GEN_AI_RESPONSE_FINISH_REASONS, Arrays.asList("end_turn")))));
	GEN_AI_RESPONSE_FINISH_REASONS, asList("end_turn")))));

Add instrumentation of AWS Bedrock to use gen_ai conventions #13355

Are you sure you want to change the base?

Add instrumentation of AWS Bedrock to use gen_ai conventions #13355

Conversation

anuraaga commented Feb 20, 2025 • edited Loading

anuraaga Feb 20, 2025

Choose a reason for hiding this comment

anuraaga Feb 20, 2025

Choose a reason for hiding this comment

anuraaga Feb 20, 2025

Choose a reason for hiding this comment

codefromthecrypt left a comment

Choose a reason for hiding this comment

salaboy commented Feb 20, 2025

xrmx left a comment

Choose a reason for hiding this comment

anuraaga commented Feb 21, 2025

anuraaga commented Feb 21, 2025

Cirilla-zmh commented Feb 21, 2025

Cirilla-zmh commented Feb 21, 2025

anuraaga commented Feb 21, 2025

jaydeluca Feb 21, 2025

Choose a reason for hiding this comment

jaydeluca Feb 21, 2025

Choose a reason for hiding this comment

jaydeluca Feb 21, 2025

Choose a reason for hiding this comment

anuraaga commented Feb 20, 2025 •

edited

Loading