text input from datastream for v1.0 #1521

longcw · 2025-02-19T11:01:15Z

No description provided.

changeset-bot · 2025-02-19T11:01:20Z

⚠️ No Changeset found

Latest commit: fffa2dc

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

longcw · 2025-02-19T11:03:03Z

livekit-agents/livekit/agents/pipeline/room_io.py

+                extra={"text": text, "participant": self._participant_identity},
+            )
+            self._agent.interrupt()
+            self._agent.generate_reply(user_input=text)


We are calling the agent inside RoomInput; is that okay? Should agent.input have a text input, or should we make the agent required for RoomInput?

Let's make agent required

Ok, I'll do that when merging the RoomInput and Output.
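
For illustration, a minimal sketch of what "agent required" could look like. `StubAgent` and `TextInput` are hypothetical stand-ins, not the actual classes in room_io.py; only the `interrupt()` and `generate_reply(user_input=...)` calls come from the diff above.

```python
from dataclasses import dataclass, field


@dataclass
class StubAgent:
    """Hypothetical stand-in for the pipeline agent; it only records calls."""

    calls: list = field(default_factory=list)

    def interrupt(self) -> None:
        self.calls.append("interrupt")

    def generate_reply(self, *, user_input: str) -> None:
        self.calls.append(("generate_reply", user_input))


class TextInput:
    """Hypothetical stand-in for the text-input part of RoomInput."""

    def __init__(self, agent: StubAgent) -> None:
        # The agent is a required argument, so the handler needs no None check.
        self._agent = agent

    def on_text(self, text: str) -> None:
        # Same order as in the diff: interrupt first, then reply to the text.
        self._agent.interrupt()
        self._agent.generate_reply(user_input=text)


agent = StubAgent()
TextInput(agent).on_text("hello")
```

With the agent passed at construction time, the text handler can call it unconditionally, which is the coupling the question above is asking about.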

lukasIO · 2025-02-19T15:28:54Z

livekit-agents/livekit/agents/pipeline/room_io.py

@@ -48,6 +50,7 @@ class RoomOutputOptions:
 DEFAULT_ROOM_INPUT_OPTIONS = RoomInputOptions()
 DEFAULT_ROOM_OUTPUT_OPTIONS = RoomOutputOptions()
 LK_PUBLISH_FOR_ATTR = "lk.publish_for"
+LK_TEXT_INPUT_TOPIC = "lk.room_text_input"


We should be clear about which topics we want to support. If the goal is to have this work out of the box with chat components, then choosing a custom topic here might not be the best choice.

I think the question is whether we are going to keep the chat components in the python/js SDKs, and whether we want both the original and the datastream, or only the datastream. If so, I can adjust here accordingly. cc @davidzhao

the chat components are for client-side. but Python/Node agents should agree on the same topic so that it works with the client-side components

@lukasIO what do you recommend we use? Is the client-side component listening to both the transcription topic and the chat topic?

currently chat components only listen to the chat topic and also send their messages only on the chat topic

How would this work with the way we are sending transcriptions? Do you suggest also sending transcriptions to the chat topic?

that was my understanding, yes. But maybe I misunderstood or am forgetting something.
Why wouldn't you want it on the chat topic?

Mostly wondering if there are any conflicts between what the agent would want to use as input and what is being emitted as output.

i.e. if there are two agents in the room, would that cause any cross talk? Or if the agent is added to a livestream with a chat feature, would it automatically start interpreting random transcripts?

For that reason it seems like a good idea to be explicit about what is being sent to the agent as input.
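
One way to picture "being explicit" is a filter that accepts text only on the dedicated input topic and only from the participant the agent is linked to. This is a sketch under assumptions: `accept_text_input` and its parameters are hypothetical, and only the `LK_TEXT_INPUT_TOPIC` value is taken from the diff.

```python
LK_TEXT_INPUT_TOPIC = "lk.room_text_input"  # value taken from the diff above


def accept_text_input(
    topic: str,
    sender_identity: str,
    *,
    linked_identity: str,
    input_topic: str = LK_TEXT_INPUT_TOPIC,
) -> bool:
    """Return True only for text on the input topic from the linked participant.

    Hypothetical helper: text on any other topic (for example a shared chat or
    another agent's transcription output) is ignored, as is text from other
    participants.
    """
    return topic == input_topic and sender_identity == linked_identity


# Text from the linked user on the input topic is accepted.
ok = accept_text_input(LK_TEXT_INPUT_TOPIC, "user-1", linked_identity="user-1")
# Text on another (hypothetical) topic, or from another sender, is rejected.
wrong_topic = accept_text_input("some.other.topic", "user-1", linked_identity="user-1")
wrong_sender = accept_text_input(LK_TEXT_INPUT_TOPIC, "agent-2", linked_identity="user-1")
```

The trade-off raised in this thread still applies: a dedicated input topic avoids cross talk, but client-side chat components would need to send on that same topic for this to work without extra configuration.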

longcw · 2025-02-20T05:16:06Z

Do we want to keep the ChatManager in the python sdk? Related PR: livekit/python-sdks#360

davidzhao · 2025-02-20T05:56:30Z

Do we want to keep the ChatManager in the python sdk? Related PR: livekit/python-sdks#360

IMO we should deprecate and remove ChatManager. Agents should no longer use it.

longcw added 5 commits February 18, 2025 19:13

add user transcription for realtime model

d371b70

Merge remote-tracking branch 'origin/dev-1.0' into longc/realtime-mod…

a316ecb

…el-user-transcription

wip for text input

07e79ac

Merge remote-tracking branch 'origin/dev-1.0' into longc/text-input

6ff00cb

fix when realtime model user transcription disabled

fffa2dc

longcw requested a review from davidzhao February 19, 2025 11:01

longcw requested a review from theomonnom February 19, 2025 11:01

longcw commented Feb 19, 2025

theomonnom approved these changes Feb 19, 2025

longcw merged commit 9525860 into dev-1.0 Feb 19, 2025

longcw deleted the longc/text-input branch February 19, 2025 15:12

lukasIO reviewed Feb 19, 2025

theomonnom pushed a commit that referenced this pull request Feb 21, 2025

text input from datastream for v1.0 (#1521)

dbba990
