Conversation

asmigosw (Contributor)

The prompt length (PL) needs to be greater than the number of decoded image tokens plus a buffer. When a user passed a PL smaller than this value, VLM inference failed, so this PR adds a check that updates the PL and the context length (CL) accordingly.

Signed-off-by: Asmita Goswami <[email protected]>
f"Increasing increase the prompt length to at least {total_tokens + prompt_len}."
)
prompt_len = total_tokens + prompt_len
ctx_len = prompt_len + 50
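
To make the check concrete, here is a minimal, self-contained sketch of the guard described above. The function name adjust_lengths, the trigger condition, and the logging setup are assumptions for illustration; total_tokens (decoded image tokens plus buffer), prompt_len, ctx_len, and the fixed 50-token headroom are taken from the snippet.

import logging

logger = logging.getLogger(__name__)

GEN_BUFFER = 50  # fixed headroom for generated tokens, as in the diff above

def adjust_lengths(prompt_len: int, ctx_len: int, total_tokens: int) -> tuple[int, int]:
    # total_tokens is the image-token footprint the prompt must accommodate.
    # If the user-supplied prompt length (PL) is too small, bump it and
    # recompute the context length (CL) so VLM inference does not fail.
    if prompt_len < total_tokens:
        logger.warning(
            f"Increasing the prompt length to at least {total_tokens + prompt_len}."
        )
        prompt_len = total_tokens + prompt_len
        ctx_len = prompt_len + GEN_BUFFER
    return prompt_len, ctx_len

For example, adjust_lengths(prompt_len=32, ctx_len=128, total_tokens=576) returns (608, 658): the prompt length is raised past the image-token footprint, and the context length is rebuilt with the 50-token buffer.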
Contributor

Where is this 50 coming from?

@asmigosw asmigosw marked this pull request as draft May 21, 2025 08:15
asmigosw added 3 commits May 21, 2025 08:29
Signed-off-by: Asmita Goswami <[email protected]>
Signed-off-by: Asmita Goswami <[email protected]>
Signed-off-by: Asmita Goswami <[email protected]>
@asmigosw asmigosw marked this pull request as ready for review May 21, 2025 08:35
@quic-rishinr quic-rishinr marked this pull request as draft June 9, 2025 05:20
@asmigosw asmigosw marked this pull request as ready for review June 30, 2025 09:50