You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I believe this is happening as a result of the HTML sanitizer. So that I understand the usecase, is there a reason that you need to write unclosed html tags?
Thanks @rileyajones . In recent reinforcement fine-tuning approaches, the input prompts to the model may end with unclosed tags to trigger chain-of-thought reasoning (such as orz), where the tag is supposed to be closed by the models' outputs.
Environment information
Issue description
text log unexpectedly closes unclosed tags, for example
hello!<think>
becomeshello!<think></think>
.Steps to reproduce:
The text was updated successfully, but these errors were encountered: