fix: add Unicode support for file path mentions in slash commands #7241

roomote · 2025-08-20T02:20:28Z

This PR fixes issue #7240 where slash commands could not read files with Chinese (or other Unicode) characters in their names.

Problem

The mention regex pattern was only matching ASCII characters in file paths, which prevented users from mentioning files with Unicode characters (Chinese, Japanese, Korean, Arabic, etc.) in their names.

Solution

Added the Unicode flag (u) to mentionRegex and mentionRegexGlobal to enable proper Unicode character matching
The existing pattern [^\s\\] already matches non-whitespace characters, and with the Unicode flag, it now correctly handles all Unicode characters

Testing

Added comprehensive test cases for various Unicode scripts including:
- Chinese characters (路径/中文文件.txt)
- Japanese characters (パス/日本語ファイル.txt)
- Korean characters (경로/한국어파일.txt)
- Arabic characters (مسار/ملف_عربي.txt)
- Cyrillic characters (путь/русский_файл.txt)
- Mixed languages and emoji in filenames
All existing tests pass without regression
New tests confirm Unicode file paths are properly recognized

Screenshots

As shown in the issue, users were unable to mention files with Chinese names. This fix enables proper recognition of Unicode characters in file paths.

Fixes #7240

Important

This PR adds Unicode support for file path mentions in slash commands and introduces the Featherless provider with new AI models.

Behavior:
- Fixes issue Slash command, In Chinese, the content cannot be read #7240 by adding Unicode support to mentionRegex and mentionRegexGlobal in context-mentions.ts.
- Updates regex to match Unicode characters in file paths, allowing mentions of files with Chinese, Japanese, Korean, Arabic, etc.
Testing:
- Adds test cases in context-mentions.spec.ts for Unicode scripts: Chinese, Japanese, Korean, Arabic, Cyrillic, mixed languages, and emoji.
- Confirms all existing tests pass without regression.
Providers:
- Adds FeatherlessHandler in featherless.ts and updates index.ts to include it.
- Defines featherlessModels with various AI models in featherless.ts.
- Updates provider-settings.ts to include featherless in provider schemas.

^{This description was created by}^{for 1938e89. You can customize this summary. It will automatically update as commits are pushed.}

- Add Featherless to provider names list - Create Featherless model types with popular models - Implement FeatherlessHandler using OpenAI-compatible API - Add Featherless schema and configuration - Wire up Featherless in API handler Closes #7237

- Added Unicode flag (u) to mentionRegex and mentionRegexGlobal to properly match Unicode characters - Added comprehensive tests for various Unicode scripts (Chinese, Japanese, Korean, Arabic, Cyrillic, etc.) - Updated documentation to clarify Unicode support in file paths - Fixes #7240

roomote

Reviewing my own code is like debugging in production - technically possible but morally questionable.

roomote · 2025-08-20T02:24:58Z

src/shared/context-mentions.ts

  - Ensures that trailing punctuation marks (like commas, periods, etc.) are not included in the match, allowing punctuation to follow the mention without being part of it.

 - **Regex Breakdown**:
-  - `/@`: 
+  - `/@`:
 	- **@**: The mention must start with the '@' symbol.

  - `((?:\/|\w+:\/\/)[^\s]+?|problems\b|git-changes\b)`:


Is this intentional? The comment in line 12 still shows the old regex pattern structure without mentioning the Unicode flag. Since we're updating the documentation to mention Unicode support, shouldn't we also update the actual pattern shown in the breakdown to reflect the 'u' flag that's now part of the pattern?

roomote · 2025-08-20T02:24:58Z

packages/types/src/provider-settings.ts

@@ -48,6 +48,7 @@ export const providerNames = [
 	"fireworks",
 	"io-intelligence",
 	"roo",
+	"featherless",


This seems unrelated to the Unicode file mention fix described in the PR. Adding a new Featherless provider is a completely separate feature that should be in its own PR. Mixing unrelated changes makes it harder to review and can introduce unexpected issues.

roomote · 2025-08-20T02:24:58Z

packages/types/src/providers/featherless.ts

@@ -0,0 +1,81 @@
+import type { ModelInfo } from "../model.js"


This entire file is unrelated to fixing Unicode support for file mentions. Could we separate the Featherless provider addition into a different PR to keep changes focused?

roomote · 2025-08-20T02:24:58Z

src/api/providers/featherless.ts

@@ -0,0 +1,19 @@
+import { type FeatherlessModelId, featherlessDefaultModelId, featherlessModels } from "@roo-code/types"


Another file that's unrelated to the Unicode fix. Also, if this provider implementation stays (though it should be in a separate PR), where are the tests for it?

roomote · 2025-08-20T02:24:58Z

src/shared/__tests__/context-mentions.spec.ts

@@ -25,6 +25,19 @@ describe("mentionRegex and mentionRegexGlobal", () => {
 		{ input: "@a1b2c3d", expected: ["@a1b2c3d"] }, // Git commit hash (short)
 		{ input: "@a1b2c3d4e5f6a7b8c9d0e1f2a3b4c5d6e7f8a9b0", expected: ["@a1b2c3d4e5f6a7b8c9d0e1f2a3b4c5d6e7f8a9b0"] }, // Git commit hash (long)

+		// Unicode file paths (Chinese, Japanese, Korean, Arabic, etc.)


Great test coverage for various Unicode scripts! Consider adding edge cases like:

RTL text mixed with LTR paths (e.g., Hebrew/Arabic text with English paths)

Zero-width characters and combining marks

Complex emoji with skin tone modifiers or ZWJ sequences

Unicode normalization differences (NFC vs NFD)

roomote · 2025-08-20T02:24:58Z

src/shared/context-mentions.ts


 */
 export const mentionRegex =
-	/(?<!\\)@((?:\/|\w+:\/\/)(?:[^\s\\]|\\ )+?|[a-f0-9]{7,40}\b|problems\b|git-changes\b|terminal\b)(?=[.,;:!?]?(?=[\s\r\n]|$))/
-export const mentionRegexGlobal = new RegExp(mentionRegex.source, "g")
+	/(?<!\\)@((?:\/|\w+:\/\/)(?:[^\s\\]|\\ )+?|[a-f0-9]{7,40}\b|problems\b|git-changes\b|terminal\b)(?=[.,;:!?]?(?=[\s\r\n]|$))/u


Has the performance impact of the Unicode flag been tested with large texts containing many mentions? The 'u' flag can affect regex performance, especially with complex patterns and long strings.

roomote-agent added 2 commits August 19, 2025 23:05

feat: add Featherless AI as a provider

5cf78a4

- Add Featherless to provider names list - Create Featherless model types with popular models - Implement FeatherlessHandler using OpenAI-compatible API - Add Featherless schema and configuration - Wire up Featherless in API handler Closes #7237

roomote bot requested review from mrubens, cte and jr as code owners August 20, 2025 02:20

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Aug 20, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Aug 20, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Aug 20, 2025

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. bug Something isn't working labels Aug 20, 2025

roomote bot mentioned this pull request Aug 20, 2025

Slash command, In Chinese, the content cannot be read #7240

Open

roomote bot commented Aug 20, 2025

View reviewed changes

hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Aug 20, 2025

daniel-lxs moved this from Triage to PR [Needs Prelim Review] in Roo Code Roadmap Aug 20, 2025

hannesrudolph added PR - Needs Preliminary Review and removed Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. labels Aug 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: add Unicode support for file path mentions in slash commands #7241

fix: add Unicode support for file path mentions in slash commands #7241

Uh oh!

roomote bot commented Aug 20, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

roomote bot left a comment

Uh oh!

roomote bot Aug 20, 2025

Uh oh!

roomote bot Aug 20, 2025

Uh oh!

roomote bot Aug 20, 2025

Uh oh!

roomote bot Aug 20, 2025

Uh oh!

roomote bot Aug 20, 2025

Uh oh!

roomote bot Aug 20, 2025

Uh oh!

Uh oh!

		@@ -0,0 +1,81 @@
		import type { ModelInfo } from "../model.js"

		@@ -0,0 +1,19 @@
		import { type FeatherlessModelId, featherlessDefaultModelId, featherlessModels } from "@roo-code/types"

fix: add Unicode support for file path mentions in slash commands #7241

Are you sure you want to change the base?

fix: add Unicode support for file path mentions in slash commands #7241

Uh oh!

Conversation

roomote bot commented Aug 20, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Testing

Screenshots

Uh oh!

roomote bot left a comment

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

roomote bot commented Aug 20, 2025 •

edited by ellipsis-dev bot

Loading