
Add MCP support to command mode#290

Open
grohith327 wants to merge 23 commits into main from mcp-support

Conversation

@grohith327
Collaborator

Description

This PR adds a number of features to support MCP in command mode execution. The features are:

  • Add separate strategies for routing Anthropic requests and OpenAI-compatible requests.
    • If the model provider is Anthropic, the Messages API is used. Otherwise the Responses API is preferred, with Chat Completions as the fallback.
  • Minor updates to the command mode chat UI: moved the buttons around a little.
  • Added a mechanism to define the MCP config and edit it from the FluidVoice app.
  • Fixed some issues with how tool calls are rendered in the chat.
  • Once an action completes, if the user's active display has a notch, a compact Dynamic Island notification is shown for success; otherwise a system notification is sent.
  • Added Markdown syntax rendering in the command mode chat.

Type of Change

  • 🐞 Bug fix (non-breaking change which fixes an issue)
  • ✨ New feature (non-breaking change which adds functionality)
  • 💥 Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • 📝 Documentation update

Related Issues

Testing

  • Tested on Apple Silicon Mac
  • Tested on macOS [version]
  • Ran linter locally: swiftlint --strict --config .swiftlint.yml Sources
  • Ran formatter locally: swiftformat --config .swiftformat Sources
  • Built locally: sh build_incremental.sh

Notes

  • Due to the size of the change, I'd recommend pulling the code and testing it once.
  • Do a beta release first.

grohith327 and others added 22 commits April 14, 2026 16:29
The Ollama docs at github.com/ollama/ollama/blob/main/docs/openai.md
now 404 — the OpenAI-compatibility doc was moved to
docs/api/openai-compatibility.mdx and published at
https://docs.ollama.com/api/openai-compatibility. Point the Setup
Guide link there so users can actually read the docs.
Sub-1s recordings were silently returning an empty string, so short
utterances like 'yes', 'no', or 'stop' never made it into the transcript
and the user had no indication anything went wrong.

The original guard existed to avoid a whisper.cpp assertion on buffers
shorter than 1s, but it applied unconditionally — including to Parakeet,
Apple Speech, and Cohere which have no such constraint. Rather than
branch per provider, pad the buffer with trailing silence on the way
in: whisper.cpp no longer asserts, every other provider just sees a
moment of silence after the speech.

An empty buffer (no audio at all) is still an early-return, since
padding zero samples wouldn't help.

Closes #276
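The padding approach this commit describes can be sketched as follows. This is a minimal illustration, not the actual FluidVoice code; the function name and parameters are hypothetical, and it assumes 16 kHz mono Float samples, which is what whisper.cpp expects.

```swift
import Foundation

// Sketch of the fix described above: instead of dropping sub-1s recordings,
// pad them with trailing silence so whisper.cpp's minimum-buffer assertion
// never fires, and other providers just hear a quiet tail.
func padToMinimumDuration(
    _ samples: [Float],
    sampleRate: Int = 16_000,
    minimumSeconds: Double = 1.0
) -> [Float] {
    // No audio at all: padding zero samples would not help, keep early return.
    guard !samples.isEmpty else { return samples }

    let minimumSamples = Int(Double(sampleRate) * minimumSeconds)
    guard samples.count < minimumSamples else { return samples }

    // Append trailing silence up to the 1-second floor.
    return samples + [Float](repeating: 0, count: minimumSamples - samples.count)
}
```

This keeps the behavior uniform across providers instead of branching per provider, matching the reasoning in the commit message.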
Claude Opus 4.7 uses extended thinking by default and rejects the
`temperature` parameter. Sending it causes every request to fail with
HTTP 400: `temperature is deprecated for this model`.

Add `SettingsStore.isTemperatureUnsupported(_:)` that covers both
reasoning models (o1/o3/gpt-5/...) and Claude Opus 4.7+, and use it at
each live call site (ContentView, CommandModeService, RewriteModeService)
to gate the temperature parameter.

`isReasoningModel` still gates `max_completion_tokens` / reasoning token
budgets, which remain OpenAI-specific.

Fixes #285
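A gating helper along the lines this commit describes might look like the sketch below. It is illustrative only: the real `SettingsStore.isTemperatureUnsupported(_:)` may match model identifiers differently, and the Claude model-id spellings checked here are assumptions.

```swift
// Hedged sketch of the temperature gate described above. Matching is by
// model-id prefix/substring; the exact rules in SettingsStore may differ.
func isTemperatureUnsupported(_ model: String) -> Bool {
    let id = model.lowercased()
    // OpenAI reasoning models (o1/o3/gpt-5/...) reject `temperature`.
    let reasoningPrefixes = ["o1", "o3", "gpt-5"]
    if reasoningPrefixes.contains(where: { id.hasPrefix($0) }) {
        return true
    }
    // Claude Opus 4.7 uses extended thinking by default and also rejects it.
    if id.contains("claude-opus-4-7") || id.contains("claude-opus-4.7") {
        return true
    }
    return false
}
```

Each live call site would then wrap the parameter, e.g. `if !isTemperatureUnsupported(model) { body["temperature"] = temperature }`, leaving `isReasoningModel` to gate the OpenAI-specific `max_completion_tokens` budget as the commit notes.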
Adds an abstraction to choose a provider strategy based on the model provider: for Anthropic we use their Messages API; for others we start with the Responses API and fall back to the Chat Completions API.
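The strategy selection described in that commit could be sketched like this. Type and function names here are hypothetical, not the PR's actual API; the point is only the primary/fallback routing shape.

```swift
// Illustrative sketch of the provider-routing abstraction: Anthropic gets
// its Messages API with no fallback; everyone else tries /responses first
// and can fall back to /chat/completions.
enum RequestFormat { case anthropicMessages, responses, chatCompletions }

struct RoutePlan {
    let primary: RequestFormat
    let fallback: RequestFormat?
}

func makeRoutePlan(forProvider provider: String) -> RoutePlan {
    if provider.lowercased() == "anthropic" {
        // Anthropic only speaks its Messages API.
        return RoutePlan(primary: .anthropicMessages, fallback: nil)
    }
    // OpenAI-compatible providers: prefer the Responses API, fall back
    // to Chat Completions when the endpoint does not support it.
    return RoutePlan(primary: .responses, fallback: .chatCompletions)
}
```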

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d8de00483b

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Comment on lines 22 to +24

    -    "type": "function",
    -    "function": [
    -        "name": "execute_terminal_command",
    -        "description": """
    +    "name": "execute_terminal_command",
    +    "description": """


P1 Badge Keep chat tool definitions in function wrapper

This tool schema was flattened to top-level name/description/parameters, but the chat-completions path still sends config.tools as-is, and chat-completions expects each tool under {"type":"function","function":{...}}. When /responses is unavailable and the client falls back to /chat/completions, command-mode tool calls can be rejected or ignored, so terminal/MCP execution stops working on chat-only OpenAI-compatible endpoints.

Useful? React with 👍 / 👎.
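The fix this review implies can be sketched as a small re-wrapping step applied to `config.tools` before a chat-completions fallback request. The dictionary shapes and function name below are illustrative assumptions, not the PR's code.

```swift
import Foundation

// Sketch: when falling back from /responses to /chat/completions, re-wrap
// each flattened Responses-style tool (top-level name/description/parameters)
// back under {"type": "function", "function": {...}} as chat-completions
// expects.
func wrapForChatCompletions(_ tools: [[String: Any]]) -> [[String: Any]] {
    tools.map { tool in
        // Already in the chat-completions shape: leave it untouched.
        if tool["function"] != nil { return tool }
        var fn = tool
        fn.removeValue(forKey: "type")
        return ["type": "function", "function": fn]
    }
}
```

Keeping this conversion at the fallback boundary means the Responses path can stay flattened while chat-only endpoints still receive valid tool definitions.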

Comment on lines +580 to 584
} else if endpoint.contains("/responses") || endpoint.contains("/chat/completions") || endpoint.contains("/api/chat") || endpoint.contains("/api/generate") {
fullURL = endpoint
} else {
fullURL = endpoint + "/chat/completions"
fullURL = endpoint + "/responses"
}


P2 Badge Retry API verification on chat-completions endpoint

The connection test now defaults non-Anthropic providers to .../responses and treats failures as hard errors, but there is no fallback to .../chat/completions. For providers that are OpenAI-compatible but chat-only, verification will report failure even though runtime requests can still work via chat-completions, creating false negatives in settings and blocking a reliable “verify connection” flow.

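The retry the review asks for could look like the sketch below: probe `/responses` first and, on failure, probe `/chat/completions` before reporting a hard error. `probe` is a hypothetical closure standing in for one synchronous test request; the real verification flow is presumably asynchronous.

```swift
import Foundation

// Sketch: connection verification that mirrors the runtime fallback.
// A chat-only OpenAI-compatible provider fails the /responses probe but
// should still verify successfully via /chat/completions.
func verifyConnection(
    baseEndpoint: String,
    probe: (String) -> Bool
) -> Bool {
    if probe(baseEndpoint + "/responses") {
        return true
    }
    // Retry on the chat-completions endpoint before declaring failure,
    // avoiding false negatives in the settings "verify connection" flow.
    return probe(baseEndpoint + "/chat/completions")
}
```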


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: bf353d4ff6


Comment on lines +309 to +311
    where !attemptedResponsesFallback &&
        self.isResponsesRequest(currentRequest) &&
        routePlan.fallbackFormat != nil &&


P1 Badge Allow fallback for responses payloads on chat-style endpoints

executeWithRetry only triggers the Responses→chat-completions fallback when isResponsesRequest(currentRequest) is true, but this predicate is path-based. In the same change, ResponsesRouteStrategy.endpoint intentionally keeps /api/chat and /api/generate unchanged, so those requests are sent with Responses-format bodies but are not recognized as “responses requests” here. On chat-only OpenAI-compatible providers, a 400/404 from that first attempt will now fail the turn instead of retrying with chat-completions, breaking the intended compatibility flow.

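The fix this review points toward is to classify a request by the payload format recorded when its body was built, rather than by URL path. The types below are illustrative stand-ins for whatever the PR's route plan actually carries.

```swift
import Foundation

// Sketch: a format-based fallback predicate. Requests to /api/chat or
// /api/generate that carry Responses-format bodies still qualify for the
// Responses -> chat-completions fallback, which a path-based
// isResponsesRequest(_:) check would miss.
enum PayloadFormat { case responses, chatCompletions }

struct RoutedRequest {
    let url: String
    let format: PayloadFormat   // recorded when the body was serialized
}

func shouldAttemptChatCompletionsFallback(
    _ request: RoutedRequest,
    attemptedFallback: Bool,
    hasFallbackFormat: Bool
) -> Bool {
    // Check the recorded format, not the endpoint path, so every endpoint
    // the route strategy produces is covered.
    return !attemptedFallback && hasFallbackFormat && request.format == .responses
}
```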

Comment on lines +505 to +507
    while usedToolNames.contains(candidate) {
        candidate = Self.sanitizeToolName("\(base)_\(counter)")
        counter += 1


P2 Badge Avoid non-terminating dedupe loop for long MCP tool names

This deduplication loop can become non-terminating when the sanitized base name is already 64 characters. sanitizeToolName truncates to 64 chars, so "\(base)_2", "\(base)_3", etc. collapse back to the same truncated string, candidate never changes, and while usedToolNames.contains(candidate) never exits. A server set with two colliding long tool names would hang MCP tool catalog rebuild/reload.

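A terminating version of the dedupe loop can reserve room for the numeric suffix inside the 64-character limit, so `"\(base)_\(n)"` can never collapse back to the same truncated string. The sketch below uses its own simplified `sanitizeToolName`; the PR's sanitizer presumably does more than replace non-alphanumerics.

```swift
// Sketch: dedupe that truncates the base before appending the suffix,
// guaranteeing each iteration yields a genuinely new 64-char candidate.
func sanitizeToolName(_ name: String, maxLength: Int = 64) -> String {
    let cleaned = name.map { $0.isLetter || $0.isNumber ? $0 : Character("_") }
    return String(String(cleaned).prefix(maxLength))
}

func dedupedToolName(_ raw: String, used: inout Set<String>) -> String {
    var candidate = sanitizeToolName(raw)
    var counter = 2
    while used.contains(candidate) {
        let suffix = "_\(counter)"
        // Shrink the base so base + suffix fits in 64 chars; truncating
        // after appending (as in the flagged code) erases the suffix and
        // loops forever on 64-char base names.
        let base = String(sanitizeToolName(raw).prefix(64 - suffix.count))
        candidate = base + suffix
        counter += 1
    }
    used.insert(candidate)
    return candidate
}
```

With two colliding long tool names, the second resolves to a 62-character base plus `_2` instead of hanging the MCP tool catalog rebuild.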



Development

Successfully merging this pull request may close these issues.

[✨ FEATURE] MCP server support in Command Mode

2 participants