Replies: 33 comments 59 replies
-
|
The ones from the handy-cli script seemed nice to me. The three remaining ones are:
|
-
|
As I suggested here, I would like to see these additional shortcuts:
This would allow anyone to simply do what they need to do without interacting with the user interface. |
-
|
This is sort of a bug report, though not for you Mac folks: on Windows, Ctrl+V fails in a lot of situations, but Shift+Insert works universally, for example in SSH terminals. |
-
|
It would be useful if users could assign custom keyboard shortcuts. After pressing a key and speaking, Handy would run a user-defined command, provide the spoken text as STDIN, and then insert the text from STDOUT at the cursor position. This would allow users to implement their own logic to process the input. For example, a user could use this to create a custom voice assistant using an LLM and MCP, leveraging the software's full potential. |
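The STDIN/STDOUT contract described above could be sketched as a small filter script. This is only an illustration of what a user-defined command might look like, assuming Handy passed the spoken text on STDIN and inserted whatever the command printed on STDOUT; the function names `transform` and `main` are made up for this example, and the tidy-up logic stands in for whatever the user wants (an LLM call, for instance).

```python
import sys
from typing import TextIO


def transform(spoken: str) -> str:
    """Hypothetical user logic: tidy up the transcribed text."""
    # Collapse runs of whitespace and trim the ends.
    text = " ".join(spoken.split())
    if not text:
        return ""
    # Capitalize the first character and make sure the sentence is terminated.
    text = text[0].upper() + text[1:]
    if text[-1] not in ".!?":
        text += "."
    return text


def main(stdin: TextIO, stdout: TextIO) -> None:
    """Read the spoken text from stdin and write the processed text to stdout."""
    stdout.write(transform(stdin.read()))


# To use this as a real filter, end the script with:
#     main(sys.stdin, sys.stdout)
```

Keeping the logic in `transform` and passing the streams into `main` makes the command easy to test without a microphone or the app itself.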
-
|
I would love to use mouse buttons, like Middle+Right click. |
-
Self-serve keybinding suggestion
How are you overriding global keybindings across the many actively used Linux environments? If you use XDG Global Shortcuts, that should work on recent Wayland versions of GNOME and KDE, but there are many other environments where it will not. It may be easier to offer an "outlet valve" in the form of a command that has the same effect as the keybinding. Then all Linux users can bind to that command, no matter what kind of environment they're running. The command would start/stop transcription; I don't think push-to-talk is viable this way. |
-
|
Ability to change language depending on the shortcut. |
-
|
Thanks a lot for Handy! (I discovered it this week.) |
-
1. Separate keyboard shortcuts
Add two different hotkeys:
This would allow instant switching based on context without toggling settings each time.
2. Multiple post-processing profiles (user-defined)
Let users create several post-processing profiles, for example:
3. Profile selection before / during / after recording
When starting a post-processed transcription, allow choosing which profile will be used. Possible UX:
This provides maximum convenience and avoids breaking the flow when switching contexts often. |
-
|
A shortcut to switch "Translate to English" on and off would be really useful. |
-
|
I would love a smart Push-to-Talk option, where it enables Push-to-Talk mode if the shortcut is held vs pressed. Something like:
|
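One way to read the held-versus-pressed idea is as a small state machine driven by key press and release timestamps. The sketch below is an assumption about the intended behavior, not Handy's implementation: the class name `SmartShortcut` and the 0.3 second threshold are invented for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SmartShortcut:
    """Hypothetical hold-vs-tap handling for a single recording shortcut."""

    hold_threshold_s: float = 0.3  # assumed cutoff between a tap and a hold
    recording: bool = False
    _pressed_at: Optional[float] = None
    _was_recording: bool = False

    def on_press(self, now: float) -> None:
        if self._pressed_at is not None:
            return  # ignore key auto-repeat while the key is already down
        self._pressed_at = now
        self._was_recording = self.recording
        # Start right away so no audio is lost while we wait to classify the press.
        self.recording = True

    def on_release(self, now: float) -> None:
        if self._pressed_at is None:
            return
        held_for = now - self._pressed_at
        self._pressed_at = None
        if held_for >= self.hold_threshold_s:
            # Long press: behave like push-to-talk and stop on release.
            self.recording = False
        elif self._was_recording:
            # Short tap while already recording: toggle off.
            self.recording = False
        # Otherwise a short tap just started a toggle-mode recording; keep going.
```

Recording starts on key-down in both cases, so the decision between the two modes only affects what happens on release.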
-
|
If it's an extra feature, I would love to be able to have a mix of two languages in a sentence. Otherwise, a keyboard shortcut to toggle between two most commonly used languages would be ideal. |
-
|
I'm using Handy on Pop!_OS with COSMIC (Wayland). Since Wayland doesn't allow apps to register global shortcuts the same way X11 does, I'm using the signal approach mentioned in the docs. For anyone else on COSMIC, you can add a global shortcut by editing:
{
    (
        modifiers: [
            Ctrl,
        ],
        key: "space",
        description: Some("Toggle Handy Recording"),
    ): Spawn("pkill -USR2 -xo handy"),
}
The toggle shortcut works great. However, I'd also like a way to cancel a recording without transcribing/pasting, similar to pressing Escape when the app is focused. Would it be possible to add another signal (e.g. SIGUSR1) for this? That would allow:
This is what I'd like to do in the future:
    (
        modifiers: [
            Ctrl,
            Shift,
        ],
        key: "space",
        description: Some("Cancel Handy Recording"),
    ): Spawn("pkill -USR1 -xo handy"),
Thank you for the very Handy app. |
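To make the request concrete, here is a rough Python sketch of how a process could treat the two signals differently: SIGUSR2 toggles recording (as it does today) and SIGUSR1 cancels without delivering anything. This is not Handy's code; `RecorderState`, `install_handlers`, and the placeholder transcript are hypothetical, and the SIGUSR1 behavior is the proposal from this comment, not an existing feature.

```python
import signal
from typing import List


class RecorderState:
    """Hypothetical recorder that reacts to toggle and cancel requests."""

    def __init__(self) -> None:
        self.recording = False
        self.delivered: List[str] = []  # transcripts that would be pasted

    def toggle(self) -> None:
        if self.recording:
            # Stopping normally: transcribe and deliver the text.
            self.recording = False
            self.delivered.append("<transcript>")  # placeholder for real output
        else:
            self.recording = True

    def cancel(self) -> None:
        # Stop without transcribing or pasting anything.
        self.recording = False


def install_handlers(state: RecorderState) -> None:
    """Wire the signals to the state (Unix only; SIGUSR1 as cancel is the proposal)."""
    signal.signal(signal.SIGUSR2, lambda signum, frame: state.toggle())
    signal.signal(signal.SIGUSR1, lambda signum, frame: state.cancel())
```

With handlers like these installed, `pkill -USR2 -xo handy` would start or finish a recording, and `pkill -USR1 -xo handy` would discard the one in progress.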
-
|
Please allow a modifier to be the shortcut key (without needing to install Karabiner or anything else). In my case, that would be the right Option key on macOS. |
-
|
What I would really like is a way to make it press Enter automatically after the transcription. The goal is for any LLM coding tool to start right away after transcribing, with no need for me to hit Enter. |
-
|
Would it be possible to add the ability to differentiate between left and right Cmd/Ctrl/Shift? I like to use my right Command key for transcription, but Handy does not distinguish between left and right the way another tool did. |
-
|
When changing the keyboard shortcut, it should not trigger. |
-
|
I want to press only one button (left Ctrl / right Ctrl / left Shift / right Shift / left Alt / right Alt) or one of the media buttons on the keyboard. |
-
|
Please consider adding an option to toggle Handy using a double press of a modifier key (for example Left Shift, Right Shift, Ctrl, or Alt). Behavior: Double tap -> start speech-to-text Benefits:
This would provide a simpler and more ergonomic way to control Handy. For example, the Spokenly app already uses this approach successfully. |
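Double-press detection usually comes down to comparing the time between two taps of the same key against a maximum gap. A minimal sketch, assuming a 0.4 second window and a hypothetical `DoubleTapDetector` class (neither comes from Handy or Spokenly):

```python
from typing import Optional


class DoubleTapDetector:
    """Hypothetical detector: reports when a modifier key is tapped twice quickly."""

    def __init__(self, max_gap_s: float = 0.4) -> None:
        self.max_gap_s = max_gap_s  # assumed maximum time between the two taps
        self._last_tap: Optional[float] = None

    def on_tap(self, now: float) -> bool:
        """Call on each key tap; returns True when this tap completes a double tap."""
        if self._last_tap is not None and now - self._last_tap <= self.max_gap_s:
            # Reset so that a third quick tap does not count as another double tap.
            self._last_tap = None
            return True
        self._last_tap = now
        return False
```

A caller would toggle speech-to-text whenever `on_tap` returns `True` and ignore single taps, so normal use of Shift, Ctrl, or Alt is unaffected.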
-
|
I'm not sure if this is a Tauri limitation, but on Windows there is no way to switch from the default CTRL+SPACE to CTRL+WIN(⊞). CTRL+SPACE is exactly the combination that VS Code uses for IntelliSense by default. |
-
|
I'm using F15 on my keyboards (both the ones with and without a numpad have it). But generally it would be great to have the same shortcut on any machine. For a couple of days I really liked using the right Control key, which I'm not using for anything else, but Handy doesn't distinguish between left and right Control. I saw the discussion about adding that and liked it. |
-
|
I'd like a keyboard shortcut to "paste last transcript". Right now, I have to move my hand to the mouse, navigate to the menu bar icon, click "copy last transcript", then press CMD+V to paste it. This also has the side effect of overwriting whatever is already in my clipboard. It'd be great if there were a keyboard shortcut like CMD+Option+V to paste the last transcript directly. |
-
|
Hey! I really love Handy, it's exactly what I was looking for, clean, offline, and works great. Quick question: is there a way to trigger recording from the command line or an API on Windows? I saw that Linux users can use SIGUSR2 signals, but I couldn't find anything similar for Windows. I'm trying to use an external Bluetooth button (or other devices) to control recording instead of keyboard shortcuts. The problem is that keyboard shortcuts sometimes fail in certain applications (they get intercepted or blocked), and I'd prefer a more reliable method. If this doesn't exist yet, would you consider adding it? Something like a simple HTTP endpoint on localhost or a CLI command would be perfect for controlling Handy from external scripts or devices. Thanks for the awesome work on this project! |
-
|
I'm using Handy on desktop. It's fantastic! Thanks for making an amazing product. One thing I sometimes notice is that after I press the key that starts recording, it can take a few hundred milliseconds for the app/widget to actually start, so the first few words can get missed. It could be that there is latency in the microphone (I use an Insta360 webcam with a speaker) or just a slow machine. This is a UX problem that could be solved in one or more ways, e.g.
Screen.Recording.2026-03-04.at.9.29.55.AM.mov
|
-
|
I've bought a small USB mic with a button on it for my Windows machine. It can be detected by Windows (sound no sound). |
-
Paste last transcript shortcut
Use case: I use Handy with push-to-talk while working in a terminal app (Claude Code). While I'm dictating, the terminal sometimes displays a permission prompt that steals focus. When I release push-to-talk, Handy transcribes correctly but can't paste: the active input has changed and the transcription is lost. My workflow would be:
This is different from clipboard handling — the transcript may have been transcribed fine, it just couldn't be delivered to the right place. A simple "paste last transcript" shortcut (e.g. This was previously raised in #240 and implemented in #659, but I think the concrete problem wasn't fully captured at the time. It's not about convenience — it's about recovering a transcription that was lost due to a focus change during dictation. |
-
|
First of all, I want to say that Handy is an absolutely fantastic application! It works incredibly well, and I use it a lot. I saw this thread and wanted to pitch in with an idea that builds a bit on what others have mentioned above.
Currently, I use a custom post-processing script triggered by a shortcut. It takes my transcribed Norwegian text and translates it into Swedish using a local LLM. While this works, I find the current user interface a bit limiting when it comes to managing and switching between different tasks dynamically.
My idea: Workflow "Profiles" with custom parameters
Crucially, each shortcut/profile shouldn't just select the Language Model and Post-Processing script, but it should also save specific parameters for both. For example, I might want one shortcut to use a specific LLM model for translation, and another shortcut to use a different LLM model for summarizing.
It is a bit hard to explain with words, so here is an example of how the UI and mapping could conceptually look:
(Note: The shortcuts above are just examples of mapping multiple keys to specific tasks). I fully understand that if a shortcut triggers a change in the primary language model, it might take some extra time to load the new model into memory before a new transcription can begin. I think users (including myself) are perfectly fine with this slight loading delay, as the massive benefit of having dedicated shortcuts with pre-configured parameters heavily outweighs it. Thanks for building and maintaining such a great tool! |
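As a rough illustration of the profile idea, the mapping from shortcut to model, post-processing step, and parameters could be modeled as plain data. Everything named here is hypothetical: the shortcuts, model names, script names, and parameter keys are placeholders for this sketch and do not correspond to Handy settings.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass(frozen=True)
class Profile:
    """Hypothetical workflow profile bound to one shortcut."""

    name: str
    transcription_model: str
    post_processing: Optional[str] = None  # script name; None means raw text
    params: Dict[str, str] = field(default_factory=dict)


# Placeholder mapping; real shortcuts and models would come from user settings.
PROFILES: Dict[str, Profile] = {
    "Ctrl+Alt+1": Profile("Plain dictation", "model-a"),
    "Ctrl+Alt+2": Profile(
        "Translate to Swedish", "model-a", "translate.sh",
        {"llm": "llm-x", "target_language": "sv"},
    ),
    "Ctrl+Alt+3": Profile(
        "Summarize", "model-b", "summarize.sh", {"llm": "llm-y"},
    ),
}


def profile_for(shortcut: str) -> Optional[Profile]:
    """Look up the profile bound to a shortcut, if any."""
    return PROFILES.get(shortcut)


def needs_model_reload(current_model: str, profile: Profile) -> bool:
    """True when switching to this profile means loading a different model."""
    return profile.transcription_model != current_model
```

A check like `needs_model_reload` is where the loading delay mentioned above would show up: switching between profiles that share a transcription model costs nothing, while changing the model triggers a reload.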
-
This is a thread for collecting ideas for shortcuts you would like in Handy that are not supported right now but may be supported in the future, once we can work them out in a unified way across the application, the code base, and the UI and user experience.