Skip to content

Conversation

ronb1964
Copy link

TalkType is a privacy-focused speech recognition application for Linux Wayland systems.

Key Features

  • Privacy-first: All processing happens locally using OpenAI's Faster-Whisper AI
  • Press-and-hold dictation: Default F8 hotkey (configurable)
  • Intelligent punctuation: Voice commands like "period", "comma", "new paragraph"
  • Auto-punctuation and auto-spacing: Smart text formatting
  • CPU and GPU support: Automatic CUDA detection and on-demand library download
  • Multiple model sizes: tiny, small, medium, large (1.5GB model for best accuracy)
  • System tray integration: Easy access to preferences and settings

Technical Details

  • Built for: Ubuntu 22.04+ (glibc 2.35)
  • Size: 892MB (optimized, under 1GB)
  • License: MIT
  • Desktop ID: io.github.ronb1964.TalkType
  • AppStream metadata: Included and validated

Testing Verification

  • ✅ Tested on Nobara Linux (Fedora-based) with Wayland
  • ✅ CPU mode with small model - working
  • ✅ GPU mode with large model - working
  • ✅ CUDA library auto-download - working
  • ✅ All voice commands functional
  • ✅ "new paragraph" command - fixed and working perfectly

What's Included

The AppImage is completely self-contained:

  • Python 3.11 runtime with full standard library
  • PyTorch with CUDA support (CUDA libraries downloaded on-demand)
  • faster-whisper AI model (downloaded on first run, multiple sizes available)
  • ydotool for Wayland text injection
  • GTK3 system tray interface

Download

AppImage: https://github.com/ronb1964/TalkType/releases/download/v0.3.7/TalkType-v0.3.7-x86_64.AppImage

AppImage submission for TalkType voice dictation app
TalkType is a privacy-focused speech recognition application for Linux Wayland systems.

Features:
- Privacy-first: All processing happens locally using OpenAI's Faster-Whisper AI
- Press-and-hold dictation with configurable hotkey (default F8)
- Intelligent punctuation and voice commands
- Auto-punctuation and auto-spacing
- CPU and GPU support with automatic CUDA detection
- Multiple model sizes (tiny, small, medium, large)
- System tray integration

Technical Details:
- Built for Ubuntu 22.04+ (glibc 2.35)
- Size: 892MB (optimized, under 1GB)
- License: MIT
- Desktop ID: io.github.ronb1964.TalkType
- AppStream metadata included and validated

Tested on Nobara Linux (Fedora-based) with Wayland.
Download: https://github.com/ronb1964/TalkType/releases/download/v0.3.7/TalkType-v0.3.7-x86_64.AppImage
@ronb1964
Copy link
Author

Closing this PR to submit a fresh one with the newly built AppImage that has all GTK dependencies properly bundled.

@ronb1964 ronb1964 closed this Oct 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant