ASTRA 🎤✨

Your personal voice assistant that types for you — no internet required.

Press a hotkey, speak, and watch your words appear in any app. It's like magic, but it's just really good technology.

⬇️ Download for macOS

What's This Thing Do?

Two Hotkeys, Two Superpowers:
- Cmd+Shift+V → Quick voice-to-text (super fast ⚡)
- Cmd+Shift+O → Voice + screenshot OCR (context-aware, catches UI text, app names, buttons, etc.)
Smart Text Fixer → Auto-corrects grammar & punctuation using a tiny local AI model (0.5s, not 15s)
Native OCR → Apple Vision Framework reads your screen (zero extra RAM, zero cloud)
Offline & Private → Everything happens on your Mac. Your voice never leaves your computer.
Auto-Paste → Types the transcribed text directly into whatever app you're using
Floating UI → Minimal recording window that plays nice with fullscreen apps

Quick Start

1️⃣ Download & Install

Go to Releases and grab:
- Apple Silicon Mac (M1/M2/M3/M4): ASTRA-x.x.x-arm64.dmg
- Intel Mac: ASTRA-x.x.x.dmg
Drag to Applications folder
First time? macOS may block the app because this project is open source and not Apple-notarized. To open it:
- Try right-clicking ASTRA in Applications and choose Open
- If macOS still blocks it, go to System Settings → Privacy & Security
- Scroll to Security and click Open Anyway for ASTRA
- Launch ASTRA again

2️⃣ First Launch

On first run, the app will:

Ask permission for microphone 🎙️
Ask permission for Accessibility so it can paste text and read selected text
Download a speech model (~400MB, one-time)
Optionally install Ollama for text polishing (we'll cover this below)

About macOS Warnings

ASTRA's downloadable builds are currently ad-hoc signed, not Apple-notarized. This means macOS may show a warning on first launch. The source code is public, and technical users can build it themselves if they prefer.

To verify a downloaded DMG, compare its SHA256 checksum with the checksums published in RELEASE_CHECKSUMS.txt:

shasum -a 256 ASTRA-2.0.0-arm64.dmg

3️⃣ Start Using

What You Want	Press	What Happens
Quick voice note	`Cmd+Shift+V`	Speak → Stop → Text appears in your app
Voice + screen context	`Cmd+Shift+O`	Speak → Screenshot → OCR context → Text appears

Tip: Press Escape during recording to cancel.

The "Smart Polish" Feature (Optional but 🔥)

Want your transcribed text to be grammar-perfect? Install Ollama — a tiny local AI that runs on your Mac.

Setup Ollama

# Install (if you haven't)
brew install ollama

# Start it (runs in background)
ollama serve

# Pull our recommended model (397MB, ~0.5s per polish)
ollama pull qwen2.5:0.5b

That's it! The app automatically connects to Ollama and polishes your text.

Why qwen2.5:0.5b?

Model	Size	Speed	Verdict
qwen2.5:0.5b	397MB	~0.5s	⭐ Perfect
qwen2.5:1.5b	1GB	~1s	Good
phi4-mini	2.5GB	~9s	Too slow
gemma4	7GB	~15s	Thinking mode = unusable

If you want to try other models, just run ollama pull <model-name> and change it in Settings.

Settings & Config

Click the Tray Icon (in menu bar) → Settings to customize:

Option	What It Does
Hotkeys	Change `Cmd+Shift+V` / `Cmd+Shift+O` to whatever you like
Auto-Paste	Toggle automatic typing into your active app
Polish Mode	Turn AI text fixing on/off
Ollama URL	Usually `http://localhost:11434` (don't change unless you know what you're doing)

Under the Hood

Your Voice → Recorded via macOS microphone
Whisper.cpp → Transcribes locally (no cloud, complete privacy)
Ollama (optional) → Fixes grammar/punctuation in ~0.5s
Auto-Type → Uses Mac's accessibility APIs to type into your active app

For Vision Mode (Cmd+Shift+O):

Screenshot → Captured via screencapture
Apple Vision Framework → Extracts text from screen (native, zero RAM)
LLM Context → Uses extracted text to correctly handle UI elements, button names, etc.

Troubleshooting

"Whisper not found" or "Library not loaded"

Make sure you have whisper-cli in your PATH, or use the pre-built DMG which includes it.

Ollama not responding

Run ollama serve in Terminal
Check Settings → Ollama URL is http://localhost:11434
Run ollama list to see available models

Hotkeys not working?

Go to System Settings → Privacy & Security → Accessibility and enable ASTRA

Still stuck?

Open an issue — we'll help!

Build from Source (For Developers)

git clone https://github.com/amateur-dev/local-hotkey-voice-mac-app.git
cd local-hotkey-voice-mac-app
npm install
npm start

Requirements:

Node.js 18+
FFmpeg (brew install ffmpeg)
whisper-cli in PATH

Tech Stack

Electron — Desktop app framework
Whisper.cpp — Local speech-to-text (OpenAI's Whisper, but faster)
Ollama — Local LLM with MLX optimization for Apple Silicon (2-4x faster on M1/M2/M3/M4)
Apple Vision Framework — Native OCR (zero RAM)
Node.js — Backend magic

Name		Name	Last commit message	Last commit date
Latest commit History 137 Commits
.github/workflows		.github/workflows
build		build
docs		docs
scripts		scripts
src		src
test		test
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LLM_SETUP.md		LLM_SETUP.md
README.md		README.md
RELEASE_CHECKSUMS.txt		RELEASE_CHECKSUMS.txt
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASTRA 🎤✨

What's This Thing Do?

Quick Start

1️⃣ Download & Install

2️⃣ First Launch

About macOS Warnings

3️⃣ Start Using

The "Smart Polish" Feature (Optional but 🔥)

Setup Ollama

Why qwen2.5:0.5b?

Settings & Config

Under the Hood

Troubleshooting

"Whisper not found" or "Library not loaded"

Ollama not responding

Hotkeys not working?

Still stuck?

Build from Source (For Developers)

Tech Stack

Like This? ❤️

Author

About

Uh oh!

Releases 4

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ASTRA 🎤✨

What's This Thing Do?

Quick Start

1️⃣ Download & Install

2️⃣ First Launch

About macOS Warnings

3️⃣ Start Using

The "Smart Polish" Feature (Optional but 🔥)

Setup Ollama

Why qwen2.5:0.5b?

Settings & Config

Under the Hood

Troubleshooting

"Whisper not found" or "Library not loaded"

Ollama not responding

Hotkeys not working?

Still stuck?

Build from Source (For Developers)

Tech Stack

Like This? ❤️

Author

About

Topics

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages