Coming Up

v1.0.0-rc.4

Released Apr 30, 2026

Download v1.0.0-rc.4

This release candidate is a substantial step up in voice-based identity, diarization accuracy, and meeting-assistant quality. The (You) badge is now gated on voice match and survives across meetings via stable profile-UUID colors, with tap-to-rename badges, an inline "Is this you?" pill, and name edits that cascade across history. Speaker recognition uses multi-centroid profiles for cold-start matching (−42% relative EER at 60 seconds of voice), and a new acoustic boundary-snap pass tightened diarization edges by another −0.76pp DER on top of a re-shipped attribution prompt.

Briefings and summaries got tighter prompts and stronger guardrails: TALKING POINTS is slimmer, USER_CONTEXT no longer bleeds product nouns into off-topic meetings, action-item due dates resolve against meeting timezone with a dual-clock display, and past-due items are preserved across refreshes. Dictation moved to a more interpretive prompt validated against a 37-case benchmark, and the vocabulary system now phonetically gates spelling hints (Double Metaphone), supports n-gram compound terms, and classifies user-supplied terms by domain so non-tech vocabularies aren't biased toward developer jargon.

File imports gained drag-drop, Open With support, and in-flight progress UX. Enrichment failures now surface a reason and a raw-transcript banner instead of failing silently — including the Gemini SAFETY block path, which was previously masked by content filters. The Pro tier is renamed to Unlimited everywhere.

Other improvements:

  • History view collapses to two filters (Meetings / Dictation & Instructions); "Use as Context" removed from the multi-select bar; copy-transcript button at bottom of detail
  • Status panel shows an empty-state hotkey card and auto-shrinks
  • Single-level undo for voice-driven meeting corrections (Cmd+Z)
  • Meeting Q&A: per-row copy / revert / delete actions; past corrections shown inline in the history-panel summary; classifier now extracts speaker-rename pairs cleanly
  • Overlay: skeleton loader for the first briefing, peek briefing without stealing focus, first-run empty-state card replaces the coaching popover, transform-independent toggle, redesigned waveform, press feedback on meeting-hub cards
  • Transcript and summary panels rendered with vendored Textual for cross-paragraph selection
  • Network calls now use jittered exponential backoff
  • Voice profile saved before renaming segments so the green check lights up reliably
  • Speakers in file imports now get voice embeddings; profile centroids un-freeze and recompute from samples; merging two profiles preserves embeddings
  • Outlier voice samples are trimmed before centroid averaging
  • Three-level speaker identity in the database with proper merge cascading and orphan sweep
  • Website: GDPR & CCPA trust link under the hero CTA, inline monthly/yearly billing toggle on the pricing card, copy refresh ("speaker separation" instead of "diarization", "Library" instead of "Knowledge Base")

Bug fixes:

  • Cmd+D delivery is reliable across prewarm, escape cancellation, and stop-during-startup races; "Listening…" holds through finalize and cancels cleanly on view teardown
  • Gesture state resets when recording ends externally
  • Meeting hub no longer claims "Processing" when a meeting is only queued
  • Briefing pinned at first streaming chunk to stop the floor from jumping
  • Overlay size jumps eliminated on start-meeting bootstrap and briefing refresh
  • Hub reopen no longer flashes; template chip first-click is reliable
  • Meeting corrections preserve verified speakers and dim the summary during regen
  • Empty LLM-output segments are dropped during fusion; orphan meeting_speakers are swept
  • Fallback speaker labels shortened to "Mic N" / "Remote N"; falls back to "Speaker N" when display_name resolves empty
  • Email correspondence in Insert mode now gets paragraph breaks
  • Floating panel toolbar view tree stays stable so the hub→summary morph fires
  • Meeting-hold pill aligned with the waveform capsule
  • "Reveal in History" wired in the unified overlay summary
  • Brighter assistant loading labels for legibility
  • Audio level overlay hidden on every recording cleanup path
  • Persistent billing gate banner with rate-limit attribution

v1.0.0-rc.3

Released Apr 21, 2026

The meeting overlay got a significant polish pass. The briefing panel now streams in as it generates and auto-refreshes when you pause after new speech, so the view stays current without you reaching for the refresh hotkey. The whole panel is draggable, speaker colors are unique per speaker (blue reserved for You), and chrome across every window now shares a unified look. Transform mode gained writing-sample support so it can better match your voice, plus anti-injection guardrails that prevent your source text from being treated as instructions.

Other improvements:

  • Briefing preserves durable facts in the summary across refreshes instead of rewriting it each time
  • Briefing template quotes captured questions verbatim
  • New setting to disable briefing auto-refresh if you prefer manual control
  • Settings > Dictation gained a short intro; meeting shortcut descriptions clarify global vs window scope
  • Billing migrated to Stripe; referral program added for existing customers

Bug fixes:

  • Audio-level capsule follows your active app window after the overlay resigns key
  • Floating panel no longer swallows global keystrokes or gets stomped by alt-tab
  • Click-anywhere-to-key restored on the floating panel
  • Overlay follows to whichever screen you interact with it on
  • Meeting assistant speech tracking uses sustained VAD state (fewer false triggers)
  • Concurrent live-transcript requests are deduped instead of racing

v1.0.0-rc.2

Released Apr 18, 2026

This release candidate focuses on reliability and responsiveness in long meetings. Gemini responses now stream end-to-end for attribution and summarization, eliminating Cloudflare timeouts that previously hit 90+ minute recordings. Diarization accuracy improves with word-cut boundary repair and re-tuned segmentation — measured +2.1pp speaker attribution on AMI.

The overlay gained a substantial stability pass: in-flight meetings can be reopened from the Hub while still processing, the assistant↔hub transition recovers cleanly from start errors, the summary view transitions correctly on stop, and the end-meeting flow is unified across live and deferred paths. Processing status throughout the UI is now truthful — the perpetual "Processing" badge has been replaced with actionable failure states, and spinners no longer appear when nothing is active.

Per-meeting summary templates (Discovery, Sales, Interview, Standup, Customer, Presenting) now ship with tuned instructions honed against an 18-scenario benchmark. MCP agents can stage a template before a meeting via set_meeting_context with template_id, or discover options with the new list_templates tool.

Briefing and summary prompts have been re-tuned from A/B data to improve requirements gathering, resurfacing of prepared context, and section routing. Transcript history search now supports voice input for follow-up queries, with skeleton loaders and unified result styling.

Other improvements:

  • Streaming Gemini calls for long attribution and summary prompts
  • Word-cut boundary repair in diarization; step=0.16, attribution temperature tuned to 0.1
  • Per-template summary instructions for all six built-in meeting types
  • MCP list_templates tool and template_id parameter on set_meeting_context
  • Voice input for transcript history search follow-ups
  • Copied meeting summaries now include user notes
  • macOS 15.0 set as the official minimum deployment target
  • New cost-comparison benchmark page on the website

Bug fixes:

  • Perpetual "Processing" badge replaced with actionable failure states
  • Spinners no longer shown when enrichment is idle
  • Meetings still enriching can be reopened from the Hub
  • Assistant and Hub transitions recover from start-error states
  • Summary view transitions correctly after stopping a meeting
  • Briefing state wiped on meeting stop, not only on start
  • Proxy allowlist now matches camelCase :streamGenerateContent
  • Explicit transcript content preserved during refinement safety passes
  • No-speech meetings handled consistently across live and deferred paths
  • Billing gating no longer triggers during upgrades or transient errors
  • Assistant header drag works on first click (SwiftUI gesture)
  • Speech-pace alert sensitivity reduced to cut false positives
  • Enrichment badges use larger caption font for readability

v1.0.0-rc.1

Released Apr 14, 2026

This release candidate sharpens the meeting assistant around discovery and requirements gathering, with substantial prompt and context work to reduce hallucinations, improve resurfacing of prepared context, and keep pace in long meetings through preemptive transcript compaction. A new Presenting template lets the assistant track coverage during demos and walkthroughs instead of nudging goals.

Context retrieval has been rebuilt on MiniLM embeddings with conversational search-phrase chunking, multi-query retrieval, and paragraph-split indexing — substantially improving recall on prepared reference documents and past meetings. Reference documents now accept a wider range of formats via drag-and-drop, including images through Gemini OCR.

The meeting Q&A experience now supports three distinct intents: questions answered against the transcript, user notes captured inline, and corrections that regenerate the summary in place. Conversations persist across reopens, user notes survive summary regeneration, and the classifier routes each turn without the main LLM having to guess.

Meeting recording is more resilient: state is preserved across quit, sleep, and crashes; billing credits are no longer rolled back on transient refinement errors; and code-switching is supported in primarily-English meetings. Silero VAD is now used for device-independent speech detection, powering the refresh badge and idle monitor.

Other improvements:

  • New dedicated hotkey to refresh the meeting assistant
  • Insert-mode history shows the original transcript alongside the refined text
  • Dual-channel diarization runs concurrently for faster meeting finalization
  • Meeting summaries gain a unified glass overflow menu and smoother processing-to-summary transitions
  • Onboarding and settings panels adopt macOS Tahoe glass styling throughout
  • Gemini proxy hardened with model allowlist, device ID validation, and a licensed-user monthly cap
  • Prompt injection mitigations added across all LLM prompt sites
  • Action items and summary prompts no longer extract from user/reference context
  • Speaker badges show consistent colors across overlay, history, and search

Bug fixes:

  • Bare-key global hotkeys no longer swallow system-wide input
  • Meeting assistant hotkey no longer gets permanently stuck
  • Per-meeting reference docs now display correctly instead of the current global state
  • Stale LLM stats cleared on metadata and correction turns
  • Briefing no longer resolves relative dates (keeps ISO in action items)
  • RAG pipeline triggers for MCP-set context and during-recording edits
  • Escape and Cmd+W dismiss all overlay modes
  • Accessibility permission state refreshes in the dictation UI
  • Overlay header drag works on first mouse click
  • First heading no longer shows a redundant divider

v1.0.0-beta.8

Released Apr 3, 2026

This release introduces the Meeting Hub — a redesigned overlay with one-tap meeting controls, live VU meters, and integrated meeting prep. The entire UI has been refreshed with macOS Tahoe-inspired glass styling, bringing a modern look to overlays, toolbars, and interactive elements.

Meeting assistant now auto-refreshes intelligently based on conversation activity, and briefings persist across dismiss/reopen cycles. Idle recording detection prompts you when no speech is detected, helping avoid forgotten recordings. Instruction mode has been streamlined to direct-paste with session history.

For privacy-conscious users, Local Mode (previously Offline Mode) is more prominent, and a new Bring Your Own Key option lets licensed users connect their own Gemini API key. The privacy page has been overhauled with clearer language around permissions and data handling, and the Gemini proxy is now open source.

Other improvements:

  • Unified voice typing quota for dictation and transform modes
  • Raised free-tier limits for beta
  • Reference documents now route through the embedding pipeline for better retrieval
  • Single Gemini call for context processing with concurrent chunk handling
  • Speaker attribution uses acoustic hints and conservative bias for better accuracy
  • Echo leakage detection in diarization

Bug fixes:

  • Fixed embedding model load race when multiple requests arrive simultaneously
  • Empty meetings no longer saved to database
  • Q&A responses now route to the correct view model
  • Fixed diarization crash on duplicate speaker mapping keys
  • Fixed instruction mode overlay race conditions during deferred upgrade
  • Preserved contractions in dictation refinement
  • Reduced over-eager vocabulary injection in refinement prompts

v1.0.0-beta.7

Released Mar 27, 2026

The meeting assistant now surfaces interpersonal awareness more accurately. Acoustic signal annotations — overlapping speech, laughter, crosstalk, and similar events — are detected in the transcript and sent to Gemini alongside the spoken content. This gives the assistant richer context for understanding conversation dynamics, not just what was said.

Onboarding has been redesigned with a hold-to-practice interaction and a coaching overlay that guides new users through their first recording. The meeting assistant overlay received a round of UI polish: no more width jumps on load, smoother transcript transitions, and clear empty states when no speech is detected.

Other improvements:

  • AI agents can now manage reference documents via the new set_context_source MCP tool — add, update, or remove context that informs your meetings
  • Meeting assistant transcript auto-refreshes in offline and billing-blocked modes
  • File imports now go through the full speaker attribution pipeline
  • Meeting summaries lead with decisions and use a bullet-only format for faster scanning

Bug fixes:

  • Hotkey monitors re-register correctly after a binding change
  • Microphone permission prompt no longer appears prematurely on first launch
  • Dictation enrichment timeout increased to 3s to reduce premature cutoffs
  • Meetings with no detected speech show a clear empty state
  • Fixed checkout API token mismatch and license key selection

v1.0.0-beta.6

Released Mar 27, 2026

Add reference documents to your meetings — upload agendas, briefs, or prior notes and they're chunked, embedded on-device, and retrieved during the meeting to give your assistant grounded context. Indicators show when documents are loaded and a test mode verifies retrieval quality.

MimicScribe now includes an MCP server that connects your AI coding agent to your meeting history. Eight tools — search meetings, get transcripts and summaries, query action items, set meeting context, manage reference documents, and import audio files — work with Claude Code, Cursor, Zed, and other MCP-compatible clients. The server sends real-time notifications when meetings are enriched, imported, deleted, or merged. (#27)

Batch file import lets you queue audio and video files for transcription with speaker diarization — drag files in, add them from the context menu, or use the MCP import_audio_file tool. A new queue display tracks processing status.

Meetings can now pull attendee names from your calendar automatically, and a new per-meeting Offline Mode disables all cloud AI while keeping on-device transcription running — you can retroactively generate a summary later. (#31)

Other improvements:

  • Unified meeting overlays into a single panel with Control+Space toggle
  • Meeting assistant auto-refreshes its briefing when a pause in conversation is detected (#23)
  • Custom vocabulary for specialized words and names that ASR and LLM should recognize (#22)
  • Inverse text normalization — numbers, dates, phone numbers, and times are auto-formatted in transcriptions (#24)
  • Configurable dictation timeout with raw text fallback (default 2s)
  • Double-tap recall now works in dictation mode (reopens last result)
  • Cmd+V paste from instruction overlay, mouse-cursor screen selection
  • Instruction mode matches the input format instead of defaulting to Markdown
  • Cmd+W closes all overlay windows
  • Error sounds and visual feedback for dictation/instruction failures
  • Accessibility permission is now optional (only required for voice typing)
  • Daily billing caps for assistant and follow-up queries (10/day), file transcription cap, briefings counter in free-tier panel
  • Two-column history panel with speaker colors

Bug fixes:

  • Fixed hotkeys not working after onboarding completion (#15)
  • Fixed overlays jumping during resize (now anchored from top edge)
  • Fixed system shortcut conflicts not being detected during onboarding
  • Fixed instruction mode stripping metadata wrapper from selection context
  • Fixed daily billing limits resetting at UTC instead of local midnight
  • Fixed dismiss-first tap behavior and minimize-aware overlay visibility
  • Suppressed network calls in offline mode and billing-blocked recordings
  • Fixed clipboard fallback for Zed and GPU-rendered editors

v1.0.0-beta.5

Released Mar 9, 2026

Dictation now adapts to the app you're using. Per-app profiles let you set different transcription styles — casual for messaging apps, polished for documents — and automatically switch based on the active window, with browser URL matching for web apps like Google Docs or Slack. (#12)

Meeting overlays are now hidden from screen sharing by default, so your AI assistant stays private during Zoom, Meet, and Teams calls. (#25) A new set of quick-fill context templates lets you describe your meeting in one click, and you can now add or edit meeting context after recording has started. (#17) Meeting summaries support custom format instructions — configure once in settings to get the output style you prefer. (#13)

Other improvements:

  • Inline speaker renaming in post-meeting transcripts with automatic profile saving (#19)
  • Audio level overlay follows your active window across screens (#14)
  • Meeting assistant overlay remembers its position between uses
  • Replaced trial token system with a permanent free tier
  • Added Gemini 2.5 Flash Lite model and thinking level selector
  • Hold-to-talk no longer triggers from held keys (key repeat filtering)
  • Human-readable dates in meeting assistant briefings

Bug fixes:

  • Fixed meeting summary window appearing over full-screen apps (now bounces Dock icon instead of stealing focus) (#18)
  • Fixed overlays showing on the wrong screen when the frontmost app is full-screen
  • Fixed speaker label colors resetting after a rename

v1.0.0-beta.4

Released Mar 2, 2026

Meeting summaries now open in a proper macOS window with a title bar, traffic lights, and keyboard shortcuts (Cmd+W to close, Shift+Cmd+C to copy). The window auto-sizes to fit content — starting compact during processing stages, then growing when the summary arrives (KVO-based content fitting, capped at 80% screen height). You can ask follow-up questions directly in the summary window, with full conversation history preserved between questions.

The menu bar icon has been updated to the new bird logo, and the overlay system has been migrated to @Observable for per-property tracking, reducing unnecessary SwiftUI re-renders.

Other improvements:

  • About You field now accepts longer text and includes writing style guidance
  • Selection capture works in canvas-based editors like Google Docs (AX + clipboard fallback)
  • Rich text paste no longer inserts spurious blank lines (block-aware markdown parser)

Bug fixes:

  • Fixed a crash when reusing the meeting summary window (Auto Layout constraint loop)
  • Fixed meeting recording getting stuck in "stopping" state when no active worker
  • Fixed the last few seconds of audio being lost when stopping a meeting (DTLN buffer not flushed before finalization)
  • Fixed meeting summary window appearing off-screen on smaller displays
  • Fixed meeting assistant panel appearing on the wrong screen in multi-monitor setups
  • Fixed login item re-registering on every app launch (SMAppService persists across launches)
  • Fixed onboarding window not behaving like a standard macOS window (activation policy)
  • Fixed stale selection badges and synchronous database reads in the status panel
  • Fixed phantom Q&A entries appearing in meeting history
  • Fixed window drag gesture using wrong window reference

v1.0.0-beta.3

Released Feb 27, 2026

Stability release focused on fixing the crashes that made beta.2 unusable on first launch. The DTLN echo cancellation model now loads correctly from the app bundle (SPM Bundle.module bypass), and the loading overlay no longer blocks your first dictation after onboarding.

Onboarding also got visual polish: an accessibility settings mockup shows exactly which permissions to grant, and interactive keyboard visualizations on the hotkey pages let you preview each shortcut before committing to it.

v1.0.0-beta.2

Released Feb 26, 2026

Onboarding has been completely redesigned into a 4-page flow with per-shortcut setup, so each recording mode is introduced and configured on its own page. Privacy policy, terms of service, and opt-in analytics are now presented during setup rather than buried in settings.

A new Privacy settings panel lets you control what analytics and crash logs are sent to the developer. Dictation mode now rewrites more aggressively—cleaning up contradictions, long pauses, and restarts into clear, concise language. Meetings now queue for offline enrichment when there's no network connection, processing automatically when connectivity returns (persistent SQLite queue).

Other improvements:

  • Launch at login enabled by default for new users (SMAppService)
  • Meeting assistant default shortcut changed to Control+Space
  • Monospaced font for keyboard shortcuts display
  • Consistent spacing hierarchy in settings window
  • Assistant usage hint added to meeting start panel

Bug fixes:

  • Fixed event tap re-enable loop when accessibility permission is revoked
  • Fixed database migration for missing enrichment columns (reused v10 migration)
  • Fixed volume limiting default to 10% for new installs
  • Fixed audio feedback sounds cutting off and engine lifecycle issues

v1.0.0-beta.1

Released Feb 20, 2026

First public beta of MimicScribe — a macOS menu bar app for speech-to-text. Transcription is powered by NVIDIA's Parakeet TDT 0.6B model compiled to CoreML, so your audio never leaves your Mac. Text refinement, meeting summaries, and follow-up questions use Gemini.

Meeting mode (Cmd+Shift+Alt+Space) captures system audio and microphone simultaneously, producing a diarized transcript with speaker labels and an AI-powered summary. Speaker diarization uses pyannote community-1 models with a Gemini pass to fix attribution errors. DTLN neural echo cancellation separates your voice from system audio. A real-time meeting assistant lets you ask questions about the conversation while it's still happening.

Insert mode (Alt+Space) transcribes speech and pastes it at your cursor. Instruction mode (Shift+Alt+Space) sends voice commands to Gemini with your selected text as context, displaying results in a dark glass overlay.

Other improvements:

  • Dark glass onboarding flow with card-based layout
  • Sparkle auto-updates with EdDSA signing
  • Custom CGEventTap hotkey system (replacing KeyboardShortcuts library)
  • Redesigned status panel trial card with purchase and activate links

Bug fixes:

  • Fixed 3-second hang on app quit (main-thread deadlock in audio teardown)
  • Fixed DTLN model crash in .app bundles (SPM bundle resolution)
  • Fixed onboarding accessibility setup not guiding users through permissions