v1.0.0-rc.12

Released Jul 1, 2026

Download v1.0.0-rc.12

Your meeting can no longer disappear when you tap Stop. If transcription hits a snag while finishing up — an audio-engine hiccup or a stalled decode — the app now recovers your recording from the saved audio instead of discarding it as "no speech," and reports an empty meeting only when the recording was genuinely silent. In the rare case nothing can be recovered, the audio is preserved so it's never lost. (#47)

Other improvements:

  • Onboarding now follows your system Light/Dark appearance, and the "customize shortcut" step responds correctly to clicks
  • Cleaner transcripts at pauses — a filler word can no longer knock out a real word where two segments meet, and a sentence spoken across a dramatic pause stays attributed to a single speaker

Bug fixes:

  • More reliable startup: a database-initialization failure now exits cleanly instead of hanging the app

v1.0.0-rc.11

Released Jun 26, 2026

The meeting overlay and status panel now follow your system appearance. Switch macOS to Light mode and the overlay switches with it — a clean light theme with readable cards and section tints — while Dark mode looks exactly as before.

The live meeting assistant works harder to put talking points in front of you the moment you need them — especially right after a remote speaker finishes a long turn and hands the floor back to you. Briefings now refresh when the conversation genuinely moves on rather than on raw talk time, and an adaptive cadence keeps short talking points from going stale on a fast-moving call. Questions surfaced to you no longer linger after they're answered, and live action items stop flickering in and out as the discussion develops.

Other improvements:

  • Fewer phantom speakers after a meeting — a reworked pass dissolves grab-bag clusters built from brief backchannels and noise, and recovers speaker tails that were split off into their own bogus speaker
  • More accurate speaker labeling overall, via a new content-reliability signal that down-weights low-information audio when resolving who spoke
  • The primary speaker is named from meeting context, with the match confirmed when a recognized voice and name line up
  • Dates in the live assistant are now localized to your region

Bug fixes:

  • Fixed a freeze (beachball / 100% CPU) when scrolling a long transcript — most visible reviewing Local-Only meetings, which open straight to the transcript view. The overlay no longer fights its own auto-resize (#44)
  • The app now renders correctly in Light mode — no more dark text on dark backgrounds across the overlay and windows (#40, #41, #45, #46)
  • The last words of an utterance are no longer dropped at a pause — a reworked transcript merge keeps the full tail of each spoken segment

v1.0.0-rc.10

Released Jun 22, 2026

Speaker attribution is substantially more accurate this release. A reworked speaker-resolution pass cleans up the most common errors after a meeting: phantom speakers spun up from played-back audio or brief background noise are dissolved, short interjections wrongly split off a speaker are reattached, and a quiet speaker absorbed into a louder one is recovered. A brief voice that can't be placed is now parked as "Unknown" rather than inflating the participant list, and a speaker recovered from context is marked with a tentative "?" so you know it's a best guess.

Every AI feature now follows your provider choice. Document chunking, vocabulary hints, OCR, and search context route through whichever provider you've selected — cloud Gemini, your own key, or a local endpoint — so a fully local setup keeps everything on-device. Custom endpoints gain a model dropdown to pick from the models your server offers, and verifying a provider auto-adopts it across every feature selector.

Other improvements:

  • Meetings show a green badge on speakers whose voice matches a saved profile, and a saved profile is backfilled onto past meetings when it's later recognized
  • Edit a transcript and the summary regenerates on its own, so action items and notes stay in sync with your corrections
  • Renaming, merging, or removing a speaker after a meeting applies instantly without re-running attribution
  • Full keyboard navigation and focus in the overlay's editing surfaces
  • Meeting notes render live as you type, support selectable markdown, and stay isolated per meeting
  • Better speaker accuracy in multilingual meetings — common backchannels and fillers across Western European languages are now recognized when speakers code-switch
  • A recording-consent reminder pill in the live assistant

Bug fixes:

  • Paid plans are no longer metered for AI calls that don't go through the MimicScribe proxy (your own key or a local endpoint)
  • The overlay re-fits its height when you switch between Summary and Transcript or move it to another screen
  • Notes added or deleted from the Transcript tab now render correctly
  • Closed a rare race that could drop the final words of a meeting when stopping

v1.0.0-rc.9

Released Jun 12, 2026

See exactly where the app connects. The new Network Log — in a new Activity section in Settings, alongside Transcription History — records every network request the app makes as it happens: host, purpose, and outcome, metadata only. Connection warm-up now follows your routing too: features pointed at a local endpoint or your own Gemini key warm that provider instead of the MimicScribe proxy, so an all-local setup makes no proxy connections at all, and the AI pane now labels Gemini with "your API key" when your own key handles the calls.

Meeting briefings start sooner: the opening briefing fires the moment you start a meeting, before models finish loading, so prepared context reaches the panel seconds earlier. Local and custom endpoints get a reasoning-effort control and timeouts sized for local models — larger on-device models can now finish briefings they previously timed out on — plus reliability fixes: instant failure when the server is offline, truncated-response detection, and smarter retry on rate limits.

Other improvements:

  • Press ⌘F anywhere in the hub to jump to meeting search; the back mouse button, Esc, Delete, and ⌘[ now navigate consistently across the app
  • Speaker attribution: a stricter merge gate stops distinct same-channel voices (e.g. two callers on a conference line) from being merged together
  • Leaner AI request formats and prompt caching cut response latency for speaker attribution and summaries
  • Action-item export now pre-checks only your own and unassigned items
  • The MCP server now works with standard MCP clients — a framing bug had broken every client except the bundled one — and guides agents to the action items you reviewed in the app

Bug fixes:

  • The assistant no longer shows doubled transcript text after a local-model hiccup
  • Prepared meeting context is used for spelling and terminology precision but no longer leaks into summaries as if it were said in the meeting
  • Onboarding window: drag it from anywhere; clicking buttons no longer moves the window
  • Verifying a Gemini API key mid-meeting no longer leaves the keep-alive pinging the proxy

v1.0.0-rc.8

Released Jun 11, 2026

Run the AI layer locally if you want to. This release adds an OpenAI-compatible endpoint — Ollama, LM Studio, or any local server — alongside cloud Gemini and bring-your-own-key. Dictation, Transform, meeting summaries, post-meeting Q&A, and the live assistant each route independently to cloud, local, or off, from a single redesigned AI settings pane.

The live assistant is faster and easier to skim. Refreshes land on conversational boundaries instead of waiting for silence, and an on-device classifier catches commitments as they're spoken — new action items reach the panel within a few seconds. Briefings and answers are shorter, with bolding where it helps scanning. The Light plan's briefing allowance doubles to 200 a month to match the quicker cadence.

Speaker attribution got a large round of work: audio playing from your speakers no longer surfaces as a phantom speaker (an echo-cancellation timing fix), multi-speaker backchannel clutter is dissolved into the right speakers, and quiet speakers absorbed into a dominant voice are recovered.

Other improvements:

  • Drag the overlay from any empty area in every mode — text still selects normally
  • Transcript edits save instantly without reloading the view; speaker renames apply in place
  • Custom endpoint setup verifies the model responds, not just that the server is reachable
  • MCP: clearer errors, import status polling, request IDs for correlating results
  • Security hardening: license and API keys moved to the Keychain, authenticated cross-process channels, tighter log hygiene

Bug fixes:

  • Fixed a system-audio converter bug that silently dropped ~0.4% of remote audio — the root cause of meeting clock drift
  • Transform mode verifies short selections against the clipboard, fixing bogus one-character captures in Google Docs
  • Assistant panel no longer jiggles, clips its last line, or flashes a loading dim on reopen
  • Played media no longer generates spurious action items, and the action-item list no longer drops or duplicates entries between refreshes

v1.0.0-rc.7

Released Jun 6, 2026

AI meeting summaries are now free on every tier — with separated, named speakers and transcript correction, and a generous daily fair-use limit. Paid plans now center on the live in-meeting assistant and bring-your-own-key. File transcription has its own daily limit, separate from meeting summaries.

Summary views replace meeting types, and you can now build your own. The four built-in views — Simple, Walkthrough, By Participant, and Decisions & Open Questions — sit alongside any custom view you create. Set a default in Preferences, or switch the view from the post-meeting summary header. Switching a view restyles only the narrative; the overview, action items, and speaker transcript stay put.

Other improvements:

  • Reworked the post-meeting summary header: an inline "View:" switcher in the metadata row, a cleaner title and speaker layout, and steadier spacing.
  • The pre-meeting context form now opens straight to a notes field instead of a row of template chips.
  • Evened out the spacing around the Q&A divider.

Bug fixes:

  • Switching summary views no longer dims or reloads the speaker transcript.
  • The overview and action items stay stable across view switches instead of flashing a reload.
  • Fixed a one-word phantom speaker (for example, "go.") being split off from the person who actually said it.

v1.0.0-rc.6

Released Jun 5, 2026

This is a focused follow-up to the speaker-identity work in rc.5, mostly making voice recognition stick more reliably. You can now be enrolled from a single long turn — a continuous solo talker is merged into one segment, which the old two-segment enrollment bar rejected no matter how long they spoke. Saving a global voice profile now always creates a durable, verified profile instead of occasionally leaving a "(You)" label with no recognized badge, and that badge persists after you leave and reopen a meeting.

Other improvements:

  • When two speakers are merged, both voices feed the saved profile instead of dropping the shorter cluster, so recognition holds up better after a correction
  • Transform mode is now marked Beta in Settings, with a heads-up that it may move to an overlay that previews edits before applying

v1.0.0-rc.5

Released Jun 5, 2026

Speaker separation is substantially more accurate. A rebuilt sentence-level pipeline, smarter cluster merging, and new guards against bleed and end-of-meeting artifacts mean far fewer split, duplicated, or phantom speakers, and the attribution prompt was rewritten for cleaner who-said-what. The app also recognizes you by voice across meetings: confirm "Is this you?" once and the (You) badge follows you forward and backward through your history, with inline renaming and one saved voice profile per person.

Much of that accuracy comes from a new echo canceller. Early builds used DTLN, which never fully removed the echo and left distracting artifacts; Apple's built-in voice processing cancelled cleanly but ducked the meeting audio itself, quieting the conversation you were trying to capture. RC5 switches to LocalVQE, an on-device neural model that runs on copies of the audio — it cancels more completely than DTLN without the ducking, so what you hear stays untouched while the mic is cleaned before transcription. It downloads in-app on first use and, with a stack of new mic-bleed defenses, clears most of the phantom "duplicate" speakers caused by a remote voice leaking into your mic.

The live meeting assistant gained an in-meeting Q&A bar and a running catch-up feed of summary points that updates and retracts itself as the conversation moves. Post-meeting summaries now adapt their structure to the meeting, lead with a tight overview and tiered action items, and can be reshaped with view lenses like Decisions & Open Questions. Every edit you make is reversible with multi-level undo and a one-click restore-to-original.

Pricing is now three tiers: a Free plan with daily on-device voice typing plus a starter allowance of AI features, a Light plan for regular meeting use, and Unlimited. Bring-your-own-key bypasses caps on any paid tier, and the in-app subscription panel was redesigned around the new tiers.

Other improvements:

  • Local Mode is enforced end-to-end — across follow-up Q&A, search, and the MCP tools — with on-device speaker recognition so a private meeting stays fully on your machine
  • The menu bar shows "Capturing audio" with elapsed time while a meeting records
  • Faster, smoother transcripts: the meeting transcript is virtualized and rendered natively, with per-turn selection and a lighter markdown path
  • Keyboard navigation across the meeting hub and summary; standard macOS pointer cursors throughout the overlay
  • Accessibility: opaque, bordered controls and high-contrast treatment under Reduce Transparency / Increase Contrast
  • Per-turn speaker reassignment directly in the summary, with clearer "Reassign to " labels
  • The entire AI pipeline moved to Gemini 3.1 Flash-Lite for faster, lower-cost responses
  • File imports run as explicit MCP tools with determinate progress and ETA
  • Summaries keep a dedicated section for notable personal updates and split long monologues into readable paragraphs
  • Security: Sparkle updater bumped to 2.9.2; server logs scrub email PII

Bug fixes:

  • Fixed two cases where a meeting could hang at "Identifying speakers," plus a database-init deadlock on startup
  • Hardened a range of crash and concurrency foot-guns across the recording, audio, and fusion paths
  • Interrupted recordings are recovered and surfaced with a one-time launch alert instead of being lost
  • Free-tier usage meter stays in sync with the server; billing charges once per action rather than per network call
  • Cold-start meetings no longer cancel on hotkey release or show a false "Live transcript paused" banner
  • Post-meeting Q&A no longer echoes the transcript when a question is unclear or unrelated

v1.0.0-rc.4

Released Apr 30, 2026

This release candidate is a substantial step up in voice-based identity, diarization accuracy, and meeting-assistant quality. The (You) badge is now gated on voice match and survives across meetings via stable profile-UUID colors, with tap-to-rename badges, an inline "Is this you?" pill, and name edits that cascade across history. Speaker recognition uses multi-centroid profiles for cold-start matching (−42% relative EER at 60 seconds of voice), and a new acoustic boundary-snap pass tightened diarization edges by another −0.76pp DER on top of a re-shipped attribution prompt.

Briefings and summaries got tighter prompts and stronger guardrails: TALKING POINTS is slimmer, USER_CONTEXT no longer bleeds product nouns into off-topic meetings, action-item due dates resolve against meeting timezone with a dual-clock display, and past-due items are preserved across refreshes. Dictation moved to a more interpretive prompt validated against a 37-case benchmark, and the vocabulary system now phonetically gates spelling hints (Double Metaphone), supports n-gram compound terms, and classifies user-supplied terms by domain so non-tech vocabularies aren't biased toward developer jargon.

File imports gained drag-drop, Open With support, and in-flight progress UX. Enrichment failures now surface a reason and a raw-transcript banner instead of failing silently — including the Gemini SAFETY block path, which was previously masked by content filters. The Pro tier is renamed to Unlimited everywhere.

Other improvements:

  • History view collapses to two filters (Meetings / Dictation & Instructions); "Use as Context" removed from the multi-select bar; copy-transcript button at bottom of detail
  • Status panel shows an empty-state hotkey card and auto-shrinks
  • Single-level undo for voice-driven meeting corrections (Cmd+Z)
  • Meeting Q&A: per-row copy / revert / delete actions; past corrections shown inline in the history-panel summary; classifier now extracts speaker-rename pairs cleanly
  • Overlay: skeleton loader for the first briefing, peek briefing without stealing focus, first-run empty-state card replaces the coaching popover, transform-independent toggle, redesigned waveform, press feedback on meeting-hub cards
  • Transcript and summary panels rendered with vendored Textual for cross-paragraph selection
  • Network calls now use jittered exponential backoff
  • Voice profile saved before renaming segments so the green check lights up reliably
  • Speakers in file imports now get voice embeddings; profile centroids un-freeze and recompute from samples; merging two profiles preserves embeddings
  • Outlier voice samples are trimmed before centroid averaging
  • Three-level speaker identity in the database with proper merge cascading and orphan sweep
  • Website: GDPR & CCPA trust link under the hero CTA, inline monthly/yearly billing toggle on the pricing card, copy refresh ("speaker separation" instead of "diarization", "Library" instead of "Knowledge Base")

Bug fixes:

  • Cmd+D delivery is reliable across prewarm, escape cancellation, and stop-during-startup races; "Listening…" holds through finalize and cancels cleanly on view teardown
  • Gesture state resets when recording ends externally
  • Meeting hub no longer claims "Processing" when a meeting is only queued
  • Briefing pinned at first streaming chunk to stop the floor from jumping
  • Overlay size jumps eliminated on start-meeting bootstrap and briefing refresh
  • Hub reopen no longer flashes; template chip first-click is reliable
  • Meeting corrections preserve verified speakers and dim the summary during regen
  • Empty LLM-output segments are dropped during fusion; orphan meeting_speakers are swept
  • Fallback speaker labels shortened to "Mic N" / "Remote N"; falls back to "Speaker N" when display_name resolves empty
  • Email correspondence in Insert mode now gets paragraph breaks
  • Floating panel toolbar view tree stays stable so the hub→summary morph fires
  • Meeting-hold pill aligned with the waveform capsule
  • "Reveal in History" wired in the unified overlay summary
  • Brighter assistant loading labels for legibility
  • Audio level overlay hidden on every recording cleanup path
  • Persistent billing gate banner with rate-limit attribution

v1.0.0-rc.3

Released Apr 21, 2026

The meeting overlay got a significant polish pass. The briefing panel now streams in as it generates and auto-refreshes when you pause after new speech, so the view stays current without you reaching for the refresh hotkey. The whole panel is draggable, speaker colors are unique per speaker (blue reserved for You), and chrome across every window now shares a unified look. Transform mode gained writing-sample support so it can better match your voice, plus anti-injection guardrails that prevent your source text from being treated as instructions.

Other improvements:

  • Briefing preserves durable facts in the summary across refreshes instead of rewriting it each time
  • Briefing template quotes captured questions verbatim
  • New setting to disable briefing auto-refresh if you prefer manual control
  • Settings > Dictation gained a short intro; meeting shortcut descriptions clarify global vs window scope
  • Billing migrated to Stripe; referral program added for existing customers

Bug fixes:

  • Audio-level capsule follows your active app window after the overlay resigns key
  • Floating panel no longer swallows global keystrokes or gets stomped by alt-tab
  • Click-anywhere-to-key restored on the floating panel
  • Overlay follows to whichever screen you interact with it on
  • Meeting assistant speech tracking uses sustained VAD state (fewer false triggers)
  • Concurrent live-transcript requests are deduped instead of racing

v1.0.0-rc.2

Released Apr 18, 2026

This release candidate focuses on reliability and responsiveness in long meetings. Gemini responses now stream end-to-end for attribution and summarization, eliminating Cloudflare timeouts that previously hit 90+ minute recordings. Diarization accuracy improves with word-cut boundary repair and re-tuned segmentation — measured +2.1pp speaker attribution on AMI.

The overlay gained a substantial stability pass: in-flight meetings can be reopened from the Hub while still processing, the assistant↔hub transition recovers cleanly from start errors, the summary view transitions correctly on stop, and the end-meeting flow is unified across live and deferred paths. Processing status throughout the UI is now truthful — the perpetual "Processing" badge has been replaced with actionable failure states, and spinners no longer appear when nothing is active.

Per-meeting summary templates (Discovery, Sales, Interview, Standup, Customer, Presenting) now ship with tuned instructions honed against an 18-scenario benchmark. MCP agents can stage a template before a meeting via set_meeting_context with template_id, or discover options with the new list_templates tool.

Briefing and summary prompts have been re-tuned from A/B data to improve requirements gathering, resurfacing of prepared context, and section routing. Transcript history search now supports voice input for follow-up queries, with skeleton loaders and unified result styling.

Other improvements:

  • Streaming Gemini calls for long attribution and summary prompts
  • Word-cut boundary repair in diarization; step=0.16, attribution temperature tuned to 0.1
  • Per-template summary instructions for all six built-in meeting types
  • MCP list_templates tool and template_id parameter on set_meeting_context
  • Voice input for transcript history search follow-ups
  • Copied meeting summaries now include user notes
  • macOS 15.0 set as the official minimum deployment target
  • New cost-comparison benchmark page on the website

Bug fixes:

  • Perpetual "Processing" badge replaced with actionable failure states
  • Spinners no longer shown when enrichment is idle
  • Meetings still enriching can be reopened from the Hub
  • Assistant and Hub transitions recover from start-error states
  • Summary view transitions correctly after stopping a meeting
  • Briefing state wiped on meeting stop, not only on start
  • Proxy allowlist now matches camelCase :streamGenerateContent
  • Explicit transcript content preserved during refinement safety passes
  • No-speech meetings handled consistently across live and deferred paths
  • Billing gating no longer triggers during upgrades or transient errors
  • Assistant header drag works on first click (SwiftUI gesture)
  • Speech-pace alert sensitivity reduced to cut false positives
  • Enrichment badges use larger caption font for readability

v1.0.0-rc.1

Released Apr 14, 2026

This release candidate sharpens the meeting assistant around discovery and requirements gathering, with substantial prompt and context work to reduce hallucinations, improve resurfacing of prepared context, and keep pace in long meetings through preemptive transcript compaction. A new Presenting template lets the assistant track coverage during demos and walkthroughs instead of nudging goals.

Context retrieval has been rebuilt on MiniLM embeddings with conversational search-phrase chunking, multi-query retrieval, and paragraph-split indexing — substantially improving recall on prepared reference documents and past meetings. Reference documents now accept a wider range of formats via drag-and-drop, including images through Gemini OCR.

The meeting Q&A experience now supports three distinct intents: questions answered against the transcript, user notes captured inline, and corrections that regenerate the summary in place. Conversations persist across reopens, user notes survive summary regeneration, and the classifier routes each turn without the main LLM having to guess.

Meeting recording is more resilient: state is preserved across quit, sleep, and crashes; billing credits are no longer rolled back on transient refinement errors; and code-switching is supported in primarily-English meetings. Silero VAD is now used for device-independent speech detection, powering the refresh badge and idle monitor.

Other improvements:

  • New dedicated hotkey to refresh the meeting assistant
  • Insert-mode history shows the original transcript alongside the refined text
  • Dual-channel diarization runs concurrently for faster meeting finalization
  • Meeting summaries gain a unified glass overflow menu and smoother processing-to-summary transitions
  • Onboarding and settings panels adopt macOS Tahoe glass styling throughout
  • Gemini proxy hardened with model allowlist, device ID validation, and a licensed-user monthly cap
  • Prompt injection mitigations added across all LLM prompt sites
  • Action items and summary prompts no longer extract from user/reference context
  • Speaker badges show consistent colors across overlay, history, and search

Bug fixes:

  • Bare-key global hotkeys no longer swallow system-wide input
  • Meeting assistant hotkey no longer gets permanently stuck
  • Per-meeting reference docs now display correctly instead of the current global state
  • Stale LLM stats cleared on metadata and correction turns
  • Briefing no longer resolves relative dates (keeps ISO in action items)
  • RAG pipeline triggers for MCP-set context and during-recording edits
  • Escape and Cmd+W dismiss all overlay modes
  • Accessibility permission state refreshes in the dictation UI
  • Overlay header drag works on first mouse click
  • First heading no longer shows a redundant divider

v1.0.0-beta.8

Released Apr 3, 2026

This release introduces the Meeting Hub — a redesigned overlay with one-tap meeting controls, live VU meters, and integrated meeting prep. The entire UI has been refreshed with macOS Tahoe-inspired glass styling, bringing a modern look to overlays, toolbars, and interactive elements.

Meeting assistant now auto-refreshes intelligently based on conversation activity, and briefings persist across dismiss/reopen cycles. Idle recording detection prompts you when no speech is detected, helping avoid forgotten recordings. Instruction mode has been streamlined to direct-paste with session history.

For privacy-conscious users, Local Mode (previously Offline Mode) is more prominent, and a new Bring Your Own Key option lets licensed users connect their own Gemini API key. The privacy page has been overhauled with clearer language around permissions and data handling, and the Gemini proxy is now open source.

Other improvements:

  • Unified voice typing quota for dictation and transform modes
  • Raised free-tier limits for beta
  • Reference documents now route through the embedding pipeline for better retrieval
  • Single Gemini call for context processing with concurrent chunk handling
  • Speaker attribution uses acoustic hints and conservative bias for better accuracy
  • Echo leakage detection in diarization

Bug fixes:

  • Fixed embedding model load race when multiple requests arrive simultaneously
  • Empty meetings no longer saved to database
  • Q&A responses now route to the correct view model
  • Fixed diarization crash on duplicate speaker mapping keys
  • Fixed instruction mode overlay race conditions during deferred upgrade
  • Preserved contractions in dictation refinement
  • Reduced over-eager vocabulary injection in refinement prompts

v1.0.0-beta.7

Released Mar 27, 2026

The meeting assistant now surfaces interpersonal awareness more accurately. Acoustic signal annotations — overlapping speech, laughter, crosstalk, and similar events — are detected in the transcript and sent to Gemini alongside the spoken content. This gives the assistant richer context for understanding conversation dynamics, not just what was said.

Onboarding has been redesigned with a hold-to-practice interaction and a coaching overlay that guides new users through their first recording. The meeting assistant overlay received a round of UI polish: no more width jumps on load, smoother transcript transitions, and clear empty states when no speech is detected.

Other improvements:

  • AI agents can now manage reference documents via the new set_context_source MCP tool — add, update, or remove context that informs your meetings
  • Meeting assistant transcript auto-refreshes in offline and billing-blocked modes
  • File imports now go through the full speaker attribution pipeline
  • Meeting summaries lead with decisions and use a bullet-only format for faster scanning

Bug fixes:

  • Hotkey monitors re-register correctly after a binding change
  • Microphone permission prompt no longer appears prematurely on first launch
  • Dictation enrichment timeout increased to 3s to reduce premature cutoffs
  • Meetings with no detected speech show a clear empty state
  • Fixed checkout API token mismatch and license key selection

v1.0.0-beta.6

Released Mar 27, 2026

Add reference documents to your meetings — upload agendas, briefs, or prior notes and they're chunked, embedded on-device, and retrieved during the meeting to give your assistant grounded context. Indicators show when documents are loaded and a test mode verifies retrieval quality.

MimicScribe now includes an MCP server that connects your AI coding agent to your meeting history. Eight tools — search meetings, get transcripts and summaries, query action items, set meeting context, manage reference documents, and import audio files — work with Claude Code, Cursor, Zed, and other MCP-compatible clients. The server sends real-time notifications when meetings are enriched, imported, deleted, or merged. (#27)

Batch file import lets you queue audio and video files for transcription with speaker diarization — drag files in, add them from the context menu, or use the MCP import_audio_file tool. A new queue display tracks processing status.

Meetings can now pull attendee names from your calendar automatically, and a new per-meeting Offline Mode disables all cloud AI while keeping on-device transcription running — you can retroactively generate a summary later. (#31)

Other improvements:

  • Unified meeting overlays into a single panel with Control+Space toggle
  • Meeting assistant auto-refreshes its briefing when a pause in conversation is detected (#23)
  • Custom vocabulary for specialized words and names that ASR and LLM should recognize (#22)
  • Inverse text normalization — numbers, dates, phone numbers, and times are auto-formatted in transcriptions (#24)
  • Configurable dictation timeout with raw text fallback (default 2s)
  • Double-tap recall now works in dictation mode (reopens last result)
  • Cmd+V paste from instruction overlay, mouse-cursor screen selection
  • Instruction mode matches the input format instead of defaulting to Markdown
  • Cmd+W closes all overlay windows
  • Error sounds and visual feedback for dictation/instruction failures
  • Accessibility permission is now optional (only required for voice typing)
  • Daily billing caps for assistant and follow-up queries (10/day), file transcription cap, briefings counter in free-tier panel
  • Two-column history panel with speaker colors

Bug fixes:

  • Fixed hotkeys not working after onboarding completion (#15)
  • Fixed overlays jumping during resize (now anchored from top edge)
  • Fixed system shortcut conflicts not being detected during onboarding
  • Fixed instruction mode stripping metadata wrapper from selection context
  • Fixed daily billing limits resetting at UTC instead of local midnight
  • Fixed dismiss-first tap behavior and minimize-aware overlay visibility
  • Suppressed network calls in offline mode and billing-blocked recordings
  • Fixed clipboard fallback for Zed and GPU-rendered editors

v1.0.0-beta.5

Released Mar 9, 2026

Dictation now adapts to the app you're using. Per-app profiles let you set different transcription styles — casual for messaging apps, polished for documents — and automatically switch based on the active window, with browser URL matching for web apps like Google Docs or Slack. (#12)

Meeting overlays are now hidden from screen sharing by default, so your AI assistant stays private during Zoom, Meet, and Teams calls. (#25) A new set of quick-fill context templates lets you describe your meeting in one click, and you can now add or edit meeting context after recording has started. (#17) Meeting summaries support custom format instructions — configure once in settings to get the output style you prefer. (#13)

Other improvements:

  • Inline speaker renaming in post-meeting transcripts with automatic profile saving (#19)
  • Audio level overlay follows your active window across screens (#14)
  • Meeting assistant overlay remembers its position between uses
  • Replaced trial token system with a permanent free tier
  • Added Gemini 2.5 Flash Lite model and thinking level selector
  • Hold-to-talk no longer triggers from held keys (key repeat filtering)
  • Human-readable dates in meeting assistant briefings

Bug fixes:

  • Fixed meeting summary window appearing over full-screen apps (now bounces Dock icon instead of stealing focus) (#18)
  • Fixed overlays showing on the wrong screen when the frontmost app is full-screen
  • Fixed speaker label colors resetting after a rename

v1.0.0-beta.4

Released Mar 2, 2026

Meeting summaries now open in a proper macOS window with a title bar, traffic lights, and keyboard shortcuts (Cmd+W to close, Shift+Cmd+C to copy). The window auto-sizes to fit content — starting compact during processing stages, then growing when the summary arrives (KVO-based content fitting, capped at 80% screen height). You can ask follow-up questions directly in the summary window, with full conversation history preserved between questions.

The menu bar icon has been updated to the new bird logo, and the overlay system has been migrated to @Observable for per-property tracking, reducing unnecessary SwiftUI re-renders.

Other improvements:

  • About You field now accepts longer text and includes writing style guidance
  • Selection capture works in canvas-based editors like Google Docs (AX + clipboard fallback)
  • Rich text paste no longer inserts spurious blank lines (block-aware markdown parser)

Bug fixes:

  • Fixed a crash when reusing the meeting summary window (Auto Layout constraint loop)
  • Fixed meeting recording getting stuck in "stopping" state when no active worker
  • Fixed the last few seconds of audio being lost when stopping a meeting (DTLN buffer not flushed before finalization)
  • Fixed meeting summary window appearing off-screen on smaller displays
  • Fixed meeting assistant panel appearing on the wrong screen in multi-monitor setups
  • Fixed login item re-registering on every app launch (SMAppService persists across launches)
  • Fixed onboarding window not behaving like a standard macOS window (activation policy)
  • Fixed stale selection badges and synchronous database reads in the status panel
  • Fixed phantom Q&A entries appearing in meeting history
  • Fixed window drag gesture using wrong window reference

v1.0.0-beta.3

Released Feb 27, 2026

Stability release focused on fixing the crashes that made beta.2 unusable on first launch. The DTLN echo cancellation model now loads correctly from the app bundle (SPM Bundle.module bypass), and the loading overlay no longer blocks your first dictation after onboarding.

Onboarding also got visual polish: an accessibility settings mockup shows exactly which permissions to grant, and interactive keyboard visualizations on the hotkey pages let you preview each shortcut before committing to it.

v1.0.0-beta.2

Released Feb 26, 2026

Onboarding has been completely redesigned into a 4-page flow with per-shortcut setup, so each recording mode is introduced and configured on its own page. Privacy policy, terms of service, and opt-in analytics are now presented during setup rather than buried in settings.

A new Privacy settings panel lets you control what analytics and crash logs are sent to the developer. Dictation mode now rewrites more aggressively—cleaning up contradictions, long pauses, and restarts into clear, concise language. Meetings now queue for offline enrichment when there's no network connection, processing automatically when connectivity returns (persistent SQLite queue).

Other improvements:

  • Launch at login enabled by default for new users (SMAppService)
  • Meeting assistant default shortcut changed to Control+Space
  • Monospaced font for keyboard shortcuts display
  • Consistent spacing hierarchy in settings window
  • Assistant usage hint added to meeting start panel

Bug fixes:

  • Fixed event tap re-enable loop when accessibility permission is revoked
  • Fixed database migration for missing enrichment columns (reused v10 migration)
  • Fixed volume limiting default to 10% for new installs
  • Fixed audio feedback sounds cutting off and engine lifecycle issues

v1.0.0-beta.1

Released Feb 20, 2026

First public beta of MimicScribe — a macOS menu bar app for speech-to-text. Transcription is powered by NVIDIA's Parakeet TDT 0.6B model compiled to CoreML, so your audio never leaves your Mac. Text refinement, meeting summaries, and follow-up questions use Gemini.

Meeting mode (Cmd+Shift+Alt+Space) captures system audio and microphone simultaneously, producing a diarized transcript with speaker labels and an AI-powered summary. Speaker diarization uses pyannote community-1 models with a Gemini pass to fix attribution errors. DTLN neural echo cancellation separates your voice from system audio. A real-time meeting assistant lets you ask questions about the conversation while it's still happening.

Insert mode (Alt+Space) transcribes speech and pastes it at your cursor. Instruction mode (Shift+Alt+Space) sends voice commands to Gemini with your selected text as context, displaying results in a dark glass overlay.

Other improvements:

  • Dark glass onboarding flow with card-based layout
  • Sparkle auto-updates with EdDSA signing
  • Custom CGEventTap hotkey system (replacing KeyboardShortcuts library)
  • Redesigned status panel trial card with purchase and activate links

Bug fixes:

  • Fixed 3-second hang on app quit (main-thread deadlock in audio teardown)
  • Fixed DTLN model crash in .app bundles (SPM bundle resolution)
  • Fixed onboarding accessibility setup not guiding users through permissions