Capture

Recording engines, screen-off capture, meeting mode, and text input.

ThoughtSift captures a thought as audio and transcribes it. Several transcription engines are available, and you can also type directly.

Screen-off capture

On iOS, a volume-button or voice trigger can start recording with the phone locked — no unlock, no app navigation, no broken walk. A haptic confirmation fires when capture completes.

Transcription engines

  • On-device Whisper (WhisperKit) — local transcription; falls back to cloud only if needed.
  • Apple Speech Analyzer — the on-device engine on iOS 26+.
  • Gemini cloud transcription — a cloud option, with optional refinement.

When transcription runs on-device, your voice audio does not leave the device.

Meeting mode

Meeting mode asks the transcriber for speaker labels and structured fields, so multi-speaker captures come back organised.

Text input

If you’d rather type, the text-input path skips recording and feeds your text straight into content intelligence.

Manual review

You can enable a manual review step to edit the transcript before a diagram is generated, so the visual reflects exactly what you meant.