Capture
Recording engines, screen-off capture, meeting mode, and text input.
ThoughtSift captures a thought as audio and transcribes it. Several transcription engines are available, and you can also type directly.
Screen-off capture
On iOS, a volume-button or voice trigger can start recording with the phone locked — no unlock, no app navigation, no broken walk. A haptic confirmation fires when capture completes.
Transcription engines
- On-device Whisper (WhisperKit) — local transcription; falls back to cloud only if needed.
- Apple Speech Analyzer — the on-device engine on iOS 26+.
- Gemini cloud transcription — a cloud option, with optional refinement.
When transcription runs on-device, your voice audio does not leave the device.
Meeting mode
Meeting mode asks the transcriber for speaker labels and structured fields, so multi-speaker captures come back organised.
Text input
If you’d rather type, the text-input path skips recording and feeds your text straight into content intelligence.
Manual review
You can enable a manual review step to edit the transcript before a diagram is generated, so the visual reflects exactly what you meant.