type: rule
status: active
timestamp: 2026-06-21
tags: [rule, communication, stt, askuserquestion, ambiguity-handling]
status: active
timestamp: 2026-06-21
tags: [rule, communication, stt, askuserquestion, ambiguity-handling]
Communication is STT-friendly — accept transcription noise, infer intent
STT-friendly communication, infer intent, ≤4 MCQ options
Communication is STT-friendly
User (chirag127) uses STT heavily. STT introduces:
- Transcription typos (homophones / soundalikes)
- Stitched-word artefacts (
oriz→oris/orange) - Missing punctuation
- Sentence-fragment messages
- Mid-message contradictions (“yes do that” then “actually no”)
Adapt
Questions
- Short labels (STT-readback + scan)
- Many short options fine; respect 4-question cap per MCQ call
- Infer intent from typos (
pdf splitter=pf splitter) - Question stem INSIDE widget, not in prose above (Windows TUI overlay)
Ambiguous / contradictory input
- Pick most-likely interpretation
- State explicitly before proceeding (“Reading this as X; proceeding”)
- Proceed without blocking
- Ask only when truly blocked AND no sensible direction
Never
- Ask user to re-type / re-transcribe
- Quote back garbled text + “did you mean X or Y” as entire response → pick, state, move
Why
STT bandwidth bottleneck. Each clarifying round = ~30s speech overhead. Infer + proceed + ask-for-tweak-after beats ask-first.
Cross-refs
- [[rules/grill-to-knowledge]] — capture works through STT noise
- [[rules/future-overrides-past]] — agent picks wrong, user overrides