Interview Transcription: Preparing Audio for Publication
Interview Transcription: Preparing Audio for Publication
Between the audio recording of an interview and the published text lies a multi-step process. Transcription is the first step — and the one that used to consume the most time. Here's the complete workflow from recording to publication-ready text.
Stage 1: Recording Quality Matters
Transcription accuracy is only as good as your audio. A few principles:
Location. Quiet room without echo. Avoid coffee shops, open-plan offices, and outdoor settings. A conference room or home office works well.
Microphone. For in-person interviews: a lavalier mic on the interviewee (15–20 cm from mouth). For remote: ask the subject to use a headset, not their laptop's built-in microphone.
Format. MP3 at 128+ kbps or M4A. WAV gives slightly better quality but files are 10x larger — the accuracy difference for speech recognition is negligible.
Backup recording. Record on two devices. Phone + dedicated recorder, or Zoom cloud + local recording. Losing an interview recording is unprofessional and painful.
Stage 2: Transcription
Upload the recording to Nagovori. A one-hour interview processes in 2–5 minutes. Copy the text into your editor (Google Docs, Notion, Word).
Handling Recognition Errors
Every model makes mistakes. Common issues:
- Names and proper nouns — "Tim Ferriss" might become "Tim Ferris" or "Tim ferris." Check capitalization and spelling.
- Numbers — the model may write "fifteen" instead of "15" or vice versa. Standardize.
- Domain jargon — specialized terminology may be garbled. Cross-reference with context.
- Homophones — "their/there/they're," "affect/effect" — check these manually.
Stage 3: Editorial Processing
Spoken language differs from written language. A verbatim transcript contains:
- Filler words: "um," "uh," "you know," "like," "so basically"
- Repetition: "we decided, we made the decision, we agreed"
- False starts: "and then we — well actually, what happened was —"
- Grammatical irregularities: people don't speak in perfectly formed sentences
The editor's job is to remove noise while preserving the speaker's voice and meaning. Guidelines:
- Remove fillers, keep personality. If your subject says "that's a game-changer" — keep it, it's their style. But "um, so, like" — remove.
- Consolidate repetition. "We decided, we made the decision to pivot" becomes "We decided to pivot."
- Complete fragments. If a thought trails off but the meaning is clear from context, complete it. If not, cut it.
- Verify edited quotes. If you significantly altered a statement, send it to the subject for approval. This is both ethical and protective.
Stage 4: Structure
Interviews rarely follow a clean narrative arc. Subjects jump between topics, circle back, and digress. For publication, the text needs structure:
- Group answers by theme, not chronological order
- Add subheadings that tell the story
- Pull the strongest quotes and feature them prominently
- Cut sections that don't add value for the reader
Stage 5: Format and Publish
Q&A Format
Classic interview format: journalist's question, subject's answer. Best for expert interviews and profile pieces. Clean, easy to follow, lets the subject's voice dominate.
Narrative With Quotes
Author-driven text with embedded quotes from the subject. Best for feature stories, profiles, and investigative pieces. Gives the writer more control over pacing and context.
Summary Format
Brief overview with key quotes. Best for news articles and digests. Fastest to produce, least editorial investment.
Pre-Publication Checklist
- All names, titles, and company names verified
- Numbers and dates cross-checked with sources
- Quotes approved by subject (if required by your publication's policy)
- Text proofread for typos and grammatical errors
- Headline and meta description written
- Featured image selected and captioned
- Internal and external links added where relevant
Time Comparison
| Stage | Manual workflow | With transcription |
|---|---|---|
| Transcription (1 hour) | 3–4 hours | 5 minutes |
| Editorial processing | 1–2 hours | 1–2 hours |
| Structuring | 30–60 minutes | 30–60 minutes |
| Formatting | 30 minutes | 30 minutes |
| Total | 5–7.5 hours | 2–3.5 hours |
The time saved on transcription alone — 3+ hours per interview — is the biggest single productivity gain in the workflow.
Conclusion
Automated transcription eliminates the most tedious stage of interview publishing. The creative work — editing, structuring, polishing — still requires human judgment and skill. But now you have the time and energy to do it well, instead of arriving at the editing stage already exhausted from hours of manual transcription.