Text-Based Podcast Editing
Descript transcribes your podcast episode and links every word in the transcript to the corresponding moment in the audio. You edit the episode by editing the transcript — delete a sentence, and the audio disappears. Move a paragraph, and the audio reorders. Search for a specific phrase your guest said and jump directly to that moment. For a 60-minute interview, this means you can find and cut a 30-second tangent by reading text instead of scrubbing through audio trying to find where it starts and ends. It's the single biggest time-saver in podcast editing today. The limitation: text-based editing depends on accurate transcription. For clean audio with a single speaker, accuracy is around 95-98%. When two people talk over each other, accents are heavy, or audio quality is poor, accuracy drops and you'll need to correct the transcript before it's useful for editing. Also, sections without speech — music intros, sound effects, silence — don't appear in the transcript and need to be managed in the waveform timeline view. For shows that are 90% conversation, this barely matters. For produced shows with lots of non-speech audio, you'll spend more time in the timeline than the transcript.