Gentle: Combine audio and transcripts to get exact word timings and phonemes | Dark Hacker News