EDIT: I could also definitely see Audapolis being useful if you could integrate it into a podcast's post processing flow (volume normalization, de-essing) by recognizing certain verbal tics and automatically removing them from the audio such as "ummmm...", etc.
Disclaimer: I work at Descript
I've always liked the idea of Descript and was considering building something similar before it came out. The problem is my use case is a couple of videos a year so doesn't fit with an expensive monthly subscription
Take a look at https://matcha.video
This functionality is some of my favorite when editing videos in Descript. It’s so much easier than chopping up waveforms in Audacity
[0] descript.com/
> Hindenburg’s manuscript feature gives you a complete overview of your audio. You can select the text just as you would in a text document and watch as your edits are made in real-time. If you need to export your text in a specific format, no problem. Hindenburg supports the most common text and transcription export formats.
I built something similar here: https://bigwav.app
What do you typically do with the text on export? E.g. Do you parse the times?
A number of comments turned me onto Descript -- made a similar comment on another audio thread recently: drives me absolutely insane how all audio tools with any AI are web based monthly saas instead of offline private gpu upfront purchase.
What does that have to do with non-Web-based applications?
Is 1 emoji for each commit title a new trend?
Can anyone clarify if this project is active?
I'm stuck in editing hell right now, and it would be very nice to just visually scroll past a few pages of pre-episode bullshitting and be able to wipe out whole minutes at a stretch, without having to listen to the whole thing. Even at increased speed, it's a bit of a slog.
And why the rude response?
It's not a good idea, but then tons of the LLM ideas we see here aren't either.
I think I just lack experience in this area. I've used Audacity to cut out parts of audio / splice together two clips and that's about it, so I clearly don't have enough background to understand what this tool does.
Can someone clarify what this tool does, please? :)
What it does not do is generate new words (ie you type a sentence and it adds that to your file as voice).
I haven't found another app that reduces or removes these.
1) You can add more timestamps by adding paragraphs with enter.
2) You can playback at any word by highlighting it and pressing space. You can also cut with right click on the wav ui.
Maybe there's not much value in editing what someone said after all
criminal value... maybe?
There are absolutely scams right now that use deepfakes to trick people.
Of course there is commercial value. The cost of reshooting video materials is huge. You made an advert mentioning 3 features, but by the time the product is about to be released one of them got dropped or even worse changed? Congrats, you need to get the talent and the studio rebooked, you need to find a new tech crew, who need to set up again. Probably things won't cut seamlessly so you need to re-record the whole thing.
On the other hand it does say "not waveform" which I think makes it pretty clear. What would you suggest instead?
Also, there's often no perfect combo of words, there's a spectrum of options and you just pick an operating point. Transcription is a longer word than "word" so there's a tradeoff. It doesn't feel like a chasm to me.
At any rate when I first read it I thought it was going to be some sort LLM thing where you said "remove the third bridge and increase pitch by one octave in the outro" and it would give you back an edited mp4 which you could then listen and cringe to and sometimes say "whoa, that's amazing!"
Believe me, I don't care that you disagree. I just don't like to see people breaking the civility guidelines here as it's just about one of the last places online where discourse is largely held to a a civil level for disagreements.
I write copy professionally, among other things. If you don't care whether what you write is clear to almost all readers... then I suppose it doesn't matter. Most people do not want misunderstandings of their copy and most copy editors would flag that as unclear. The new version is much better.
I seriously disagree that this breaks any sort of social contract between you and I on the internet. It was intended to be mildly dismissive but not overly rude. There's a higher standard for communicating with care at work (you should care about your coworkers), but do you really think people on the internet have time for this shit? I don't know you guy.
If it makes y'all feel better my gen alpha kid will hurt my feelings in different ways and get revenge for you =_=
There may be 80-year-olds on here, but younger people don't have to buy your lame excuse either. Learn something and move on.