Show HN: I've built a spectrogram analyzer web app (webfft.net)

244 points by ssgh 3 years ago | 69 comments

jboy55 3 years ago |

Here is a spectrogram of the track, "Look" from the album, "Songs about my Cats" by Venetian Snares.

Aphex Twin did something similar, but this is more playful in my opinion.

epiccoleman 3 years ago | |

Mick Gordon did some fun hidden spectrogram imagery in the Doom 2016 soundtrack.

He talks about that and plenty of other cool stuff in his talk at the 2017 GDC conference. One of my favorite conference talks ever, he did so much cool experimentation to get the sounds he used on the soundtrack, and watching his talk is one of those moments where you really get to see a master of his craft let loose and explain his process.

https://youtu.be/U4FNBMZsqrY

quickthrower2 3 years ago | |

https://venetiansnares.bandcamp.com/track/look

Warning - this music freaked my dog out!

jo-m 3 years ago | |

I once (badly) did something similar as a student [1].

Unfortunately it's in Matlab, so I cannot run it anymore.

[1] https://jo-m.ch/posts/2015/01/hack-the-spectrum-hide-images-...

sorenjan 3 years ago | | |

If you want to run it again you could try Octave, an open source Matlab alternative.

https://octave.org/

scotteric 3 years ago | | |

You know a personal license is only $149 USD, right? You could then run your old code. Toolkits for home are $49.

lightweightbaby 3 years ago | | |

I also made something similar using Python a long time ago [1]. It's an extremely simple script, so it should still work.
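For readers wondering how an image ends up in a spectrogram, here is a minimal sketch of the general idea. It is not the linked script: the function name `image_to_samples` and all parameter defaults are hypothetical. It assumes a grayscale image given as rows of floats in 0..1, maps each row to one sine frequency and each column to a short time slice, and uses pixel brightness as the sine's amplitude.

```python
import math

def image_to_samples(image, sample_rate=8000, col_seconds=0.05,
                     f_min=500.0, f_max=3500.0):
    """Render a grayscale image (rows of 0..1 floats, top row first) as audio.

    Hypothetical helper for illustration only.
    """
    rows = len(image)
    cols = len(image[0])
    n_per_col = int(sample_rate * col_seconds)
    # The top image row maps to the highest frequency so the picture
    # appears upright in a spectrogram.
    freqs = [f_max - (f_max - f_min) * r / max(rows - 1, 1)
             for r in range(rows)]
    samples = []
    for c in range(cols):
        for n in range(n_per_col):
            t = (c * n_per_col + n) / sample_rate
            s = sum(image[r][c] * math.sin(2 * math.pi * freqs[r] * t)
                    for r in range(rows))
            samples.append(s / rows)  # keep the sum within [-1, 1]
    return samples
```

The returned floats could then be scaled to 16-bit integers and written out with the standard `wave` module. Hard switches between columns cause clicks, so a real tool would probably fade each column in and out.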

[1] https://github.com/DanielAllepuz/ImageToSound

syx 3 years ago | |

This is very interesting. What was the Aphex Twin track with this concept?

twelvechairs 3 years ago | | |

It's usually called 'formula' or 'equation'. It's the B-side of 'Windowlicker'. There's a video at the link below.

https://www.reddit.com/r/Damnthatsinteresting/comments/kvjil...

swah 3 years ago | | |

https://www.youtube.com/watch?v=wSYAZnQmffg at around 5m30

jcelerier 3 years ago | |

From the look of the pictures there's a log() missing somewhere, no?
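If the display really does map linear magnitude to brightness, the usual fix is a decibel scale. The sketch below shows that conversion; the function names and the -100 dB floor are assumptions for illustration and are not taken from the app.

```python
import math

def magnitude_to_db(magnitude, floor_db=-100.0):
    """Convert a linear magnitude to decibels, clamped at floor_db."""
    if magnitude <= 0.0:
        return floor_db  # log10 is undefined at zero, so clamp
    return max(20.0 * math.log10(magnitude), floor_db)

def db_to_brightness(db, floor_db=-100.0):
    """Map the range [floor_db, 0] dB onto [0, 1] pixel brightness."""
    return min(max((db - floor_db) / -floor_db, 0.0), 1.0)
```

With a linear scale, a component at one hundredth of the peak magnitude is nearly invisible. On this dB scale it sits at -40 dB, which maps to a brightness of about 0.6.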

ssgh 3 years ago |

Author here. This is a basic spectrogram visualizer that's mobile friendly. It lets you select regions on the spectrogram and play them separately. There is no grand plan behind this web app: it's just a handy basic tool to capture sounds on your phone and see what they look like.

orbisvicis 3 years ago |

When I read about ultrasonic cross-device trackers in advertising [1], I installed "org.woheller69.audio_analyzer_for_android" and "hans.b.skewy1_0" (automatic ultrasonic detection) and started scanning through TV channels after running some test tones. Suffice it to say I didn't find any, but the entire process was quite fun. There's also "org.billthefarmer.scope", which is an oscilloscope with a spectrum (not spectrogram).

1. https://arstechnica.com/tech-policy/2015/11/beware-of-ads-th...

vishnuharidas 3 years ago |

Web apps like this that access users' data should provide samples for users to experiment with and explore before they have to give access to their actual data.

ssgh 3 years ago | |

Very reasonable ask, sir. In fact, I had added sample.mp3, but forgot to add a button in the UI. Now that's fixed.

nick_m 3 years ago |

Brilliant work - I "get" how this works, I've just spent about half-an-hour playing with this (Chrome browser on my kitchen ChromeBook), singing into it and letting it "listen" to the ambient background noise here (old cooker clock ticking, fridge compressor rumbling occasionally). Useful, educational, and fun also - thanks for publishing/hosting this so others can enjoy it!

vjerancrnjak 3 years ago |

Very nice app.

I usually use Audacity to inspect the spectrogram of FLAC files and see if they really are 44100Hz or if someone packaged a constant rate 320kbps mp3 encode into a FLAC file.

Now I can just check it in my browser :D
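The visual check can also be done numerically. Lossy encoders usually low-pass the signal, so a transcoded file tends to show a sharp drop well below the Nyquist frequency. This is a rough sketch, assuming you already have an averaged magnitude spectrum covering 0 Hz up to Nyquist; `estimate_cutoff_hz` and its threshold are hypothetical, and the result is a hint, not proof.

```python
def estimate_cutoff_hz(magnitudes, sample_rate, threshold_db=-80.0):
    """Return the highest frequency whose magnitude is within
    threshold_db of the spectrum's peak.

    magnitudes: linear magnitudes for bins spanning 0 Hz..Nyquist inclusive.
    """
    if len(magnitudes) < 2:
        return 0.0
    peak = max(magnitudes)
    if peak <= 0.0:
        return 0.0
    limit = peak * 10.0 ** (threshold_db / 20.0)
    bin_hz = (sample_rate / 2) / (len(magnitudes) - 1)
    # Walk down from Nyquist until a bin carries real energy.
    for k in range(len(magnitudes) - 1, -1, -1):
        if magnitudes[k] >= limit:
            return k * bin_hz
    return 0.0
```

A 44100 Hz file whose estimated cutoff sits far below 22050 Hz deserves a closer look, although quiet or naturally band-limited recordings can produce the same picture.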

firefoxd 3 years ago |

Simple, straight to the point, and super useful.

One place I used these was on a toy AI assistant. I recorded myself saying a trigger word thousands of times, cut the audio into pieces, and converted each to a spectrogram image. I then fed those to a training model to help recognize the trigger word.

Before the spectrogram, I was feeding the wav file in directly, which was incredibly intensive on my laptop. The image files were easier to process in real time. This tool can be used for debugging.

jxmorris12 3 years ago | |

How would this work with AI? Don’t you need to train the model to discriminate between the trigger word and other words? If all that’s seen during training is the trigger word, the model will just learn to say “yes” to everything, if you get what I mean.

firefoxd 3 years ago | | |

Yes, I have recorded myself talking on the phone for hours as well. I should have clarified that.

HarHarVeryFunny 3 years ago |

Nice - very fast (using WebGPU?).

I like the interesting ability to play a "rectangular" (time + frequency limited) section of the audio.
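One way to think about playing a "rectangular" selection: take the samples inside the chosen time range, transform them, zero every frequency bin outside the chosen band, and transform back. The sketch below does this for a single frame with a naive DFT. It is an illustration only and not how the app is implemented; `band_limit` is a hypothetical name, and a real tool would use an FFT with overlapping windows.

```python
import cmath

def dft(x):
    """Naive O(n^2) discrete Fourier transform."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                for t in range(n)) for k in range(n)]

def idft(spectrum):
    """Naive inverse DFT."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n)
                for k in range(n)) / n for t in range(n)]

def band_limit(frame, sample_rate, f_lo, f_hi):
    """Keep only the components of frame between f_lo and f_hi (in Hz)."""
    n = len(frame)
    spectrum = dft(frame)
    for k in range(n):
        # Bins above n/2 hold negative frequencies; fold them back
        # so both halves of the spectrum are filtered symmetrically.
        freq = min(k, n - k) * sample_rate / n
        if not (f_lo <= freq <= f_hi):
            spectrum[k] = 0j
    return [v.real for v in idft(spectrum)]
```

Slicing the sample array first gives the time limit, and `band_limit` gives the frequency limit. Together they cover the rectangle.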

ssgh 3 years ago | |

I do have a WebGL-based implementation of FFT, but here I used good old JS. When properly written, it gets translated into really fast machine code, which is even faster than WebAssembly (I tried!). WebGL's problem is the high toll on the CPU-GPU bridge. When you need to transfer a block of audio data from CPU to GPU to perform calculations, you wait. When you need to transfer the FFT data back, you wait. These waits quickly outweigh everything else. However, for wavelet transforms the GPU comes out ahead because you can store some pre-computed FFTs on the GPU and reuse them across multiple runs.
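For reference, the core of an FFT is small. Below is a textbook recursive radix-2 Cooley-Tukey sketch in Python. It is not the author's JS code, and a tuned implementation would more likely be iterative, in-place, and built on typed arrays with precomputed twiddle factors.

```python
import cmath

def fft(x):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    if n == 0 or n % 2:
        raise ValueError("length must be a power of two")
    even = fft(x[0::2])  # transform of even-indexed samples
    odd = fft(x[1::2])   # transform of odd-indexed samples
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out
```

Each call splits the input in half, which gives O(n log n) total work compared with O(n^2) for a direct DFT.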

brunorsini 3 years ago |

iZotope, associated with MIT researchers, makes arguably the best such tool for the pro audio industry. Their RX suite is truly miraculous, allowing audio engineers to visualize frequencies in a similar manner, but also offering brush-like tools to do things such as "deleting a dog bark from a guitar take" fairly easily.

an-unknown 3 years ago | |

Seems like you never saw or used SpectraLayers (commercial tool from Steinberg) or Sonic Visualiser (OSS project). Both have much more advanced visualization capabilities than RX. However, RX definitely has the more advanced "semi-automated" editing / repair features.

brunorsini 3 years ago | | |

I've witnessed a large number of studios across the US and Latam using RX on a regular basis — places recording anything from indie stars to Grammy-winning artists.

peepwaah 3 years ago |

Can you recommend any good references to begin understanding spectrograms? I work in DL-based noise cancellation, and a major part of my work involves analyzing spectrograms. I find it very difficult to do my work without being able to critically analyze these images. Any help from anybody?

bitsinthesky 3 years ago |

This is a lot of fun

nixpulvis 3 years ago |

Would be awesome if it told me what file types it supported and also helped transcode some things like videos.

Looks very interesting though.

djmips 3 years ago |

Very nice, I tried it on my phone and really enjoyed being able to intuitively select regions to play back. Very fun.

k8si 3 years ago |

What we really need is PraaS (Praat as a Service). Praat Cloud Edition. Etc.

ddingus 3 years ago |

I like this! Easy to use, fun, useful.

Nice work.