A general representation modal across vision, audio, language modalities | Dark Hacker News