What Extropic is building

185 points by jonbraun 2 years ago | 140 comments

I’m not a fan of Extropic, but I’m seeing a lot of misconceptions here.

They’re not building “a better rng”- they’re building a way to bake probabilistic models into hardware and then run inference on them using random fluctuations. Theoretically this means much faster inference for things like PGMs.

See here for similar things: https://arxiv.org/abs/2108.09836

There’s a company called Normal Computing that did something similar: https://blog.normalcomputing.ai/posts/2023-11-09-thermodynam...

winwang 2 years ago | |

Skimmed the litepaper. Has the flavor of: you can do "simulated" annealing by literally annealing. I like the idea of using raw physics as a "hardware" accelerator, i.e. analog computing. fwiw, quantum computing can be seen as a form of analog computing.

I do think that a "better rng" can be interesting and useful in and of itself.

Thanks for the Normal Computing post, it felt more substantial.

pclmulqdq 2 years ago | | |

I make a better RNG right now (https://arbitrand.com).

We experimented with doing ML training with it, but it's not clear that it trains any better than a non-broken PRNG. It might be fun to feed the output into stable diffusion and see how cool the pictures are, though.

apognwsi 2 years ago | | |

with error correction, qc is entirely distinct from analog computing. that is what makes it even remotely viable, theoretically.

lumost 2 years ago | |

It did make me curious however, if we dropped the requirement that operations return correct values in favor of probably correct values - would we see any material computing gains in hardware? Large neural models are intrinsically error correcting and stochastic.

I’m unfortunately not familiar enough with hardware to weigh in.

IshKebab 2 years ago | | |

The trouble is if you use actual randomness then you lose repeatability which is an incredibly useful property of computers. Have fun debugging that!

What you want is low precision with stochastic rounding. Graphcore's IPUs have that and it's a really great feature. It lets you use really low precision number formats but effectively "dithers" the error. Same thing as dithering images or noise shaping audio.

throwawaymaths 2 years ago | |

So it sounds like this startup is explicitly not using foundation models?

Is there any evidence that such a probabilistic model can run better than a state of the art model?

Or alternatively what would it take to convert an existing model (let's say, an easy one like llama2-7b) into an extropic model?

p1esk 2 years ago | | |

Is there any evidence that such a probabilistic model can run better than a state of the art model?

No, but they got 15M seed funding anyway.

autonomousErwin 2 years ago |

I wouldn't want to write this off because you get the feeling these guys are on to something that could be hugely important (ignoring quantum this thermodynamic that) - but surely it feels like they need to get to the point a lot faster e.g.

"We're taking a new approach to building chips for AI because transistors can't get any smaller."

I really don't know what they gain by convoluting the point and it's pretty hard to follow what the CEO is talking about half the time.

vipshek 2 years ago |

I have no idea about the merits of this approach, but I found this interview with the founders a lot more sensical than the linked article:

https://twitter.com/Extropic_AI/status/1767203839818781085

ein0p 2 years ago |

People need to read Hamming’s old papers in which he very clearly explains why analog circuits are not viable at scale. This is also why the brain uses spikes rather than continuous signals. The issue is noise, interference, and attenuation. There’s no way to get around this. If they have invented a way, I’d like to see it. But until it’s demonstrated, I’d take such things with a large grain of salt.

Animats 2 years ago | |

You can re-quantize analog signals into a finite number of levels to prevent noise accumulation. That's how TLC (8 levels) and QLC (16 levels) flash memory cells work. The cells store an analog value, but it's forced to a value close to one of N discrete values. The same approach is used in modems.

Deep learning doesn't seem to need that much numerical precision. People started with 32-bit floats, then 16-bit floats, now sometimes 8-bit floats, and recently there are people talking up 2-bit trinary. The number of levels needed may not be too much for analog. If you have a regenerator once in a while to slot values back to the allowed discrete levels, you can clean up the noise. That's an analog to digital to analog conversion, of course.

That's not what these guys are talking about, as far as I can tell.

twobitshifter 2 years ago | |

analog circuits are making a comeback because they are great for simulating the equations of the physical world more efficiently than a digital approach. https://spectrum.ieee.org/not-your-fathers-analog-computer

sfnrm 2 years ago | |

Sounds interesting. Do you have a link? (or at least a title?)

ein0p 2 years ago | | |

Not at the moment, but I do recall he has a chapter on this in his book “The Art of Doing Science and Engineering”, which I also recommend. He uses very long transmission lines to explain this, but the same thing applies at the nano scale, and perhaps to an even greater extent due to the much noisier environment and higher frequencies.

binoct 2 years ago |

I really hope this was an experiment in using gen AI:

“Create a website for a new company that is building the next generation of computing hardware to power AI software. Make sure it sounds science-y but don’t be too specific.”

thatguysaguy 2 years ago |

Uninmportant, but if you're citing Moore's paper I feel like you're just trying to pad out the references to make it look like you're serious

gitfan86 2 years ago | |

At a high level it is the right answer to the data center electricity demand problem. Which is that we need to make AI hardware more efficient.

Pragmatically, it doesn't make much sense given that it would take years for this approach to have any real work use cases in a best case scenario. It seems way more likey that efficiency gains in digital chips will happen first making these chips less economically valuable.

kneel 2 years ago |

This guy spends an extraordinary amount of time posting memes and e/acc silliness.

So much so I wonder what the hell they're doing with this company. Is he a prolific poster and an engineering genius? Or is he just another poster

Bjorkbat 2 years ago | |

For the longest time I thought the person behind the account was just some random guy who was probably very into crypto and decided to dabble in AI because of the parallels between e/acc and the whole "to the moon" messaging you find in crypto communities.

Never would have guessed the guy was an actual physicist

trzy 2 years ago |

Hard time believing this is legit given how much time the CEO spends goofing around on social media. If it were possible to short startups, this would be a top candidate.

jp42 2 years ago | |

honestly, it would be too early to say this. Considering the people who invested in this startup, its better to assume CEO is capable. If he is not able to deliver in reasonable timeline then, we all are free to blame him for posting things on SM. actually many knows his company because he is goofing around on SM especially e/acc stuff.

danielmarkbruce 2 years ago | | |

It's more interesting to see who passed on it. There isn't a single top tier VC here.

This whole pitch sounds like the usual quantum computing babble.

empath-nirvana 2 years ago |

So, basically this seems to be a way to replace PRNGs with real randomness with some knobs so you can adjust the distribution. Let's assume for the sake of argument that this can replace every single PRNG call in inference and training, how much savings in cost/energy/run time would there actually be?

Filligree 2 years ago | |

Assuming they're free: Essentially nothing. PRNGs are incredibly cheap.

pclmulqdq 2 years ago | |

This is a quantum computing company, specifically for quantum ML.

jdulay19 2 years ago |

Could someone smarter than me explain if this is a big deal or just hype? The work sound promising, but I wonder how long it would take to build and validate.

zachbee 2 years ago |

They're not wrong that sampling a complex, higher-dimensional probability distribution is hard to do efficiently. I'm not sure how useful it is to do it more efficiently, though.

Also, the fact that they're using ultra-cold superconductors makes me wonder how much noise helps and how much it hurts. If your system is all about leveraging noise well, but you can only use super special well-behaved noise, then "bad noise" could easily ruin the quality of your generated solutions.

It's cool to see something so wacky out there, though!

brizzbuzz 2 years ago |

interesting that a company w/ no public repositories has 1.1k github followers https://github.com/extropic-ai

echelon 2 years ago | |

It's led by the e/acc [1] founder, BasedBeffJezos [2]. He has a huge cult following. It's turned into a lot of Twitter memes and shitposting [3].

[1] https://en.wikipedia.org/wiki/Effective_accelerationism

[2] https://twitter.com/BasedBeffJezos

[3] https://knowyourmeme.com/memes/cultures/eacc-effective-accel...

Bjorkbat 2 years ago | | |

Honestly, as interesting as the the chip sounds, I'm admittedly kind of biased against the company's probability of success simply because the founder is basically the #1 e/acc meme account/shitposter on Twitter.

Like, it's hard to take someone seriously when they spend tons of time shitposting on Twitter, it's even harder when it's revealed that they're behind one of the most popular shitposting accounts within a niche, almost cult-like community.

ozr 2 years ago | |

The founder ('Beff Jezos') has a large twitter presence.

delichon 2 years ago | |

To be fair it isn't very common to detail proprietary hardware in github repos. And any code for such novel processors would be fascinating but useful only for theory rather than practice at the moment. The lack of open code is a missing merit badge rather than a demerit.

fermionik 2 years ago |

Physical learning machines require noise to learn. They are also necessarily dissipative. See https://arxiv.org/abs/2209.11954. The key is to engineer the noise to maximise the learning rate. In classical devices, stochastic switching is controlled by temperature through the Kramers rate. This means kT controls energy loss. If you use dissipative quantum tunnelling this is not the true thermodynamic lower bound. Any quantum nonlinear dissipative system, with a far from equilibrium steady state, is a good case to consider. Dispersive optical bistability, realised in SC quantum circuits, is the way to go. And quantum error correction is unnecessary.

laserbeam 2 years ago |

The amount of buzzwords on this page should disqualify this from even getting votes on HN. Anyone who writes like this is trying to confuse and mislead the reader.

rvz 2 years ago |

Too early to tell about what this will be in the future. Either it turns out to be a foundational startup or a flash in a pan.

But at least it is not the 5000th so-called AI-powered SaaS company that is using OpenAI API that has raised $20M+ to VCs and burning hundreds of thousands every month with little to no plan to generate revenue.

Will be watching this one closely, but highly skeptical of this company.

ac2u 2 years ago | |

Hear hear, better to see someone go for broke trying something novel.

At best they advance the field massively, at worst the backers lose their money but the tech/knowledge finds a home elsewhere and the knowledge in the field is nudged forward.

AbrahamParangi 2 years ago |

Man, I am not a pessimist and I am very bullish on AI-the-field but my spidey sense is tingling that this is BS.

- It is written in a way that sacrifices legibility for supposed precision but because the terms used can't really be applied precisely, it's equivalent to spurious digits in a scientific calculation. The usual reason this occurs is to obfuscate or to overawe the audience.

- It is hard to overstate the difficulty of beating semiconductor with a wholly new branch of technology. They're so insanely good. People have been trying to beat them for decades and there's not even a solid theoretical thesis as to how to do so. Even the theoretical advantage of quantum computing is predicated on error correction being scalable which is a totally open question even theoretically.

liveoneggs 2 years ago | |

If room-temperature-stable bio-enhanced AI-specific-computer-powered chatbots don't seem like a realistic goal then maybe you should have clicked "play" on the linked spotify widget.

015a 2 years ago | |

For me its the dichotomy between how absolutely impenetrable the blog post is, combined with the "Set the tone fam, play 'Entropy' by Noizinski on Spotify :)" widget in the bottom right. Like they're trying to check every box on the engagement farming list (something, to be sure, beff jezos is famous for).

Very bad vibes. Hire someone who can communicate, and demonstrate what you're building.

danielmarkbruce 2 years ago |

It feels like serious people would have said something more like "we are going to improve the performance (measured in s), of the algorithms/models such X, Y, Z which are used in a, b, c."

Can anyone name a company which used such absurd language to describe themselves and then actually delivered something valuable? There must be one.

Eliezer 2 years ago |

I'm saddened to see the honorable name of Extropy and Extropianism, which carefully never descended to this level or anything like it, be stolen and captured by this nonsense.

arduanika 2 years ago | |

Is this sarcasm? (Genuinely can't tell.)

And also, are you the real Eliezer?

Eliezer 2 years ago | | |

No, not sarcasm, and I am Eliezer Yudkowsky. I was around on the old Extropians mailing list starting in 1996, and their leadership did not talk like this. Max More (the founder of Extropianism) was a careful thinker then, and I haven't heard anything different about him more recently than that.

"Extropy" is a term that was previously coined by a group of fairly nice people to describe themselves, and so far as I know is being stolen here without permission.

thom 2 years ago |

I've thought for a while that what quantum computing will probably deliver is not going to be magical infinite processing power, but extremely fast, computational access to parameterizable physical processes. That is, a rock can simulate being a rock better than a computer can, but how do you hook it up to the rest of your system? But while I can imagine replacing a simple MCMC model, for example, with a stack of physics-based chips, is there a path all the way to designing, training and executing something LLM sized on top of that technology? I'm not smart enough to know, but as esoteric as it sounds, it feels like it's drawing on the less speculative end of the spectrum, and seems like a noble effort and not an actual scam.

5cott0 2 years ago |

so far the only thing they’ve built is more posts

pphysch 2 years ago | |

TBF that's not a bad place to be in the current hype cycle. Better than releasing and being permanently written off as yet-another-ChatGPT-wrapper.

patcon 2 years ago |

I believe this link is communicating within the family of thought from which this blog post also comes:

https://knowm.org/thermodynamic-computing/

It's a random, unassuming 7-year-old blog post from a DARPA-funded and defense-involved inventor. They happen to work in neuromorphic computing. Their other posts talk about some of that work. A cynical take is that it can seem like just hand-wavey garbage, but then again, it's been quietly getting tons of defense contractor money.

I came across it years ago, and it has greatly accelerated my worldview, and has made me feel ahead of the curve in understanding what is going on in the universe. It's informed my community organizing. It's informed how I understand AI and consciousness and language, and the intersection of all these things.

I'm inclined to believe that the people in this area are clued into something very substantial about how the universe works.

EDIT: oops, shared the wrong link. This one is about thermodynamic evolution

Delumine 2 years ago |

Seems like they're "passive" energy chips are only gonna be targeted $$$ towards big organizations, which make use of the Josephon effect. But if they're targeting transistor technology for the masses, how will they have an advantage against the incumbents

rabidsnail 2 years ago |

fund my new simulated annealing accelerator startup where we etch your model onto an aluminum flake and then hit it with a blowtorch

dark_jensen 2 years ago |

for all the hype around building alien tech, this is a bit underwhelming. the stuff from this startup feels more alien than what extropic is talking about - https://www.emergentia.tech/technology

powera 2 years ago |

Not only does this read like pure bullshit, it is bullshit on a website that crashes the Apple Vision Pro (and makes my laptop suffer).

My prediction is that they will raise a nine-figure sum over the next decade, and never release a product that comes close to the performance of an NVIDIA card today.

rho4 2 years ago |

improve title pls

htrp 2 years ago |

the engineering alone will be a nightmare

DrDroop 2 years ago |

I know everyone is calling BS on this, and I am just a simple web developer so what do I know but there are at least two priors that make me think that what is discussed here could have some validity.

* The stochastic/random nature of processors is already used in cryptography for physically uncloneable functions. Dunno if this has any practical uses in industry, and it is crypto, so it is probably also BS, but it is the same phenomena you get if you log in into your BIOS and turn off ECC of your RAM.

* The very first computer capable of MCMC was designed by von Neumann himself and used uranium as a source of randomness as part of the Manhattan project.

Anyway semiconductors have never been my strong suit, but I guess this is more of a IP play then a consumer product business. Now let me get back to writing unit tests.

spiantino 2 years ago |

Obvious grifty nonsense.

dkarras 2 years ago |

smells like snake oil. will probably end up becoming a cryptocurrency scam? or some other grift? time will tell.

>Extropic is also building semiconductor devices that operate at room temperature to extend our reach to a larger market.

funny stuff

Bnjoroge 2 years ago |

meh, lmk when they actually ship something that's not bs

jason-phillips 2 years ago |

Comments read like a confessional from out of the loop.

sergiotapia 2 years ago |

The litepaper discusses Extropic's mission to develop a novel hardware platform that harnesses the natural fluctuations of matter as a computational resource for Generative AI.

Key Points

The demand for computing power in AI is increasing exponentially, but Moore's Law is slowing down due to fundamental physical limitations of transistors at the atomic scale.

Biology hosts more efficient computing circuitry than current human-made devices by leveraging intrinsic randomness in chemical reaction networks.

Energy-Based Models (EBMs) are a potential solution, as they are optimal for modeling probability distributions and require minimal data. However, sampling from EBMs is difficult on digital hardware.

Extropic is implementing EBMs directly as parameterized stochastic analog circuits, which can achieve orders of magnitude improvement in runtime and energy efficiency compared to digital computers.

Extropic's first processors are nano-fabricated from aluminum and run at low temperatures where they are superconducting, using Josephson junctions for nonlinearity.

Extropic is also developing semiconductor devices that operate at room temperature, sacrificing some energy efficiency for scalability and accessibility.

A software layer is being built to compile abstract specifications of EBMs to the relevant hardware control language, enabling Extropic accelerators to run large programs.

---

Is this real or just theoretical?