AI Datacenters Were Built for GPUs. What Happens When You Remove the GPUs?

AI Datacenters Were Built for GPUs. What Happens When You Remove the GPUs?(almartis.xyz)

32 points by AlassaneSakande 2 days ago | 22 comments

arnitdo 2 hours ago |

The whole website is genuinely unreadable / illegible with poor contrast.

zarzavat 1 hour ago | |

This is the third time I've seen a website with this styling (serif, yellow and white on black). What's going on? Is it a template or some AI induced convergence?

zknill 50 minutes ago | | |

Yeah, with only minimal guidance it's what you get out of claude. The colours and layout are pretty 'default'.

jbaiter 58 minutes ago | | |

I think it's Claude, yeah.

Zababa 1 hour ago | | |

Default output of claude code. Another obvious example is https://trumprx.gov/, with the background beige that's kinda close to the Hacker News one (to my eyes at least)

ptx 25 minutes ago | |

And the text appears to fade in on scroll, so you see literally nothing while scrolling down. Wonderful UX. It's like deliberately reintroducing a worse version of the checkerboard pattern the original iPhone used to display when the CPU couldn't handle the page rendering in time.

youngtaff 1 hour ago | |

For what ever reason reader mode appears to show different content

Jolter 51 minutes ago | | |

It does for me too, in Safari on iPhone.

DesaiAshu 1 hour ago | |

Looks like a font rendering or retina issue

yashasolutions 1 hour ago | |

yes, they should increase the font size as a first step.

xxs 1 hour ago | | |

I can do "ctrl + +" to increase the font, but it's still serif and low contrast, so I have to do "ctrl + A". Or better yet - press "reader view" on firefox.

adev_ 1 hour ago |

> For the past few decades, building a datacenter has been a well-understood, predictable exercise in utility engineering.

> In modern AI clusters, the network is no longer just infrastructure sitting beneath compute

It always make me smile when someone is presenting these kind of topologies as "New", "Modern A.I" or anything remotely "Revolutionary".

The HPC domain and any decent supercomputers have been doing RDMA networking centered around "all-to-all" and "all-reduce" operations for at least 3 fucking decades now.

They are the main reasons supercomputers are almost always constructed around stupidly complex Torus or Dragonfly network topologies.

MPI itself has these primitives defined from v1.

The only difference now is that it switch from "This niche thing 3 nerds were using for weather simulations" to "this cool thing any hyperscaler NEED to have for *A.I*"

ramon156 50 minutes ago | |

> someone

in this context, that someone is an AI bot that spat out words.

ElFitz 1 hour ago |

I never considered the implications and impacts on datacenters' architecture and organisation. It’s fascinating.

AlassaneSakande 2 days ago |

We're two data center networking engineers who've spent years designing and operating data center infrastructure for governments, telcos and banks in West Africa. This piece came out of our work on a new AI architecture based on associative memory rather than transformers. The GPU-free argument here is something we think about the next phase of AI networking. Happy to discuss further about it.

PS: Taking a look at our manifesto (https://almartis.xyz/) can help with more context.

throw23232 1 hour ago | |

Very interesting of course, but stuff like this just needs a demo not a book. Can be super simple, but it needs to be demonstrated somehow.

Animats 2 hours ago | |

Yes, read that. What these people are talking about seems to replacing training of NNs by something else entirely. The big question is, does that work? At all?

It's premature to discuss network architecture until that basic question is answered.

fc417fc802 1 hour ago | | |

I'm maybe 10% of the way in but I find I'm increasingly skeptical. If the basic building block dates back to the 1970s haven't other people tried this by now? If not, isn't the first order of business to throw together a prototype that solves MINST or one of the many other small datasets floating around out there as a proof of concept?

So unfortunately I'm inclined to assume this is empty conjecture shat out by an LLM. Because who would write something up in this much detail rather than typing `import numpy as ...` and going to town?

I'll also note that the document has all the usual crank signs. Lots of grand visions, hypotheses, and expounding at an overly high level on how various things work with hardly anything concrete.

RetroTechie 53 minutes ago |

Datacenters are being built for AI. What happens when you remove the AI workload?

Don't get me wrong. I don't mind when some tech bros burn billions of venture capital & nothing much (?) comes out of it.

But those datacenters embody a lot of resources. Raw materials, complex/resource heavy manufacturing processes for IC's, servers, networking gear, etc etc.

I sure hope that doesn't go to waste when the AI bubble pops. Datacenter stuffed with AI optimized hardware any good for general engineering? Science projects? Weather prediction? Web hosting? ...??

sinfulprogeny 34 minutes ago | |

Well there's always predictive policing, fully automated facial recognition, chat control surveillance systems, nation-scale fingerprinting, and location tracking via inference of arbitrary signals.

amelius 40 minutes ago | |

Rendering commercials, probably.