Nvidia R&D chief on how AI is improving chip design (hpcwire.com)

225 points by chuckjchen 4 years ago | 78 comments

I work on this team! (Specifically: applied deep learning research, chip design).

It's a shame to see so many people dismissing this work as marketing. I see lots of clever people working hard on really novel and interesting stuff, and I really do think that ML has real potential to customize a design much more "deeply" than traditional automation tools.

marginalia_nu 4 years ago | |

This is directed at AI marketing in general: "AI" has been used to market so much nonsense it's probably becoming a problem communicating actual interesting uses of AI. I very much get a dot com vibe off it, like nobody on the team knows how it works but we're sure we're gonna be rich somehow! In my head, I've begun substituting AI with "wizards" when I read it.

It's very much the sort of problem crypto is having. There are so many grifters that actual interesting uses of the technology are very hard to identify and take seriously.

maxwells-daemon 4 years ago | | |

I guess so, but the fact of the matter is that ML/AI is actually, right now, doing useful things that would have been impossible 10 years ago. I don't think I could say the same about crypto (as distinct from cryptography).

q-big 4 years ago | | |

> "AI" has been used to market so much nonsense it's probably becoming a problem communicating actual interesting uses of AI.

On the other hand: if the people who do serious work in this area don't call out this nonsense, they must accept that their (serious) work becomes devalued.

> It's very much the sort of problem crypto is having. There are so many grifters that actual interesting uses of the technology are very hard to identify and take seriously.

Here, the same holds.

PaulHoule 4 years ago | |

I think it's funny how "the old AI" had combinatorial optimization as a major theme, for instance

https://en.wikipedia.org/wiki/Travelling_salesman_problem

which is closely related to the central operation of logic, the canonical NP problem

https://en.wikipedia.org/wiki/Boolean_satisfiability_problem

as well as the playing of games like Chess, Poker, etc.

Modern neural networks also have optimization as a theme even when the output is a classification or something that doesn't look like optimization... That is, the network itself is trained to minimize an error function. People used these kinds of algorithms back in the 1980s to lay out chips

https://en.wikipedia.org/wiki/A*_search_algorithm

and it's only natural that new techniques of optimization (both direct and through heuristics like the neural network used in AlphaGo) are used today for chips.
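To make the A* reference concrete, here is a minimal sketch of grid (maze-style) routing with A*. It is an illustration only, not how any production router works: the function name `astar_route_length`, the `'#'` obstacle convention and the unit step cost are all assumptions made for this example.

```cpp
#include <cassert>
#include <cstdlib>
#include <functional>
#include <queue>
#include <string>
#include <tuple>
#include <vector>

// Hypothetical helper: length of the shortest 4-connected route between two
// cells of a routing grid, where '#' marks a blocked cell. Returns -1 if the
// target cannot be reached.
int astar_route_length(const std::vector<std::string>& grid,
                       int sr, int sc, int tr, int tc) {
    assert(!grid.empty());
    const int rows = static_cast<int>(grid.size());
    const int cols = static_cast<int>(grid[0].size());
    assert(grid[sr][sc] != '#' && grid[tr][tc] != '#');

    // Manhattan distance never overestimates on a 4-connected unit-cost grid,
    // so the first time the target is popped its cost is optimal.
    auto h = [&](int r, int c) { return std::abs(r - tr) + std::abs(c - tc); };

    std::vector<std::vector<int>> best(rows, std::vector<int>(cols, -1));
    using Node = std::tuple<int, int, int, int>;  // (f = g + h, g, row, col)
    std::priority_queue<Node, std::vector<Node>, std::greater<Node>> open;
    open.emplace(h(sr, sc), 0, sr, sc);
    best[sr][sc] = 0;

    const int dr[] = {1, -1, 0, 0};
    const int dc[] = {0, 0, 1, -1};
    while (!open.empty()) {
        auto [f, g, r, c] = open.top();
        (void)f;  // only used for ordering inside the queue
        open.pop();
        if (r == tr && c == tc) return g;
        if (g > best[r][c]) continue;  // stale queue entry, a cheaper one was found
        for (int k = 0; k < 4; ++k) {
            const int nr = r + dr[k], nc = c + dc[k];
            if (nr < 0 || nr >= rows || nc < 0 || nc >= cols) continue;
            if (grid[nr][nc] == '#') continue;
            const int ng = g + 1;
            if (best[nr][nc] != -1 && best[nr][nc] <= ng) continue;
            best[nr][nc] = ng;
            open.emplace(ng + h(nr, nc), ng, nr, nc);
        }
    }
    return -1;  // open list exhausted without reaching the target
}
```

The heuristic `h` is the only problem-specific piece, which is where a learned estimate could in principle be swapped in. A learned heuristic would generally not be guaranteed admissible, so the optimality argument in the comment would no longer hold.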

maxwells-daemon 4 years ago | | |

Yes, the way I see it, one of the major benefits of deep learning is that it lets you define functions (in the R^n -> R^m sense) that would be basically impossible to define with traditional programming techniques. I think this comes up a lot in subroutines of combinatorial optimization, like heuristics for guiding search on subsets of NP-complete problems. The fact that you can automatically evaluate the heuristic and train by RL is also very convenient.

xbmcuser 4 years ago | |

It is the same with a lot of the machine learning stuff posted here: the 2nd or 3rd comment is about how it could be achieved with normal algos, etc. But slowly, as more people start applying it to different problems, machine learning is solving many of them.

lvl102 4 years ago | |

To be fair, Nvidia does a lot of “selling” when they’re basically making money from crypto and CUDA monopoly.

hoosieree 4 years ago | |

When I briefly used Cadence's stuff I always thought about how fixing DRC errors could be crowdsourced as an "idle game" because it's so puzzle-like. The other thing was how it's even slower than Vivado...

Using RL to automate DRC fixes, and modeling standard cells as graph/flow problems are things I'd love to learn more about. What papers would you recommend reading to get started (for a grad student already familiar with machine learning basics)?

kraussvonespy 4 years ago | |

A quick tangent if you have time and can discuss it: some really interesting, effective and odd antenna designs came from AI:

https://ti.arc.nasa.gov/m/pub-archive/1244h/1244%20(Hornby)....

Have there been any odd, surprising or wildly efficient chip designs that have come out of the AI designs?

hardolaf 4 years ago | |

I saw the in-depth presentation at DAC. Until your company is willing to actually release your work, it's marketing.

CreateAccntAgn 4 years ago | |

Any word on how the accuracy/quality of final results compares to traditional flows? Are process variations handled differently (with regards to training or modelling) compared to IR? I assume traditional vendors (CDNS/SNPS/MENT) all have (or are working on) AI-driven tools as well. How do they compare?

maxwells-daemon 4 years ago | | |

In general, a function approximation solution like deep learning does worse on cases where exhaustively finding the exact optimum is possible (small combinatorial problems), but can be applied to much larger instances than the exact algorithms can.
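The trade-off described above can be sketched on a toy travelling salesman instance. A hand-written nearest-neighbor heuristic stands in here for a learned approximation (it is not deep learning, just the simplest inexact method that shows the same effect), and the names `tsp_exact` and `tsp_nearest_neighbor` are made up for this example.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <limits>
#include <numeric>
#include <vector>

using Matrix = std::vector<std::vector<double>>;  // symmetric distance matrix

// Exact: try every tour that starts and ends at city 0.
// Cost grows as (n-1)!, so this is only feasible for tiny instances.
double tsp_exact(const Matrix& d) {
    const std::size_t n = d.size();
    assert(n >= 2);
    std::vector<std::size_t> order(n - 1);
    std::iota(order.begin(), order.end(), 1);  // 1, 2, ..., n-1 (sorted)
    double best = std::numeric_limits<double>::infinity();
    do {
        double len = d[0][order.front()];
        for (std::size_t i = 0; i + 1 < order.size(); ++i)
            len += d[order[i]][order[i + 1]];
        len += d[order.back()][0];
        best = std::min(best, len);
    } while (std::next_permutation(order.begin(), order.end()));
    return best;
}

// Heuristic: always walk to the closest unvisited city. O(n^2), so it scales
// to large instances, but the tour it finds may be longer than the optimum.
double tsp_nearest_neighbor(const Matrix& d) {
    const std::size_t n = d.size();
    assert(n >= 2);
    std::vector<bool> visited(n, false);
    std::size_t cur = 0;
    visited[0] = true;
    double len = 0.0;
    for (std::size_t step = 1; step < n; ++step) {
        std::size_t next = n;  // n means "no candidate yet"
        for (std::size_t j = 0; j < n; ++j)
            if (!visited[j] && (next == n || d[cur][j] < d[cur][next])) next = j;
        len += d[cur][next];
        visited[next] = true;
        cur = next;
    }
    return len + d[cur][0];  // close the tour
}
```

For four cities on a line at positions 0, 1, -2 and 5, the exhaustive search finds a tour of length 14 while the greedy heuristic returns 16. On a problem this small the exact method wins; the heuristic only pays off once the factorial search becomes intractable.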

selimthegrim 4 years ago | |

Ha, this does sound awesome. Are you guys hiring?

rektide 4 years ago |

Did nvidia just promise us singularity? :)

Hard to read a talk like this from a pulpit & not notice the missing shout-outs to the incredibly super-fantastic open-source innovative projects like OpenROAD, which have been shipping amazingly well-routed-by-AI chips for a while now. There are papers galore you can cite, and many open source designs[1].

It's not like Nvidia is promising anyone else will benefit from this work. This seems to be very high-level coverage of what their R&D department is looking at, and perhaps or perhaps not using. The article makes it hard to find out what is available, and what has been published or otherwise deeply discussed (which is, I think, the best we can hope for from Nvidia, rather than real participation). There's only one paper linked, on NVCell[2], described as:

> The first is a system we have called NVCell, which uses a combination of simulated annealing and reinforcement learning to basically design our standard cell library.

This just feels like so much else going on in computing. WSL coming to Windows, the recent Unity vs Unreal topic[3]. It's hard to imagine refusing to participate with others. It's hard to imagine not being part of the open source community working shoulder to shoulder to push for better. NVidia patently doesn't get it, patently isn't participating, patently isn't there. It's cool we can hear what they are up to, but it's also extremely NVidia that they're doing it all on their own. Anyhow, looking forward to more AI-based chip power system design starting to emerge; that sounds like a good idea, NV.

[1] https://theopenroadproject.org/

[2] https://research.nvidia.com/publication/2021-12_nvcell-stand...

[3] https://news.ycombinator.com/item?id=31064552 (412 points, 3 days ago, 311 comments)
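For readers unfamiliar with the simulated annealing half of that NVCell quote, here is a generic sketch of the technique on a toy placement problem. It is not NVCell's algorithm and involves no reinforcement learning. The single-row layout, two-pin nets, cooling schedule and the names `wirelength` and `anneal_placement` are all assumptions chosen to keep the example short.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdlib>
#include <random>
#include <utility>
#include <vector>

using Net = std::pair<int, int>;  // a two-pin net connecting two cell ids

// Total wirelength when cell order[i] is placed in slot i of a single row.
int wirelength(const std::vector<int>& order, const std::vector<Net>& nets) {
    std::vector<int> slot(order.size());
    for (std::size_t i = 0; i < order.size(); ++i)
        slot[static_cast<std::size_t>(order[i])] = static_cast<int>(i);
    int total = 0;
    for (const auto& [a, b] : nets)
        total += std::abs(slot[static_cast<std::size_t>(a)] -
                          slot[static_cast<std::size_t>(b)]);
    return total;
}

// Simulated annealing over cell orderings: propose a random swap, always
// accept improvements, accept regressions with probability exp(-delta / T),
// and cool T geometrically. Returns the best ordering seen.
std::vector<int> anneal_placement(std::vector<int> order,
                                  const std::vector<Net>& nets,
                                  unsigned seed = 1, int steps = 20000) {
    assert(order.size() >= 2);
    std::mt19937 rng(seed);
    std::uniform_int_distribution<std::size_t> pick(0, order.size() - 1);
    std::uniform_real_distribution<double> unit(0.0, 1.0);

    int cost = wirelength(order, nets);
    std::vector<int> best = order;
    int best_cost = cost;
    double temperature = 10.0;  // arbitrary starting temperature
    for (int s = 0; s < steps; ++s, temperature *= 0.9995) {
        const std::size_t i = pick(rng);
        const std::size_t j = pick(rng);
        std::swap(order[i], order[j]);
        const int new_cost = wirelength(order, nets);
        const int delta = new_cost - cost;
        if (delta <= 0 || unit(rng) < std::exp(-delta / temperature)) {
            cost = new_cost;  // accept the move
            if (cost < best_cost) { best = order; best_cost = cost; }
        } else {
            std::swap(order[i], order[j]);  // reject: undo the swap
        }
    }
    return best;
}
```

Because the best ordering seen is tracked separately, the result is never worse than the starting placement, though nothing guarantees it is optimal. Per the quote, NVCell pairs annealing with reinforcement learning; how the two are combined is not described in the thread and is not modeled here.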

ImportOllie 4 years ago |

I don't understand the backlash here. The gist seemed to be that traditional tools, which are exact, take a long time to process complex designs. Deep learning offers a statistical approach that can give a 'coarse' prediction, and they're using this to reduce development time. That seems to make sense to me, especially in the earlier verification phases of the hardware design lifecycle.

To me this sounds like a good use-case of AI and Neural Nets. It doesn't appear to be looking to replace the traditional tools, just augment.

jonnycomputer 4 years ago | |

I seem to recall that the original title of the post was more sensationalist; something about replacing human designers.

b20000 4 years ago |

The last time I checked, autorouters were still not capable of doing all the routing on a multi-layer PCB properly, and manual work was still required to produce a decent design.

TomVDB 4 years ago | |

How is that a relevant comment in a discussion about ASIC design?

I hope you don't have the idea that chip routing is done manually.

beambot 4 years ago | | |

IIRC, Place & Route is a known NP-complete problem. In this regard, autorouters (whether IC or PCB) can benefit from "better" heuristics -- i.e. it's an optimization problem where AI can help.

tboerstad 4 years ago | | |

Are the analog parts (current mirrors etc) autorouted now?

I worked on MCU layout around 2011, and only the digital logic was autorouted/placed.

bsder 4 years ago |

What is extremely telling is what is missing ... Design Rule Checking (DRC) and Layout Vs Schematic (LVS).

These require:

1) Longer bit length arithmetic

32-bit float simply isn't enough. 64-bit float is close, but limited. You really want 128-bit integer. And nVidia isn't delivering that.

2) Real algorithmic improvements

We're still stuck with computational geometry algorithms that don't parallelize. It would be awfully useful if nVidia would actually research some new algorithms instead of just waving around the ML/AI marketing wand.

But, then, this is the company that built itself on benchmarketing, so ...
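To illustrate the bit-length point, here is a small sketch of a standard computational geometry primitive, the orientation test, done once in 64-bit float and once in 128-bit integer. The coordinates are chosen deliberately large to trigger the problem and are not taken from any real layout. `__int128` is a GCC/Clang extension rather than standard C++, and the function names are made up for this example.

```cpp
#include <cassert>
#include <cstdint>

// Sign of the cross product (b - a) x (c - a): +1 if c lies to the left of the
// directed line a->b, -1 if to the right, 0 if collinear.
// double has a 53-bit significand, so integer inputs above 2^53 are rounded
// before any arithmetic happens.
int orientation_double(std::int64_t ax, std::int64_t ay,
                       std::int64_t bx, std::int64_t by,
                       std::int64_t cx, std::int64_t cy) {
    const double abx = static_cast<double>(bx) - static_cast<double>(ax);
    const double aby = static_cast<double>(by) - static_cast<double>(ay);
    const double acx = static_cast<double>(cx) - static_cast<double>(ax);
    const double acy = static_cast<double>(cy) - static_cast<double>(ay);
    const double cross = abx * acy - aby * acx;
    return (cross > 0) - (cross < 0);
}

// Same predicate in exact integer arithmetic.
// Assumption: every coordinate has magnitude below 2^62, so each difference
// fits in 64 bits and the final result fits in a signed 128-bit integer.
int orientation_int128(std::int64_t ax, std::int64_t ay,
                       std::int64_t bx, std::int64_t by,
                       std::int64_t cx, std::int64_t cy) {
    const std::int64_t limit = INT64_C(1) << 62;
    auto ok = [limit](std::int64_t v) { return v > -limit && v < limit; };
    assert(ok(ax) && ok(ay) && ok(bx) && ok(by) && ok(cx) && ok(cy));

    const __int128 abx = static_cast<__int128>(bx) - ax;
    const __int128 aby = static_cast<__int128>(by) - ay;
    const __int128 acx = static_cast<__int128>(cx) - ax;
    const __int128 acy = static_cast<__int128>(cy) - ay;
    const __int128 cross = abx * acy - aby * acx;
    return (cross > 0) - (cross < 0);
}
```

With a = (0, 0), b = (1, 1) and c = (2^53 + 1, 2^53), the point c is one unit to the right of the line. The integer version reports -1. The double version rounds 2^53 + 1 down to 2^53 and reports the three points as collinear.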

productceo 4 years ago |

Please keep up the processing power progress!

Economics of the software industry (or at least the products that I work on) depend on the assumption that cost of computing (including storage) diminish exponentially over time! <3

kevincox 4 years ago | |

You say this like it is a good thing. It seems to me that if a whole industry is dependent on exponential growth of another, then the former is being quite reckless.

Of course exponential growth will help, but relying on it seems like a bit too much risk.

hurflmurfl 4 years ago | | |

I think that's the point GP is making with his sarcastic remark :)

W-Stool 4 years ago |

I've got a whole "HAL9000" feeling going here right now.

"Sorry Dave - I can't quite do that ..."

_blz2 4 years ago |

I remember him from the VLSI text by Dally and Poulton.

orangebeet 4 years ago |

I really hope that they can apply some of these AI approaches on the driver situation on Linux as well. I will never buy an Nvidia product after the nightmares they've put me through.

    #include <cmath>
    #include <iomanip>
    #include <iostream>
    #include <tuple>

    // solve ax^2+bx+c=0
    // Textbook formula: both b*b - 4ac and -b +/- t can cancel catastrophically.
    template<typename T>
    auto solve_quadratic(T const& a, T const& b, T const& c) -> std::tuple<T, T> {
        auto t = std::sqrt(b*b - T(4)*a*c);
        return {(-b + t) / (T(2)*a), (-b - t) / (T(2)*a)};
    }

    int main() {
        std::cout << "Expected x1=0.000000075 x2=-200.000000075\n";
        std::cout << "Single precision:\n";
        {
            float a=1.0f, b=200.0f, c=-0.000015f;
            auto [x1, x2] = solve_quadratic<float>(a, b, c);
            std::cout << "x1=" << x1 << ", x2=" << x2 << "\n";
            // Residuals: these would be 0 for exact roots.
            std::cout << "ax1^2+bx1+c=" << (a*x1*x1 + b*x1 + c) << "\n";
            std::cout << "ax2^2+bx2+c=" << (a*x2*x2 + b*x2 + c) << "\n";
        }

        std::cout << "\nExpected x1=1.000000028975958 x2=1.000000000000000\n";
        std::cout << "Double precision:\n";
        {
            double a=94906265.625, b=-189812534, c=94906268.375;
            auto [x1, x2] = solve_quadratic<double>(a, b, c);
            std::cout << std::setprecision(16);
            std::cout << "x1=" << x1 << ", x2=" << x2 << "\n";
            std::cout << "ax1^2+bx1+c=" << (a*x1*x1 + b*x1 + c) << "\n";
            std::cout << "ax2^2+bx2+c=" << (a*x2*x2 + b*x2 + c) << "\n";
        }
    }

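    // Variant that avoids subtracting nearly equal values: -b and
    // -sign(b)*sqrt(...) always have the same sign, so t is formed without
    // cancellation, and the other root follows from x1*x2 = c/a.
    template<typename T>
    auto solve_quadratic(T const& a, T const& b, T const& c) -> std::tuple<T, T> {
        // Keep the sign in T so the float instantiation is not silently promoted to double.
        T sign_b = b < T(0) ? T(-1) : T(1);
        T t = -b - sign_b*std::sqrt(b*b - T(4)*a*c);
        return {(T(2)*c) / t, t / (T(2)*a)};
    }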