Nvidia to Acquire Arm for $40B(nvidianews.nvidia.com) |
Nvidia to Acquire Arm for $40B(nvidianews.nvidia.com) |
Right now, I can see nVidia replacing Mali smartphone GPUs in low to mid-end Exynos SoCs and the like. But it's not like nVidia to want to be in that low-margin area.
Replacing these with what? What nvidia gpus can operate at that power envelope ?
Does anyone know if or how Apple will be affected by this? What are the licensing agreements on the ISA?
Owning ARM would make no sense for them as they would gain no IP but would have to deal with antitrust which would force them to continue licensing the ip to others, which is not a business they are in.
If ARM vanished tomorrow I doubt it would affect apple’s business at all.
I think it will also bring Google and Nvidia closer together
Nvidia already own the parallel compute/ML part of the datacenter and the Mellanox acquisition had brought the ability to compete in the networking part of the datacenter - but they were missing CPU IP, for tasks that aren't well matched to the GPU. This plugs that hole. They are in control of a complete data-center solution now.
(a) NVIDIA becomes a full-fledged full-stack house, they have both CPU and GPU now. They can now compete with AMD and Intel on equal terms. That has huge implications in the datacenter.
(b) GeForce becomes the reference implementation of the GPU, ARM processors now directly fund NVIDIA's desktop/datacenter R&D in the same way consoles and Samsung SOCs fund Radeon's R&D. CUDA can be used anywhere on any platform easily.
(c) Acqui-hire for CPU development talent. NVIDIA's efforts in this area have not been very good to date. Now they have an entire team that is experienced in developing ARM and can aim the direction of development where they want.
Basically there's a reason that NVIDIA was willing to pay more than anyone else for this property. And (Softbank's) Son desperately needed a big financial win to show his investors that he's not a fucking idiot for paying $32b for ARM and to make up for his other recent losses.
Or maybe, more accurately, the middle of the supply chain doesn't matter. The most value is at either end: raw materials and energy, and end products.
Or so it seems :p ;) xx
What a wild year! Let's not forget Apple announce they are transitioning from x86 to Arm :b.
Pure speculation (of course)...
To me (from a tech standpoint) this acquisition centers around three things we already know about Nvidia:
- Nvidia is pushing to own anything and everything GPGPU/TPU related, from cloud/datacenter to edge. Nvidia has been an ARM licensee for years with their Jetson line of hardware for edge GPGPU applications:
https://developer.nvidia.com/buy-jetson
Looking at the architecture of these devices (broadly speaking) Nvidia is combining an ARM CPU with their current gen GPU hardware (complete with Tensor Cores, etc). What's often left out of this mention is that they utilize a shared memory architecture where the ARM CPU and CUDA cores share memory. Not only does this cut down on hardware costs and power usage, it increases performance.
- Nvidia has acquired Mellanox for high performance network I/O across various technologies (Ethernet and Infiniband). Nvidia is also actively working to be able to remove the host CPU from as many GPGPU tasks as possible (network I/O and data storage):
https://developer.nvidia.com/gpudirect
- Nvidia already has publicly available software in place to effectively make their CUDA compute available over the network using various APIs:
https://github.com/triton-inference-server/server
Going on just the name Triton is currently only available for inference but it provides the ability to not only directly serve GPGPU resources via network API at scale but ALSO accelerate various models with TensorRT optimization:
https://docs.nvidia.com/deeplearning/triton-inference-server...
Given these points I think this is an obvious move for Nvidia. TDP and performance is increasingly important across all of their target markets. They already have something in place for edge inference tasks powered by ARM with Jetson but looking at ARM core CPU benchmarks it's sub-optimal. Why continue to pay ARM licensing fees when you can buy the company, collect licensing fees, get talent, and (presumably) drastically improve performance and TDP for your edge GPGPU hardware?
In the cloud/datacenter, why continue to give up watts in terms of TDP and performance to sub-optimal Intel/AMP/x86_64 CPUs and their required baggage (motherboard bridges, buses, system RAM, etc) when all you really want to do is shuffle data between your GPUs, network, and storage as quickly and efficiently as possible?
Of course many applications will still require a somewhat general purpose CPU for various tasks, customer code, etc. AWS already has their own optimized ARM cores in place. aarch64 is more and more becoming a first class citizen across the entire open source ecosystem.
As platform and software as a service continues to eat the world cloud providers likely have already started migrating the underlying hardware powering these various services to ARM cores for improved performance and TDP (same product, more margin).
Various ARM cores are already showing to be quite capable for most CPU tasks but given the other architectural components in place here even the lowliest of modern ARM cores is likely to be asleep most of the time for the applications Nvidia currently cares about. Giving up licensing, die space, power, tighter integration, etc to x86_64 just seems to be foolish at this point.
Meanwhile (of course) if you still need x86_64 (or any other arch) for whatever reason you can hit a network API powered by hardware using Nvidia/Mellanox I/O, GPU, and ARM. Potentially (eventually) completely transparently using standard CUDA libraries and existing frameworks (see work like Apex):
https://github.com/NVIDIA/apex
I, for one, am excited to see what comes from this.
Maybe Nintendo/Sony uses Nvidia cards on their developer machines? I imagine FreeBSD drivers aren't simply altruism on their part.
On the other hand, stagnation on other fronts:
- Nouveau (tried recently) is basically unusable on Ubuntu. As in the mouse/keyboard locks every 6 seconds.
- Proprietary drivers won't work with wayland
And since their stuff isn't open, the community can't do much to push Nouveau forward.
It's less of a driver and more of an operating system. Basically self-contained with all the support libraries it needs. Super easy to port to a new operating system and any driver improvement work on all OSes.
But the approach also has many downsides. It's big. It ignores all the native stuff (like linux's GEM interface).
It also has random issues with locking up the entire system. Like if you are debugging a process with the linux drivers, a breakpoint or a pause in the wrong place can deadlock the system.
I don't really see how this deal makes the CPU market worse -- wasn't the ARM market for mobile devices basically dominated by Qualcomm for years? Plus, the other existing ARM licensees don't seem to be impacted. On the other hand, I do see a lot of potential if Nvidia is serious about innovating in the CPU space.
Many of their AI libraries/tools are in fact open source.
They stand to be a force that could propel ARM’s strength in data center and desktop computing. For some reason you’re okay with the current x86 duopoly held by AMD and Intel, both who have their own destiny over CPUs and GPUs.
The HN crowd is incredibly biased against certain companies. Why not look at some of the potential bright sides to this for a more nuanced and balanced opinion?
because as a NVIDIA user of the last 20 years, I never saw such bright side when it comes to open source.
Its possible to dislike two things at once, its also possible to be wary of new developments that give much more market power to a notoriously uncooperative and closed company.
There is a big difference between having interest in a market, and being able to compete in it. There are also many trade-offs.
Nobody has designed yet a GPU architecture that works at all from 500W HPC clusters to sub 1W embedded/IoT systems, much less that works well to be a market leader in all segments. So AFAICT whether this is even possible is an open research problem. If this were possible, there would already be nvidia GPUs at least in some smartphones and IoT devices.
Aside from that, ARM was one of the only actual tech companies the UK could talk about on the so-called "world stage", that has survived more than 2 decades. But instead, they continue to sell themselves and their businesses to the US instead of vice versa.
In 2011, I thought that they would learn the lessons and warnings highlighted from Eric Schmidt about the UK creating long standing tech companies like FAANMG. [0] I had high hopes for them to learn from this, but after 2016 with Softbank and now this, it is just typical.
ARM will certainly be more expensive after this and will certainly be even more closed-source, since their Mali GPUs drivers were already as closed as Nvidia's GPUs. This is a terrible outcome I have seen but from Nvidia's perspective, it makes sense. From a FOSS perspective, ARM is dead, long live RISC-V.
[0] https://www.theguardian.com/technology/2011/aug/26/eric-schm...
Jensen's not an idiot, how many OG 90s tech CEOs are still at the helm of the company they founded? Any severely negative moves towards their customers just drives them into RISC-V and they know that.
Yes, ARM customers will be paying more for their ARM IP. No, NVIDIA is not going to burn ARM to the ground.
I think it's enough to know what Nvidia is, how they operate, and what their general strategies are.
Not saying I agree with the analysis... but I am not that optimistic.
https://www.oculus.com/blog/a-single-way-to-log-into-oculus-...
Discussed here on HN, with the top comment being a copy and paste of Palmer Lucky's acknowledgement that the early critics turned out to be correct:
https://news.ycombinator.com/item?id=24201306
Different situation, different companies, but if you "follow the money" you'll never be too far wrong.
If you take at face value anything from a press release, earnings call or investor relations website, then I would like to take a moment to share with you the prospectus of Brooklyn Bridge LLC.
I'm starting to feel like social media based on upvotes is a utter waste of time. Echo chambers and groupthink. People commenting on things they barely know anything about and getting validation from others who don't know anything. I'd rather pay for insightful commentary and discussion. I feel like reddit going downhill has pushed a new group of users to HN and it's sending it down the tube. Maybe it's time for me to stop participating and get back to work.
Tell me, what other AI platform works with x86 and PowerPC and ARM? Currently NVIDIA’s GPUs do.
Would you rather have TSMC in control of ARM? Maybe have access to new architectures bundled with a mandate that you have to build them on TSMC's processes?
How about Samsung? All of the fab ownership concerns of TSMC plus they also make basically any tech product you care to name, so all the integration concerns of NVIDIA.
https://asia.nikkei.com/Business/Technology/Key-Apple-suppli...
Microsoft? Oracle? None of the companies who could have afforded to pay what Son wanted for Softbank were any better than NVIDIA.
There are a lot of good things that will come out of this as well. NVIDIA is a vibrant company compared to a lot of the others.
Never mind the FTC - the rest of the semiconductor industry has to be [very] strongly opposed.
Anyone has a proper analysis on the ramifications of this acquisition for Apple's future in ARM?
I quite expect AMD for example to drop ARM chips from their hardware. Others should also follow suit. Nvidia is an awful steward for ARM.
> Despite popular misconceptions to the contrary, Horizon [The switch software's codename] is not largely derived from FreeBSD code, nor from Android, although the software licence[7] and reverse engineering efforts[8][9] have revealed that Nintendo does use some code from both in some system services and drivers.
That being said, at least one of the Playstation's runs a modified form of FreeBSD.
Edit: add [The ... codename]
I expect more of the same. The only way it could go wrong is if they lose customer focus. Microcontrollers are a competitive, near-commodity market, so companies have to provide valuable features.
I don't really know Nvidia well - I only buy their stock! - but they seem to be keeping their customers happy by paying attention to what they need. Perhaps their fabrication will be a boon to micros, as they're usually a few gens behind laptop/server processors.
FT provides insightful commentary from finance/business side of things and their subscription is expensive - rightfully so.
I'm not quite ready to end my participation in HN, but I'm close. I am looking back on the last 10 years of participation in forums like this and wondering what the hell good it did. I am also suddenly very worried for what sites like Reddit are doing to kids. That process of validation is going to produce some very anti-social, misguided adults.
I would rather participate in an argument map style discussion, or, frankly, just read the thoughts of 'experts'.
I don't see this as having that much of an impact on any short to medium term. ARM has too much intricate business dependencies and contracts nVidia can't just get out of.
My speculation is that nVidia might be what it takes to push arm to overcome the final huddle into more general purpose and server cpu, and achieve that pipedream of a single binary/ISA running everywhere. Humanity would be better off if a single ISA does become truly universal. Whether business/technology politics will allow that to happen and whether nVidia has enough understanding and the shrewdness to pull that off is to be seen.
This is an interesting thought. I think I would agree with this in the CPU world of the last 20-30 years, but it makes me wonder a few things. Might a universal ISA eliminate major pieces of competitive advantage for chip makers, and/or stall innovation? It does feel like non-vector instructions are somewhat settled, but vector instructions haven't yet, and GPUs are changing rapidly (take NVIDIA's Tensor cores and ray tracing cores for example). With Moore's law coming to an end, a lot of people are talking about special purpose chips more and more, TPUs being a pretty obvious example, and as nice as it might be to settle on a universal ISA, it seems like we're all about to start seeing larger differences more frequently, no?
> 80% of comments are low information, Dunning-Kruger effect in action.
I really liked all of your comment except this. I have a specific but humble request aside from the negative commentary & assumption about behavior. Please consider erasing the term "Dunning-Kruger effect" from your mind. It is being used here incorrectly, and it is very widely misunderstood and abused. There is no such effect. The experiments in the paper do not show what was claimed, the paper absolutely does not support the popular notion that confidence is a sign of incompetence. (Please read the actual paper -- the experiments demonstrated a positive correlation between confidence and competence!) There have been some very wonderful analyses of how wrong the Dunning-Kruger paper was, yet most people only seem to remember the (incorrect) summary that confidence is a sign of incompetence.
https://www.talyarkoni.org/blog/2010/07/07/what-the-dunning-...
ARM as an independent company, has been profoundly "neutral", allowing many companies to benefit from the ARM instruction set. It has been run very well: slightly profitable and an incremental value to all parties involved (be you Apple's iPhone, NVidia's Tegra (aka Nintendo Switch chip), AMD's EPYC, Qualcomm's Snapdragon, numerous hard drive companies, etc. etc.). All in all, ARM's reach has been because of its well-made business decisions that have been fair to all parties involved.
NVidia, despite all their technical achievements, is known to play hardball from a business perspective. I don't think anyone expects NVidia to remain "neutral" or "fair".
ARM being owned by an organisation deeply embedded in processor design and manufacturing, will now be licensing designs to competitors, as well as getting a lot of intel on its competitors.
ARM supercomputers were poised to take on Nvidia. Now its all one and the same.
As others have said, this will do wonders for RISC-V.
1. The continued conglomeratization in the tech sector is a worrying trend as we see fewer and fewer small players.
2. Only a large-ish company could provide effective competition in the CPU/ISA/architecture space against the current x86 duopoly.
The big practical issue is probably software compatibility and the like, and it seems to me that the Apple/macOS adoption will do more for that than nVidia ownership.
That's a good question. I didn't try to put all the necessary nuances in a single sentence, so you're right to question a lot of the unsaid assumptions. I don't know for sure at this point the innovation in ISA has run most of its course yet, but I do feel like we kind of have, given how relatively little difference it makes. I think a "truly" universal ISA, if it ever happens, would necessarily have to have a governance and evolution cycles, so that people will have to agree to the core part of the universal ISA, yet have a room and a way to let others experiment various extensions, including vector extensions for example, and have a process to reconcile and agree on the standard adoption. I don't know if that's actually possible - it might be very difficult or impossible for many different reasons. But if such can happen, that would be beneficial, as it would reduce certain amount of duplication, and unlock certain new possibilities.
> Please consider erasing the term "Dunning-Kruger effect" from your mind. It is being used here incorrectly
Duly noted.
If the only differing bits were the portability framework, this would just be a matter of adding missing support. But it isn't — the FreeBSD object file Nvidia publishes lacks the internal symbols used by the Linux driver.
In fact it is. For Linux and FreeBSD Nvidia distributes exactly the same blob for compilation into nvidia.ko; the blobs for nvidia-modeset.ko are slightly different. (Don't take my word for it, download both drivers and compare kernel/nvidia/nv-kernel.o_binary with src/nvidia/nv-kernel.o.) Nothing is locked in the closed source part.
It's maintained while simultaneously not receiving any new features.
The latest FreeBSD driver is 450.66, published 2020-8-18. Supports RTX 20xx, GTX 16xx, GTX 10xx, and older.
Are we going to simply say "Nu uh" at each other, or do you want to throw down some specific examples so I can show you how mistaken they are?
linus is a hyperbolic jerk (as he admitted himself for a few months before resuming his hyperbolic ways) who is increasingly out of touch with anything outside the direct sphere of his projects. Like his misguided and completely unnecessary rants about ZFS or AVX.
if there are technical merits to discuss you can post those instead of just appealing to linus' hyperbole.
(I won't even say "appeal to authority" because that's not what you're appealing to. You're literally appealing to his middle finger.)
You generally have to wrap those and use them while not being foss-compatible.
GPT-2 was trained on TPUs. (There are explicit references to TPUs in the source code: https://github.com/openai/gpt-2/blob/0574c5708b094bfa0b0f6df...)
GPT-3 was trained on a GPU cluster probably because of Microsoft's billion-dollar Azure cloud credit investment, not because it was the best choice.
To be fair, TPUv4 is not out yet, and it might catch up using the latest processes (7nm TSMC or 8nm Samsung).
For MLPerf 0.7, it's true that Google's software isn't available to the public yet. That's because they're in the middle of transitioning to Jax (and by extension, Pytorch). Once that transition is complete, and available to the public, you'll probably be learning TPU programming one way or another, since there's no other practical way to e.g. train a GAN on millions of photos.
You'd think people would be happy that there are realistic alternatives to nVidia's monopoly for AI training, rather than rushing to defend them...
You mean this Weyland "support"?
I'm not coming out and saying it's gotten significantly better, but that is a three year old article and Nvidia-wayland does work on KDE and Gnome.
For operating income it's 25%, for net income it's 18%; not 5%.
Last four quarters operating income for AMD: $884 million
Last four quarters operating income for Nvidia: $3.5 billion
This speaks to the dramatic improvement in AMD's operating condition over the last several years. For contrast, in fiscal 2016 AMD's operating income was negative $382 million. Op income has increased by over 300% in just ~2 1/2 years. Increasingly AMD is no longer a profit lightweight.
AMD 2019 Revenue: $6.73b [1] NVIDIA 2019 Revenue: $11.72b [2]
Roughly half, as I said.
AMD 2019 Profit (as earnings per share): $0.30 [1] NVIDIA 2019 Profit (as earnings per share): $6.63 [2]
4.52%, rounds to 5%, as I said.
However, you still proved my point. Lightweight or not, they do not, and have not had the amount of money available compared to NVidia and Intel. It's growing, they'll be able to continue to invest, and they have an advantage in the CPU space that should last for another year or two, giving them a great influx of cash, and their focus on Zen 2 really paid off allowing them greater cash flow to focus on GPUs as well.
[1] https://ir.amd.com/news-events/press-releases/detail/930/amd... [2] https://nvidianews.nvidia.com/news/nvidia-announces-financia...
> 4.52%, rounds to 5%, as I said.
You're misunderstanding how to properly compare profitability between two companies.
If Company A has 1 billion shares outstanding and earns $0.10 per share, that's $100m in profit.
If Company B has 10 billion shares outstanding and earns $0.05 per share, that's $500m in profit.
Company A is not 100% larger on profit just because they earned more per share. It depends on how many shares you have outstanding, which is what you failed to account for.
AMD's profit was not close to 5% of Nvidia's in 2019. That is what you directly claimed (as you're saying you went by the last full fiscal year).
AMD had $341m in net income in their last full fiscal year. Nvidia had $2.8 billion in net income for their last full fiscal year. That's 12%, not 5%. And AMD's operating income was 22% of Nvidia for the last fiscal year.
The trailing four quarters and operating income, is the superior way to judge the present condition of the two companies, rather than using the prior fiscal year. Especially given the rapid ongoing improvement in AMD's business. Regardless, even going by the last full fiscal year, your 5% figure is still wrong by a large amount.
Operating income is a key measure of profitability and it's a far better manner of gauging business profitability than net income at this point. That's because the modern net income numbers are partially useless as they will include such things as asset gains/losses during the quarter. If you want to read up more on it, Warren Buffett has pointed out the absurdity of this approach on numerous occasions (if Berkshire's portfolio goes up a lot, they have to report that as net income, even though it wasn't a real profit generation event).
I didn't say anything refuting your revenue figures, because I wasn't refuting them. I'm not sure why you mention that.
Wait, what? Why would transition to Jax imply transition to Pytorch?
Pointing this out is not aggressive.
Yes, absolutely.
NVIDIA's not going to burn the ARM ecosystem to the ground. They just paid $40 billion for it. And they only had $11b of cash on hand, they really overpaid for it (because SoftBank desperately needed a big win to cover for their other recent losses).
Now: will everybody (including AMD) probably be paying more for their ARM IP from now on? Yes.
Or more recently, when Facebook bought Oculus for $2 Billion, did you expect Facebook to betraying the customer's trust and start pushing Facebook logins?
The Oculus / Facebook login thin just happened weeks ago. Companies betraying the promises they made to their core audience is like... bread-and-butter at this point (and seems to almost always happen after an acquisition play). We know Facebook's modus operandi, and even if its worse for Oculus, we know that Facebook will do what Facebook does.
Similarly, we know NVidia's modus operandi. NVidia is trying to make a datacenter play and create a vertical company for high-end supercomputers. Such a strategy means that NVidia will NOT play nice with their rivals: Intel or AMD. (And the Mellanox acquisition is just icing on the cake now).
NVidia will absolutely leverage ARM to gain dominance in the datacenter. That's the entire point of this purchase.
--------
There's a story about scorpions and swimming with one on your back. I'm sure you've heard of it before. Just because its necessary for the scorpion's survival doesn't mean it is safe to trust the scorpion.
Yes, yes, and yes. This is Oracle we're talking about. Of course they'll invest more in lawyers than tech. The only reason they still invest in Java is the lawsuit potential. If only Google had the smarts to buy Sun instead...
As for NVidia, their play probably is integration and datacenters. At the moment, going after other ARM licencees will hinder NVidia more than help (they're going after x86, no time to waste on bad PR and legal issues with small time ARM datacenter licencees; Qualcomm and Apple are in a different segment altogether). Of course, we can't guarantee it stays that way.
yes? I mean that felt eminently possible from the get-go.
Look at them as a solutions company and the first question to answer is: why is it fine for companies like Oracle, Microsoft, Adobe etc and any other software company to profit from closed software yet a company should be ostracized for it as soon as hardware becomes part of the deal?
Nvidia invested 15 years in developing and promoting a vast set of GPU compute libraries and tools. AMD has only paid lip service and to this day treats it as an ugly stepchild, they don't even bother anymore to support consumer architectures. Nvidia is IMO totally justified to reap the rewards of what they've created.
wrt Oracle, Microsoft, Adobe, and the others: I've been asking them to open source key products every chance I get. Just so you know where I'm coming from.
CUDA support for docker containers is provided through the open source Nvidia-Docker project maintained by Nvidia[1]. If anything this is a great argument for NVIDIAs usage of open source.
Searching that project's issues shows that Nvidia-Docker support on MacOS is blocked by the VM used by Docker for Mac(xhyve) not supporting PCI passthrough, which is required for any containers to use host GPU resources.[2]
xhyve has an issue for PCI passthrough, updated a few months ago, which notes that the APIs provided by Apple through DriverKit are insufficient for this use case[3]
So your comment should really say "Because Apple"
[1] https://github.com/NVIDIA/nvidia-docker
[2] https://github.com/NVIDIA/nvidia-docker/issues/101#issuecomm...
[3] https://github.com/machyve/xhyve/issues/108#issuecomment-616...
I'm all for railing on the shitty things Nvidia does do, but no reason to add some made up ones onto the pile.
https://mesa-dev.freedesktop.narkive.com/qq4iQ7RR/egl-stream...
https://www.zdnet.com/article/linus-torvalds-i-hope-intels-a...
/shrug. The guy can't keep from popping off with obviously false statements about things he knows nothing about. What exactly do you want me to say? Yes, he's been a good manager for the linux kernel, but he is self-admittedly hyperbolic and statements like these show that he really doesn't have an issue running his mouth about things that he really doesn't understand.
It is the old problem with software engineers: they think expertise in one field or one area makes them a certified supergenius with relevant input in completely unrelated areas. I can't count how many times I've seen someone on HN suggest One Weird Trick To Solve Hard Problems in [aerospace/materials sciences/etc]. Linus suffers from the same thing.
His experiences with NVIDIA are probably relevant, and if so we can discuss that, but the fact that he gave someone the middle finger in a Q+A session is not. That's just Linus being an asshole.
(and him being a long-term successful project manager doesn't make him not an asshole either. Jensen's an asshole and he's one of the most successful tech CEOs of all time. Linus doesn't mince words and we should do the same - he's an asshole on a professional level, the "I'm just being blunt!" schtick is just a nice dressing for what would in any other setting be described as a textbook toxic work environment, and his defenses are textbook "haha it's just locker room talk/we're all friends here" excuses that people causing toxic work environment are wont to make. He knows it, he said he'd tone it down, that lasted about a month and he's back to hyperbolic rants about things he doesn't really understand... like ZFS and AVX. But hey I guess they're not directed at people this time.)
Again, if he's got relevant technical input we can discuss that but "linus gives the middle finger!!!!" is not the last word on the topic.
https://www.phoronix.com/scan.php?page=news_item&px=EGLStrea...
AFAIK it also ended up being literally a couple thousand lines of code, not some massive endeavor, so the Wayland guys don’t come off looking real great, looks like they have their own Not Invented Here syndrome and certainly a lot of generalized hostility towards NVIDIA. Like Torvalds, I'll be blunt, my experience is that a lot of people just know NVIDIA is evil because of these dozens of little scandals they’ve drummed up, and they almost all fall apart when you look into them, but people just fall back on asserting that NVIDIA must be up to something because of these 27 other things (that also fall apart when you poke them a bit). It is super trendy to hate on NVIDIA in the same way it’s super trendy to hate on Apple or Intel.
Example: everyone used to bitch and moan about G-Sync, the biggest innovation in gaming in 10 years. Oh, it's this proprietary standard, it's using a proprietary module, why are they doing this, why don't they support the Adaptive Sync standard? Well, at the time they started doing it, Adaptive Sync was a draft standard for power-saving in laptops that had languished for years, there was no impetus to push the standard through, there were no monitors that supported it, and no real push to implement monitors either. Why take 10 years to get things through a standards group when you can just take a FPGA and do it yourself? And once you've done all that engineering work, are you going to give it away for free? Back in 2016 I outright said that sooner or later NVIDIA would have to support Adaptive Sync or else lose the home theater market/etc as consoles gained support. People told me I was loony, "NVIDIA'S just not that kind of company", etc. Well, turns out they were that kind of company, weren't they? Turns out people were mostly mad that... NVIDIA didn't immediately give all their engineering work away for free.
The GPP is the only thing I’ve seen that really stank and they backed off that when they saw the reaction. Other than that they are mostly guilty of... using a software license you don’t like. It says a lot about the success of copyleft that anyone developing software with a proprietary license is automatically suspect.
The truth is that NVIDIA, while proprietary, does a huge amount of really great engineering in novel areas that HNers would really applaud if it were any other company. Going and making your own monitor from scratch with a FPGA so you can implement a game-changing technology is exactly the kind of go-getter attitude that this site is supposed to embody.
Variable refresh rate/GSync is a game changer. DLSS 2.0 is a game changer. Raytracing is a game changer. And you have NVIDIA to thank for all of those, "proprietary" and all. They would not exist today without NVIDIA, AMD or Intel would not have independently pushed to develop those, even though they do have open-source drivers. What a conundrum.
I haven't used nvidia products for about 10 years and I' m not really in to gaming or graphics, so I don't really have an opinion on them either way, either business or technical. I used their FreeBSD drivers back in the day and was pretty happy it allowed me to play Unreal Tournament on my FreeBSD machine :-)
Linus is not always right, but a lot of what he says is often considerably more nuanced and balanced than his "worst-of" highlight reel suggests. There are plenty of examples of that in the presentation/Q&A he did from which this excerpt comes, for example (but of course, most people only see the "fuck you" part).
So if Linus – the person responsible for writing operating systems with their hardware – says they're the "worst company we deal with" then this strikes me as a good reason to at least do your research if you plan to buy hardware from them, if you intend to use it with Linux anyway. I'll take your word for it that they're doing great stuff, but if it outright refuses to work on my Linux box then that's kinda useless to me.
This was also 6 or 7 years ago I think, so perhaps things are better now too.
It really is possible to be critical of a monopoly without disparaging the product itself. It is when a true competitor arises that we see the monopolist's true capabilities (see Intel and AMD).
There is literally no benefit to 90% of the audience if they doubled the RAM. Of course, they also want to do market segmentation too, but you can’t blame GeForce for not being designed for ML training.
Perhaps I'm just not experienced enough with the programming model, but I've found them to be strictly less flexible/more tricky than GPUs, especially for things like conditional execution, multiple graphs, variable size inputs and custom ops.
The central reason that TPUs feel less flexible is Google's awful mistake in encouraging everyone to use TPUEstimator as the One True API For Doing TPU Programming. Getting off that API was the single biggest boost to my TPU skills.
You can see an example of how to do that here: https://github.com/shawwn/ml-notes/blob/master/train_runner.... This is a repo that can train GPT-2 1.5B at 10 examples/sec on a TPUv3-8 (aka around 10k tokens/sec).
Happy to answer any specific questions or peek at codebases you're hoping to run on TPUs.
I'll make it easier for you, directly from Google's website:
TPUs Cloud TPUs are optimized for specific workloads. In some situations, you might want to use GPUs or CPUs on Compute Engine instances to run your machine learning workloads.
Please tell me a workload a gpu can't do that a TPU can.
In my experience, well over 80% of these operations are implemented on TPU CPUs, and at least 60% are implemented on TPU cores.
Again, if you give a specific example, I can simply write a program demonstrating that it works. What kind of custom reduction do you want? What's a peak search?
As for workloads that GPUs can't do, we regularly train GANs at 500+ examples/sec across a total dataset size of >3M photos. Rather hard to do that with GPUs.
You did not give an example of something GPUs can't do. all you said was that TPUs are faster for a specific function in your case.
I wonder how Linux would react to this news.
Can someone explain this? (From the bullet points of the article)
I looked up the definition of accretive: "characterized by gradual growth or increase."
So it seems like they expect this to increase their margins. Does that mean ARM had better margins than NVIDIA?
Edit: I don't know what non-GAAP and EPS stand for
This article is old, but suggests a 48% operating margin: https://asia.nikkei.com/NAR/Articles/ARM-posts-strong-profit...
Non-GAAP -> doesn’t follow generaly accepted accounting practices. There are alternative accounting methods. GAAP is very US-centric (not good or bad, just stating a fact).
Does this mean Nvidia will have to deal with the hot mess at China ARM?
"ARM was probably what sank MIPS" - saagarjha
This is going to do especially bad things for anyone who needs to buy a cell phone or the SoC that powers one. There's no real alternative to ARM-based phone SoCs. Given Nvidia's business practices, any manufacturer who doesn't already have a perpetual ARM license should expect to have to pay a lot more money into Jensen Huang's retirement fund going forward. These costs will be passed on to consumers and will also provide an avenue for perpetual license holders to raise their consumer prices to match.
If it makes you feel any better, studies of acquisitions show that most of them are duds and destroy acquirer shareholder value.
RISV-V is a lot of talk and hype but the actual silicon that you could buy and implement into a product is hard to come by. With the exception of a few small microcontroller efforts (GigaDevice, Sipeed).
The problem is that there are no x86 SoCs that are sufficiently power efficient to be battery-life competitive with ARM SoCs in phones.
But more to the point, this is also China. If you want to do business in China, you're going to do as they tell you, or you get the stick. And if you don't like it, what are you going to do?
If patents did not exist, and nVidia were to close down ARM and tell people "no more ARM GPUs; only nVidia GPUs from now on", then a competitor who offers ARM-compatible ISAs would quickly appear. But in the real world, nVidia just bought the monopoly rights to sue such a competitor out of existence.
It's really no wonder nVidia did this given the profits they can extract from this monopoly (on the ARM ISA).
They're also building an ARM supercomputer at Cambridge, but server-ARM doesn't sound like a focus.
I'm just hoping for some updated mobile Nvidia GPUs... and maybe the rumoured 4K Nintendo Switch.
They say they won't muck it up, and it seems sensible to keep it profitable:
> As part of NVIDIA, Arm will continue to operate its open-licensing model while maintaining the global customer neutrality that has been foundational to its success, with 180 billion chips shipped to-date by its licensees.
ARM doesn't have good server CPU IP. Graviton, A64FX, etc. belong to other companies.
On a different topic, how would this influence Raspberry Pis going forward?
I bet they'd make a completely custom ISA if they could. Heck, maybe they plan to some day, and that's why they're calling the new Mac processors "Apple Silicon".
That was my first thought.
I'd be surprised if Apple wasn't already working on this.
If they could pull it off I would be very impressed.
- "We changed from the 68K to PPC" "Agh!...fine"
- "We changed from PPC to x86" "What, again?"
- "We changed from x86 to Apple Silicon" "...oooo...kay..."
- "We changed from Apple Silicon to, uh, Apple Silicon - like it's still called Apple Silicon, but the architecture is different" "What's an architecture?" "The CPU ISA." "The CPU is a what?"
Otherwise ARM would become a US company, and then the US would have another weapon in their arsenal to sanction China's tech companies (e.g. Huawei).
There isn't a huge amount of competition in the consumer/midrange sector. Nvidia has almost total market domination here. Really we just need a credible cross platform solution that could open up gpgpu on AMD. I'm surprised Apple isn't pushing this more, as they heavily use ML on-device and to actually train anything you need Nvidia hardware (eg try buying a Macbook for local deep learning training using only apple approved bits, it's hard!). Maybe they'll bring out their own training silicon at some point.
Also you need to make a distinction between training and inference hardware. Nvidia absolutely dominate model training, but inference is comparably simpler to implement and there is more competition there - often you don't even need dedicated hardware beyond a cpu.
Is it possible/legal for nvidia to charge a billion dollars per cpu for the isa license or are these things perpetual?
Only about 50% is gaming and nascent divisions like data centres can get a big boost from the acquisition.
We only connected Nvidia with GPUs, perhaps AI & ML. Now they are going to be a dominant player everywhere from consumer devices, IOT, cloud, HPC & Gaming.
And since Nvidia does not FAB its own chips like intel, this transformation is going to be pretty quick.
If only they go into public cloud business, we as costumers would have one other strong vendor to choose from.
"The proposed transaction is subject to customary closing conditions, including the receipt of regulatory approvals for the U.K., China, the European Union and the United States. Completion of the transaction is expected to take place in approximately 18 months."
Hopefully, the EU does its job, laughs at this and tells Nvidia to either go home or forces them to FRAND licensing of ARM IP.
FRAND licensing is worthless, if Qualcomm taught us anything.
Why make generalizations like this? It's not true, and we've devolved back into the "nu uh" we originally started with.
This is trivial to do on a GPU, and is built into the library
Yes, I'm sure there are hardwired operations that are trivial to do on GPUs. That's not exactly a +1 in favor of generic programmability. There are also operations that are trivial to do on TPUs, such as CrossReplicaSum across a massive cluster of cores, or the various special-case Adam operations. This doesn't seem related to the claim that TPUs are less flexible.
The raw functions it provides is not direct access to the hardware and memory subsystem.
Not true. https://www.tensorflow.org/api_docs/python/tf/raw_ops/Inplac...
Jax is also going to be giving even lower-level access than TF, which may interest you.
You did not give an example of something GPUs can't do. all you said was that TPUs are faster for a specific function in your case.
Well yeah, I care about achieving goals in my specific case, as you do yours. And simply getting together a VM that can feed 500 examples/sec to a set of GPUs is a massive undertaking in and of itself. TPUs make it more or less "easy" in comparison. (I won't say effortless, since it does take some effort to get yourself into the TPU programming mindset.)
Your last sentence is pretty funny: a GPU can't do certain workloads because one it can do is too slow for you. Yet it remains a fact that TPU cannot do certain workloads without offloading to the CPU (making it orders of magnitude slower), and that's somehow okay? It seems where this discussion is going is you pointed to a TensorFlow library that may or may not offload to a TPU, and it probably doesn't. But even that library is incomplete to implement things like a 5G LDPC decoder.
You'll need to link me to some specific implementation that you want me to port over, not just namedrop some random algorithm. Got a link to a github?
If your point is "There isn't a preexisting operation for overlap-save FFT" then... yes, sure, that's true. There's also not a preexisting operation for any of the hundreds of other algorithms that you'd like to do with signal processing. But they can all be implemented efficiently.
Yet it remains a fact that TPU cannot do certain workloads without offloading to the CPU (making it orders of magnitude slower), and that's somehow okay?
I think this is the crux of the issue: you're saying X can't be done, I'm saying X can be done, so please link to a specific code example. Emphasis on "specific" and "code".
Trust me, I would love if TPUs could do what you're saying, but they simply can't. There's no direct DMA from the NIC to where I can do a streaming application at 40+Gbps to it. Even if TPU could do all the things you claim, if it's not as fast as the A100, what's the point? To go through undocumented pain to prove something?
10Gbps isn't quite 40Gbps, but I think you can get there by streaming to a few different TPUs on different VPC networks. Or to the same TPU from different VMs, possibly.
The point is that there's a realistic alternative to nVidia's monopoly.
ARM created a business model for itself where they had to act as a "BDFL" for the ARM architecture and IP. They made an architecture, CPU designs, and GPU designs for others. They had no stake in the chip making game, and they had others - Samsung, Apple, Nvidia, Qualcomm, Huawei, Mediatek, Rockchip and loads of others make the chip. Their business model was to make the ARM ecosystem accessible for as many companies as possible, so they could sell as many licenses as possible. In that way, ARM's business model enabled a very diverse and thriving ARM market. I think this is the sole reason we see ARM eating the chip world today.
This business model would continue to work perfectly fine as a privately held company, or being owned by a faceless investor company that wants you to make as much money as possible. But it's not fine if you are owned by a company that wants to use you to control their own position in the chip market. There is no way Nvidia (any other chip company, but as laid out previously Nvidia might even be more concerning) will spend 40 billion on this without them deliberately or inadvertently destroying ARM's open CPU and GPU ecosystem. Will Nvidia allow selling ARM licenses to competitors of Nvidia's business? Will Nvidia reserve ARM's best IP as a selling point for its own chips? Will Nvidia allow Mali to continue existing? Any innovations ARM made previously it sold to anyone mostly indiscriminatorily (outside of legal restrictions), but now every time the question must be asked "does Nvidia have a better propietary purpose for this?". For any ARM chip maker the situation will be that Nvidia is both your ruthless competitor, but it also sells you the IP you need to build your chips.
EDIT: ARM's interests up to last week were to create and empower as many competitors for Nvidia as possible. They were good at that and was the root of the success of the ARM ecosystem. That incentive is completely gone now.
Unless Nvidia leaves ARM alone (and why would they spend $40B on that??), this has got to be the beginning of the end of ARM's golden age.
- It will know of their product plans (as they will need to buy licenses for new products).
- It will know their sales volumes by product (as they will need to pay fees for each Arm CPU sold).
- If they need technical help from Arm in designing a new SoC then the details of that engagement will be available to Nvidia.
How does this not give Nvidia an completely unfair advantage?
But I do think it's important that we recognize that we're going from a position of tremendous competitiveness to a much less competitive situation. And that will be a situation where ARM will be tightly controlled and much less inducive to the innovation we've seen in the last years.
Samsung comes to mind as another company that makes their own TVs, phones, SSDs, ect., but is also perfectly happy to license the underlying screens and chips in those products to other companies. From my vantage point, the setup seems to be working well?
EDIT: Let's be clear that ARM's incentive last week was to create and empower as many competitors for Nvidia as possible. They were good at that and was the root of the success of the ARM ecosystem. That incentive is completely gone now.
I'm guessing Samsung has a track record where I'd feel a little more confidence in the situation if they'd taken over ARM here, but in general ARM's sale to Softbank and thereby its exposure to lesser competitive interests has been terrible. They could have remained a private company.
There's a whole lot of inertia for Nvidia to take advantage of here while the rest of the industry figures out where it's going.
I doubt Nvidia would substantially disrupt or cancel licensing to the many third-rate chip designers you listed. But, if they can leverage this acquisition to build Windows/Linux CPUs that can actually compete with AMD and Intel, that would be a win for consumers. And Nvidia has shown interest in this in the past.
Yes, its a massive disruption to the status quo. But it may be a good one for consumers.
This is nothing to do with extending Nvidia's ability to use Arm IP in its own products.
But why would a company spend that much money to buy a company and destroy it afterwards?
Personally I don't think that's true in this particular case, but the strategy isn't exactly unheard of.
/s
Why is that bad? Not only it's common business practice (the more you buy from us, the cheaper we sell), it also makes sense from the support perspective. Support the integration between their cores and a different GPU would be more work for them than integration of their cores with their own GPUs.
That's why companies expand to adjacent markets: efficiency.
A completely different thing would be to say: "if you want our latest AXX core, you have to buy our latest Mali GPU". That's bundling, and that's illegal.
There are three possibilities here: ARM's design is approximately the same as the competitory, ARM's design is inferior to the competitor, and ARM's design is superior to the competitor.
If faced with two equivalent products, staying with the same supplier for both is best (especially in this case where the IP isn't supply-limited). The discount means a reduction in costs to make the device. Instead of ARM making a larger profit, their customers keep more of their money. In turn, the super-competitive smartphone market means those savings will directly go to customers.
In cases where ARM's design is superior, why would they bundle? If they did, getting a superior product at an even lower price once again just means less money going to the big corporation and more money that stays in the consumer's pocket.
The final case is where ARM has an inferior design. I want to sell the most performance/features for the price so I can sell more phones. I have 2 choices: slight discount on the CPU but bundled with an inferior GPU or full price for the CPU and full price for a superior GPU. The first option lowers phone price. The second option offers better features and performance. For the high-end market, I'm definitely not going with the discount because peak performance reigns supreme. In the lesser markets, its a calculation of price for total performance and the risk that consumers might prefer an extra few FPS for the cost of another few dollars.
Finally, there are a couple small players like Vivante or Imagination Technologies, but the remaining competitors in the space (Intel, AMD, Nvidia, Qualcomm, Samsung, etc) aren't going to be driven under by bundle deals, so bundling seems to be pretty much all upside for consumers who stand to save money as a result.
It is actually the other way around. ARM is more like giving the Mali GPU for free ( or at a very low cost ) if you use their CPU.
>Also this caused obvious problems for IMGtec
Yes, part of the reason why PowerVR couldn't get more traction and Apple were unhappy with their GPU pricing.
What would be an issue would be if Arm used their market power in CPUs to try to control the GPU market - e.g. you can't have the latest CPU unless you buy a Mali GPU with it.
Just look at ARMs annual net, multiply by 10, multiply that by 2 assuming starry-eyed optimism about you being better at generating value from ARM IP, you’re still far from 40 billion.
Like AMD? Sure. None of the ARM IP compete with Nvidia much. Not to mention by "Not" Selling to AMD it create more problem for its $40B asset than anyone could imagine.
>Will Nvidia reserve ARM's best IP as a selling point for its own chips? Will Nvidia allow Mali to continue existing?
Sure. Mali dont compete with Nvidia at all. Unless Nvidia will put up their CUDA Core for IP licensing with similar price and terms to Mali. Could they kill it or raise the price of Mali? Sure. But there is always PowerVR. Not to mention AMD is also licensing out Radeon IP to Mobile. Mostly because AMD dont / cant compete in that segment.
>Unless Nvidia leaves ARM alone (and why would they spend $40B on that??)
It has more to do with Softbank being an Investor. They were already heavily invested in Nvidia. And they need money, they want out. And seriously no one sane would buy ARM for $40B ( It is actually $35B, with $5B as performance bonus, the number $40 was likely used only for headline. ) As a matter of fact I would not be surprised if Softbank promise to buy it back someday. This also paint a picture of how desperate Son / Softbank needs those Cash. So something is very wrong. ( Cough WeWork Cough )
But I do understand your point. Conflict of Interest. Similar to Apple wouldn't want to build their Chip in Samsung Foundry.
While I would have liked ARM to remain independent. I am not as pessimistic as some have commented. And normally I am the one who had people pointing at my pessimism.
On the optimistic side of things. There are quite a lot of cost could be shared with the tools used for TSMC and Samsung Foundry implementation. ( Nvidia is now in bed with Samsung Foundry ) For ARM that means higher margin, for its customers that mean access to Samsung Foundry Capacity where previously they are stuck with TSMC. Nvidia also gets to leverage ARM's expertise in licensing, so their Nvidia GPU could theoretically enter new market. The real IP with Nvidia isn't so much about the GPU design, but its Drivers and CUDA. So may be Nvidia could work towards being an Apple like "Software" company that works with specific Hardware. ( Pure Speculation only )
There are lots of talk about Nvidia and ARM. While I dont think the marriage make perfect sense, It is not all bad. There are more interesting point no one is talking about. Marvell, AMD, Xillix and may be Broadcom. The industry is consolidating rapidly because designing leading edge chip, even with the cheap IP licensing is now becoming very expensive. And the four mentioned above have less leverage than their competitors.
Interest Times.
"To pave the way for the deal, SoftBank reversed an earlier decision to strip out an internet-of-things business from Arm and transfer it to a new company under its control. That would have stripped Arm of what was meant to be the high-growth engine that would power it into a 5G-connected future. One person said that SoftBank made the decision because it would have put it in conflict with commitments made to the U.K. over Arm, which were agreed at the time of the 2016 deal to appease the government." (from https://arstechnica.com/gadgets/2020/09/nvidia-reportedly-to... )
and
"The transaction does not include Arm’s IoT Services Group." (nvidia news.)
which appear to contradict each other.
I'm not sure about the significance of this. I would have guessed Nvidia would have wanted the IoT group to remain.
Also, to first order, when a company issues stock to purchase another corporation, that cost is essentially "free" since the value of the corporation increases.
In other words, Nvidia is essentially paying $12 billion in cash for ARM up front, and that's all. (The extra $5B in cash or stock depends on financial performance of ARM, and thus is a second-order effect.)
I can not put into words how furious I am at the UK's Conservative party for not protecting our last great tech company.
Europe has been fooled into the USA's ultra free market system (which works brilliantly for the US but is terrible for everybody else). As such American tech companies have brought EVERYTHING and eventually moth balled them.
Take Renderware it was the leading game engine of the PS2 era consoles, brought by EA and mothballed. Nokia is another great example brought by Microsoft and mothballed. Imagination Technologies was slightly different in that it wasn't bought but Apple essentially mothballed them. Now ARM will undoubtedly be the next via an intermediate buyout.
You look across Europe and there is nothing. Deepmind could have been a great European tech company - it just needed the right investment.
The perpetual architecture license folks that make their own cores like Apple, Samsung, Qualcomm, and Fujitsu (I think they needed this for the A64FX, right?) will be fine, and may just fork off on the ARMv8.3 spec, adding a few instructions here or there. Apple especially will be fine as they can get code into LLVM for whatever "Apple Silicon" evolves into over time.
The smaller vendors that license core designs (like the A5x and A7x series, etc.) like Allwinner, Rockchip, and Broadcom are probably in a worse state - nVidia could cut them off from any new designs. I'd be scrambling for an alternative if I were any of these companies.
Long term, it really depends on how nVidia acts - they could release low end cores with no license fees to try to fend off RISC-V, but that hasn't been overly successful when tried earlier with the SPARC and Power architectures. Best case scenario, they keep all the perpetual architecture people happy and architecturally coherent, and release some interesting datacenter chips, leaving the low end (and low margin) to 3rd parties.
Hopefully they'll also try to mend fences with the open source community, or at least avoid repeating past offenses.
I never liked Softbank owning it, but hey someone has to.
Regarding the federal investment in FOSS thread that was here perhaps CPU architecture would be a good candidate.
NVIDIA was so well run, but boxed into a smaller graphics card market - ATI and it were forced into low margins since they were made replaceable by OpenGL and DirectX standards. For the standard fans - they resulted a wealth transfer from NVIDIA to Apple etc. and reduced capital available for R&D.
NVIDIA was constantly attacked by a much bigger Intel (which changed interfaces to kill products and was made to pay by a court)
Through innovation, developing new technologies (CUDA) they increased market cap, and have used that to buy Arm/Mellanox.
I love the story of the underdog run by a founder, innovating it’s way to getting into new markets against harsh competition. Win for capitalism!
https://www.extremetech.com/computing/312528-nvidia-overtake...
This acquisition can be seen as a beacon of nvidia's past struggle against the market and the competitors.
For whatever happened, nvidia innovated to their success, and had enabled possibly the biggest tech boom so far through deep learning. Might be one day everyone claimed nvidia to be the "most important company" on earth.
Not correct Mellanox was bought for $7B.
> Huang told me that first thing that the combined company will do is to, “bring NVIDIA technology through Arm’s vast network.” So I’d expect NVIDIA GPU and NPU IP to become available quickly to smartphone, tablet, TV and automobile SoC providers as quickly as possible.
> Arm CEO Simon Segars framed it well when he told me, “We're moving into a world where software doesn't just run in one place. Your application today might run in the cloud, it might run on your phone, and there might be some embedded application running on a device, but I think increasingly and with the rollout of 5g and with some of the technologies that Jensen was just talking about this kind of application will become spread across all of those places. Delivering that and managing that there's a huge task to do."
> Huang ... “We're about to enter a phase, where we're going to create an internet that is thousands of times bigger than the internet that we enjoy today. A lot of people don't realize this. And so, so we would like to create a computing company for this age of AI.”
$40B is an obscene lot of money objectively and what's the endgame for Nvidia? If it's to "fuse" ARM's top CPU designs with their GPU prowess, then couldn't they invest the money to restart their own CPU designs (e.g. Carmel)? My inner pessimist, as with others here, is that Nvidia will somehow cripple the ARM ecosystem or prioritize their own needs over those of other customers'. Perhaps an appropriate analogy is Qualcomm's IP licensing shenanigans and how they've crippled the non-iOS smartphone industry.
That said, there's also examples of companies making these purchases with minimal insidious behavior and co-existing with their would-be competitors: Microsoft's acquisition of Github, Google's Pixel smartphones, Sony's camera lenses business and even Samsung, which supposedly firewalls its components teams so the best tech is available to whoever wants (and is willing to pay for it).
I suppose if this acquisition ends up going through (big if), then we'll see Nvidia's true intent in 3-5 years.
1) Adoption of ARM CPU's (AWS Graviton, rPi etc) will cause software to be adapted to ARM anyway, meaning: Nvidia could come out with a full vertically integrated cloud.
or
2) Leveraging full vertical integration with ML based super computers.
Announced at ARM TechCon last year: https://www.arm.com/products/silicon-ip-cpu/ai-platform
A 25% gain over a horizon of four years is not bad for your average investment -- but this isn't an average investment.
First, compared to the SP500, this underperforms over the same horizon (even compared to end of 2019 rather than the inflated prices right now).
Second, ARM's sector (semiconductors) has performed far, far better in that time. The PHOX (Philadelphia Semiconductor Index) doubled in the same time period.
And looking at AMD and NVIDIA, it feels as if ARM would have been in a position to benefit from the surrounding euphoria.
On the other hand, unless I'm misremembering, ARM back then was already considered massively overvalued precisely because it was such a prime takeover target, so perhaps its the $32B that are throwing me off here.
Maybe in a decade or so it may be more relevant but not today. Calling this a "fundamental threat" is a tad exaggerated.
Now arm is yet another US company.
However, given the momentum of Nvidia these past several years alongside the massive adoption and evolution of ARM, this is probably going to be the most interesting acquisition to watch over the next few years.
Edit: yes I meant nVidia not AMD!
Thanks!
[1] https://riscv.org/wp-content/uploads/2017/05/Tue1345pm-NVIDI...
Not sure how relistic that scenario is, although I personally can very much see this being used as a negotiation vehicle, depending on the actual security concern (I'm obviously not an expert there..)
However the situation for Chinese companies is even clearer now. Huawei, Hikvision etc. need to move away from ARM. Probably on to their own thing as RISC-V is dominated by US companies.
This allows them to design their own cores using the Arm instruction set[3] and presumably includes perpetual IP licenses for Arm IP used while the license is in effect. New Arm IP doesn't seem to be included, since existing 32bit Arm licensees had to upgrade to a 64bit license[2].
[1] https://www.anandtech.com/show/7112/the-arm-diaries-part-1-h...
[2] https://www.electronicsweekly.com/news/business/finance/arm-...
[3] https://en.wikipedia.org/wiki/ARM_architecture#Architectural...
Nvidia has consistently for many years refused to properly support Linux and other open source OSs.
Heck, Wayland compositors just say "if you're using nvidia then don't even try to use our software" since they're fed up of Nvidia's lack of collaboration.
I really hope ARM doesn't go the same way. :(
Whether or not some SoC (e.g. in a phone) is going to be supported by Linux doesn't depend on ARM but on the manufacturer of the given chip. That won't change in any way.
ARM maintains the GCC toolchain for the ARM architecture but that is unlikely to go anywhere (and even if it did, it is open source and anyone else can take it over).
The much bigger problem is that Nvidia could now start putting squeeze on chip makers who license the ARM IP for their own business reasons - Nvidia makes its own ARM-based ICs (e.g. the Jetson, Tegra) and it is hard to imagine that they will not try to use their position to stiffle the competition (e.g. from Qualcomm or Samsung).
ARM directly maintains the main ARM parts of the Linux kernel among other things.
That's exactly the thing -- nvidia refuses to document their products, so third parties can't support it.
Should they extend their practices to ARM, then ARM support will quickly whither.
Apple is probably putting together a RISC-V hardware group as we speak. The Jobs ethos will not allow them to depend this heavily on somebody else for such a critical technology.
The $12B comes from Nvidia the company, the remaining money comes from Nvidia's shareholders directly.
[1] Only if the valuation of ARM is "worth it" the fresh issue of shares will not cost the current shareholders anything. This is rarely the case , if Nvida overvalued(or less likely undervalued) the deal then current shareholders are giving more than they got for it.
It's just different forms of shareholder assets being traded for other assets, by the shareholders (or rather, their majority vote).
Cash paid in this instance is treated no different than cash in their normal operating expenses. If either generates profits in line with their current expected returns, the stock price stays the same, and everyone is indifferent to the transaction.
Same goes for stock issuance. If the expectation of the use of proceeds from the issuance are in line with the company's current expected returns, everyone is indifferent.
Your statement is still true, and the stock market jumped today on the news, so I feel my connotation is misplaced.
> One person close to the talks said that Nvidia would make commitments to the UK government over Arm’s future in Britain, where opposition politicians have recently insisted that any potential deal must safeguard British jobs.
So the deal has already been influenced by one regulator. That should encourage other regulators.
> SoftBank will remain committed to Arm’s long-term success through its ownership stake in NVIDIA, expected to be under 10 percent.
Why is this stake necessary?
Edit: it’s not necessary/a requirement.
They’re noting that after the transaction, SoftBank will still fall under the 10% ownership threshold that requires more reporting from the SEC [1]:
> Section 16 of the Exchange Act applies to an SEC reporting company's directors and officers, as well as shareholders who own more than 10% of a class of the company's equity securities registered under the Exchange Act. The rules under Section 16 require these “insiders” to report most of their transactions involving the company's equity securities to the SEC within two business days on Forms 3, 4 or 5.
[1] https://www.sec.gov/smallbusiness/goingpublic/officersanddir...
This isn't correct. If investors thing Nvidia overpaid, its share price will decline. There are many examples of acquiring companies losing significant value on announcements to buy other companies even in pure stock deals.
One would be making the argument, "the cost is essentially free because although we spent $40B, we acquired a company worth $40B". Obviously, that's not any more correct than the case of paying in stock.
You realize that logic no longer matters in this economy. There's an oversupply of printed money and stonks literally only going up.
I get that Hackernews is dominated by people working in software and software news, but as a part of the real economy (and not the stock market) it's actually not that large and Europe doesn't frame trade policy around it, for good reasons.
The US also doesn't support free-trade for economic reasons, but for political and historical reasons, which is to maintain a rule based alliance across the globe, traditionally to fend off the Soviets. Because they aren't around any more, the US is starting to ditch it. The US has never economically benefited from free-trade, it's one of the most insular nations on the planet. EU-Asia trade with a volume of 1.5 trillion almost doubles EU-American trade, tendency increasing, and that's why Europe is free-trade dependent.
I think you're also getting mixed up between 'free-trade' and 'free-markets'. Free trade is about trade deals: NAFTA, WTO, EU, CPTPP, Mercour or whatever trade grouping you want - generally to do with the removal of taxes and standardisation of goods between countries.
Free markets on the other hand is do with the liberalisation of markets i.e removing government intervention (as much as possible) i.e regulations and restrictions of buying and selling of stuff - in this case companies (which can be covered in a trade deal admittedly)
What I'm advocating is that British gov (and most European gov's) restricts the selling of their tech companies based purely on the importance of the tech company.
Why?
Because as I say its do to with control. We're not able to make democratic, sovereign decisions when the fabric of how most things are done is controlled completely by someone else.
Maybe part of the problem is that due to so many regulations, there's not a healthy startup ecosystem and the compensation isn't remotely high enough to draw the best talent.
There is a reason Russia and China have strong tech companies and Europe doesn’t. That reason isn’t lack of money, lack of talent or regulations. The only way for Europe to get big tech companies is by removing or crippling big US companies so EU companies can actually compete. The US companies would be quickly replaced by EU alternatives and those would offer high compensation all the same.
Whether or not that is worth it from the perspective of the EU is not so black and white - tech is obviously not everything - but the current situation where all EU data gets handed to the US government on a silver platter is also far from optimal from the perspective of the EU.
Ultimately free trade is where the world would like to get to purely from an economic basis but you have to do that in tandem with the rest of the world. If you go first everybody else has a economic advantage over you as possibly the UK will find out after Brexit actually happens. Also politics gets in the way of the world achieving full free trade. Some gov will always want votes by protecting an industry - like the UK's fishing industry for instance.
Skype is another example to add to the list.
Skype is effectively dead technology and isn’t even promoted any more.
Reaganomics talking point since the 80s, yet the U.S. constantly relaxes regulations, recently it released even more environmental ones and it looks in parts like Mars.
But of course, cut regulations, cut corporate taxes, cut benefits, cut, cut, cut. There's never a failure model for such capitalism apparently. 2008 even was blamed on regulation, rather than lack of thereof.
Am quite frankly done with this line of argument.
* We are no longer in the ps2 era. EA now uses Frostbite, which was developed by the Swedish studio Dice. It is alive and well, powering some 40-50 games. https://en.m.wikipedia.org/wiki/Frostbite_(game_engine)
* Nokia was dead well before MS bought them.
I dont see how Renderware would compete with Unreal. Even their owner EA choose Unreal. They were great in PS2 era, but next Gen console ( PS3 ) they were not.
Nokia was dead even before Stephen Elop became the CEO. So the Microsoft acquisition has nothing to do with it.
IMG - Yes. But I would argue they were dead either way. They couldn't get more GPU licensing due to ARM's Mali being cheap and good enough. They couldn't expand into other IP licensing areas. Their MIPS acquisition was 5 years too late. Their wireless part couldn't compete with CEVA. And they somehow didn't sell themselves to Apple as an Exit. ( But then Apple lied about not using IMG's IP. While Steve Jobs often put a spin thing, I find Time Cook's Apple quite often just flat out lying )
If Renderware hadn't been brought by EA (and hence controlled by your competitor), the rest of the industry would probably have kept using Renderware as it was the best option and development would have continued. It would have been built on to deliver next gen experiences.
It mirrors pretty much perfectly what is wrong with the ARM Nvidia deal.
Nokia yes wasn't doing well in the smart phone sector but was doing excellently in the feature phone sector. Hence why HMD Global is now doing very well selling those handsets.
Perhaps governments around the world should do what the US did (and still does) to foreign companies before its too late.
Although I dont agree with selling to Softbank at least they didn't have a dog in the game. What is bad about Nvidia is that they have a dog in the (chip) game. A major reason you went to ARM was for a non biased design team - you knew you were getting their best if you paid for it. Now I'm afraid you don't.
Also my comment is not anti US, it just so happens that the US has all the big tech companies and foolishly the European countries, especially the UK believes it can compete in a level playing field with the US even though the US's GDP is about 9 times bigger than the UK's - god knows how much bigger its equities markets are.
At the EU level the US is not but then this kind of stuff isn't decided about at the EU level - maybe it should be, not that that will help the UK see the error in its ways after Christmas.
As for emotional reaction with the greatest of respect did you read your message before you posted it?
A monoculture is bad. Any monoculture.
There is one thing they would need to worry about though, which is that if the rest of the market moves to RISC-V or x64 or whatever else, it's not implausible that someone might at some point make a processor which is superior to the ones those companies make in-house. If it's the same architecture, you just buy them or license the design and put them in your devices. If it's not, you're stuck between suffering an architecture transition that your competitors have already put behind them or sticking with your uncompetitive in-house designs using the old architecture that nobody else wants anymore.
Their best move might be to forget about the architecture license and make the switch to something else with the rest of the market.
This assumes that there isn't some other factor in transitioning architecture - this argument could boil down in the mid 2000's to "Why not go x86/amd64", but you couldn't buy a license to that easily (would need to be 3-way with Intel/AMD to further complicate things)
Apple has done quite well with their ARM license, outperforming the rest of the mobile form factor CPU market by a considerable margin. I don't doubt that they could transition - they've done it successfully 3 times already, even before the current ARM transition.
Apple under Cook has said they want to "to own and control the primary technologies behind the products we make". I doubt they'd turn away from that now to become dependent on an outside technology, especially given how deep their pockets are.
EDIT: correction, make that the last generation or four (oops, time flies)
No, they may not. People keep suggesting these kinds of things, but part of the license agreement is that you can't modify the ISA. Only ARM can do that.
> There’s been a lot of confusion as to what this means, as until now it hadn’t been widely known that Arm architecture licensees were allowed to extend their ISA with custom instructions. We weren’t able to get any confirmation from either Apple or Arm on the matter, but one thing that is clear is that Apple isn’t publicly exposing these new instructions to developers, and they’re not included in Apple’s public compilers. We do know, however, that Apple internally does have compilers available for it, and libraries such as the Acclerate.framework seem to be able to take advantage of AMX. [0]
my123's instruction names leads to a very shallow rabbit hole on google, which turns up a similar list [1]
Agreed upon: ['amxclr', 'amxextrx', 'amxextry', 'amxfma16', 'amxfma32', 'amxfma64', 'amxfms16', 'amxfms32', 'amxfms64', 'amxgenlut', 'amxldx', 'amxldy', 'amxldz', 'amxldzi', 'amxmac16', 'amxmatfp', 'amxmatint', 'amxset', 'amxstx', 'amxsty', 'amxstz', 'amxstzi', 'amxvecfp', 'amxvecint']
my123 also has ['amxextrh', 'amxextrv'].
[0] https://www.anandtech.com/show/14892/the-apple-iphone-11-pro....
[1] https://www.realworldtech.com/forum/?threadid=187087&curpost...
(famously so, Intel used to ship arm chips with WMMX and Apple for example ships their CPU today with the AMX AI acceleration extension)
Broadcom has an architectural license. They do also license core designs.
Really if nVidia locks up the lower end cores, then a lot of stuff breaks. Billions of super tiny ARM cores are everywhere. ARM has few competitors in the instruction set space for low end, low power, low cost cores. AVR, PIC, and MIPS are what come to mind. And AVR/PIC are owned by Microchip corporation.
These ARM chip unit licenses are dirt cheap, there's hundreds of small manufacturers, and their chips go in everything, and in unexpected places. And these aren't just little microprocessors anymore. They're even in SoCs as little coprocessors that manage separate hardware components in realtime.
The amount of penetration ARM has in hidden places cannot be underestimated. And there isn't a quick replacement for them. Not one freely licensed to any manufacturer.
Why would they do that anyway? The downsides are obvious (immediate loss of revenue), the risks are huge (antitrust litigation, big boost to RiscV or even Mips), the possible benefits are nebulous.
Those who are most obviously at risk are designers of mobile GPUs (Broadcom, PowerVR ...).
I know momentum is currently towards ARM over POWER, but... OpenPOWER is certainly a thing, and has IBM / Red Hat support. IBM may be expensive, but they already were proven "fair partners" in the OpenPOWER initiative and largely supportive of OSS / Free Software.
I don't see OpenPOWER going there, but I can easily see RISC-V going there. So, for the moment, that is the horse I'm betting on.
In mobile form, it would have made a large leap in both performance and battery life. And it would have been a fairly easy market to break into: the average life of a mobile device is a few years, not a few decades. Recompilation and redistribution of software is the status quo.
I understand what you're saying and this seems to be the prevailing pattern but I really don't understand it. ARM could easily be a standalone company. For some reason, mergers are in.
I have always assumed that their shareholders were offered so much of a premium on their shares that they chose to sell them rather than hold onto them. Clearly based on their fiscal 2015 results[1] they were a going concern.
[1] https://www.arm.com/company/news/2016/02/arm-holdings-plc-re...
Apple could of course Afford to invest in RISC-V (and surely has played with it internally) but they have enough control of their future under the current arrangement that it will be a long long time before they feel any need to switch — 15 years at least.
Not all fashionable words are devoid of meaning.
Even if it does get approved, and even if NVIDIA decides to not screw up any of the licensees, the whole notion of NVIDIA being capable of doing so to any of their (NVIDIA's) competitors, will surely mean extra selling points for all of ARM competitors like MIPS, RISC-V etc.
Now, every reference-implementation ARM processor manufactured will fund GeForce desktop products, datacenter/enterprise, etc as well.
NVIDIA definitely needs something like this in the face of the new Samsung deal, as well as AMD's pre-existing console deals.
It’s an excellent deal for NVIDIA of course, I’m certain they intend to make the chips they produce much faster than the ones they license (if they even ever release another open design) to the point where buying CPUs from Nvidia might be they only game in town. We’ll have to see but this is what I expect to happen.
NVIDIA already has an CPU architect team building their own ARM CPUs with an unlimited ARM license.
ARM doesn't give NVIDIA a world-class CPU team like apple's, amazon's or fujitsu. ARM own cores are "meh" at best. Buying such a team, would also have been much cheaper than 40b$.
Mobile ARM chips are meh, but nvidia doesn't have GPUs for that segment, and their current architectures probably don't work well there. The only ARM chips that are ok-ish are embedded/IoT at < 1W power envelope. It would probably take nvidia 10 years to develop GPUs for that segment, the margins on that segment are razor thin (0.10$ is the cost of a full SoC on that segment), and it is unclear whether applications on that segment need GPUs (your toaster certainly does not).
The UK appears to require huge R&D investments in ARM to allow the sale. And ARMs bottom line is 300million $/year in revenue, which is peanuts for nvidia.
So if anything, ARM has a lot to win here with nvidia pumping in money like crazy to try to improve ARM's CPU offering. Yet this all seem super-risky because at the segments ARM is competing at, RISC-V competes as well, and without royalties. It is hard to compete against something that's free, even if it is slightly less good. And chances are that over the next 10 years RISC-V will have much better cores (NVIDIA themselves started replacing ARM cores with RISC-V cores in their GPUs years ago already...).
Either way, the claim that it is obvious to everybody what the 3D-chess being played here is false. To me this looks like a bad buy for nvidia. They could have paid 1 billion for a world class CPU team and just continue to license ARM and/or switch to RISC-V chips. Instead they are spending 40 billion in a company that makes 300 million a year, makes meh-cpus, is heavily regulated in the UK and the world, has problems with China due to being in the West, have to invest in the UK which is leaving the EU in a couple of weeks, etc.
I'm not claiming I'm right and you're wrong, of course. I just think it's unfair to make negative assumptions at this point, so wanted to paint a possible good thing.
Microsoft have away a free browser with their operating system - leaving little room for other browser vendors to serve that market.
Each ARM design deal including a GPU for cheap leaves little room for other GPU vendors.
This is a pretty naive depiction of Wall Street.
Despite efforts to the contrary the EU functions as a glorified free trade zone, half a century of integration cannot beat 1000 years of fragmentation.
I say this because I think Europe with a set of consistent regulations and ways of establishing business would serve as a good counterweight to the freewheeling, "anything goes" nature of US capitalism. But I think the fragmentation is its Achilles Heel.
------------------------
[1] https://www.doingbusiness.org/en/rankings?region=oecd-high-i...
If not then it's very likely that NVIDIA will do everything in it's power to increase prices for ARM designs.
Clearly, if a buyer is willing to pay a premium for shares, it's because they believe the company is worth that premium. If the shareholders were optimistic, they might consider that as a signal of the company's underlying value and choose not to sell.
Sometimes activist shareholders will pressure a company to sell, like in the case of the Whole Foods sale to Amazon. Jana Partners owned ~8% and they motivated Whole Foods to look for buyers. They dumped their shares after the sale was announced and before it was executed. Whether that was the best for the other 92% of shareholders, the company's employees, and its customers is another question entirely.
My question is really outside of that. There will always be banks, investors, and companies that are motivated to consolidate and integrate companies to be increasingly competitive to the point of being anti-competitive. What is the counter force that helps maintain moderately sized companies that are profitable on their own?
https://nvidianews.nvidia.com/news/nvidia-to-acquire-arm-for...
> Under the terms of the transaction, which has been approved by the boards of directors of NVIDIA, SBG and Arm, NVIDIA will pay to SoftBank a total of $21.5 billion in NVIDIA common stock and $12 billion in cash, which includes $2 billion payable at signing. The number of NVIDIA shares to be issued at closing is 44.3 million, determined using the average closing price of NVIDIA common stock for the last 30 trading days. Additionally, SoftBank may receive up to $5 billion in cash or common stock under an earn-out construct, subject to satisfaction of specific financial performance targets by Arm.
Since NVIDIA currently has 617 million shares outstanding, if the earn-out were to be fully in common stock, this would bring Softbank to 8.8% of NVIDIA from this transaction alone (plus anything they already have in NVIDIA as public-market investors).
(I believe that the Forbes analysis in the sibling comment is mistaken in describing this as a "10% stake in new entity" - no new entity is mentioned in the press release itself.)
The Forbes article is based on a joint interview today with the CEOs of Arm and Nvidia, who could have provided more detail than the press release, specifically:
> Arm operating structure: Arm will operate as an NVIDIA division
This level of detail can be confirmed during the analyst call on Monday. Operating as a separate division would help assuage concerns about Arm's independence. The press release says:
> Arm will remain headquartered in Cambridge ... Arm’s intellectual property will remain registered in the U.K.
Those statements are both consistent with Arm operating as a UK-domiciled business that is owned by Nvidia.
> Softbank ownership: Will keep 10% stake in new entity
That would mean Nvidia acquired 90% of Arm for $40B, i.e. Arm was valued at $44B.
Is Softbank invested in Arm licensees, who may benefit from Softbank influence? Alternately, which Arm licensees would bid in a future auction of Softbank's 10% stake of Nvidia's Arm division?
In the current example if the cash is coming from a debt instrument is it not bank funding it now?
it is typically about who is fronting the money now, it could be your bank making loans, or from cash reserves you have, fresh stock issue, selling another asset, or even the target's bank as in the case of LBO.
The shareholders always end up paying for it eventually in some form or other. Differentiating it by the current source helps understand the deal structure and risks better.
When doing a deal, it comes down to price and source of funds. Changing either can drastically change how good or bad the deal is.
Really, it shows that the market valued Parse much more than the cash it cost Facebook. If Parse was bought with stock instead of cash, that's almost cooler. Since it allowed Parse to capture more of the surplus value they created. (Since the stock price popped).
A global strategic bottleneck in China for these things doesn't require patent or environmental law violations, it just requires us to pay more for them. For a critically strategic resource like this, the US should ensure a consistent supply chain independent of geopolitical concerns with China. And if China were to cut off supply then concern for their patents goes out the window too.
The strange thing is if you do not count brexit, Arm is one of the many example Uk can do it. And whilst we say Nokia, Sieman and japan fuji (sitting in Hosiptal now and thinking those mri, ...) non-chinese and non-Russia do dominate the tech world even they are not USA. But communist ideology totalitarian I found tik tok is really the exception.
Hence I think Eu has their problem. But not because they are not as good as Russia or china.
In case you didn't know about Singles' Day: https://graphics.reuters.com/SINGLES-DAY-ALIBABA/0100B30E24T...
Their phones are pretty good, too (or were pretty good before they got cut off from their suppliers). Their edge was that they built really great cameras into their phones.
https://riscv.org/membership/members/
While some like Google and Alibaba are listed as platinum founding members.
Mali support is done by outside groups (not ARM). Midgard and Bifrost models are well supported (anything that starts with a T or G, respectively). Support for older models is a little worse, but better than some other GPUs.
Adreno support is done by outside groups (not Qualcomm) and is lagging behind new GPUs that come out considerably.
PowerVR GPUs (Imagination) have terrible open source support.
They did contribute somewhat to the "tegra" driver at least.
An open architecture, and business model based on partnership doesn't really synchronize with vendor locking your products for increased profits.
> The transaction does not include Arm’s IoT Services Group, which is made up of Treasure Data and Arm’s IoT Platform business.
> Back in July, Arm announced a proposed transfer to separate Treasure Data from its core semiconductor IP licensing business, enabling Treasure Data to operate as an independent business. That separation is on track and will be completed before the close of the NVIDIA transaction. Most importantly, you should not see any disruption in our service or support as a result of this news.
Well, the optimistic reason would be talent-share. nVidia has a lot of chip designers, and ARM has a lot of chip designers, and having all of them under one organization where they can share discoveries, research and ideas could benefit all of nVidia's products.
Buying ARM not only allows them to control the direction of development, it also protects them should anyone else have bought it with hostile intent.
Personally I see this as more of an attack on Intel.
It all happened because Apple came out with the first 64-bit design, and QCOMM wasn't ready. Rather than deliver 32-bit for 1 more year, they used an off the shelf ARM 64-bit design (A57) in their SoC called the Snapdragon 810, and boy was it terrible.
This could change.
Given that the CEO was the supply chain guy at the time I suspect the latter, as I’d imagine he’d be more dispassionate than Jobs.
In any case I seriously doubt nvidia could, much less would benefit from cancelling Apple’s agreement.
I've seen this argument made before.
It would be a valid point if Apple stopped using Nvidia GPUs in 2008 (they did), and then never used them again. And yet, 4 years later, they used Nvidia GPUs on the 2012 MacBook Retina 15" on which I'm typing this.
RISC-V and ARM can coexist, but RISC-V in the mainstream is a far away due to nothing more than momentum. People won't even touch Intel in a mobile device anymore, not just because of power usage, but software compatibility.
Are you referring to the Graviton2 for Amazon? If so, you might be interested to learn that ARM designed the cores in that chip.
> (NVIDIA themselves started replacing ARM cores with RISC-V cores in their GPUs years ago already...).
The only info on this I'm aware of is https://riscv.org/wp-content/uploads/2017/05/Tue1345pm-NVIDI..., which says nvidia is replacing some internal proprietary RISC ISA with RISC-V, not replacing ARM with RISC-V.
Famously, the Tegra SoCs, as used in the Nintendo Switch.
Yes, ARM mostly just does licensing, but it may turn out that this acquisition gives Nvidia positive influence over future ISA and fundamental design changes which emerge from their own experience building microprocessors.
Maybe that just benefits Nvidia, or maybe all of their licenses; I don't know. But, I think the high price of this acquisition should signal that Nvidia wants ARM for more than just collecting royalties (or, jesus, the people here who think they're going to cancel the licenses or something, that's a wild prediction).
The other important point is Mali, which has a very obvious and natural synergy with Nvidia's wheelhouse. Another example of Nvidia making ARM better; Nvidia is the leader in graphics, this is no argument, so their ability to positively influence Mali (whether by actually improving it, or replacing it with something GeForce) may be beneficial to the OEMs who use it.
In my view you have this completely backwards. I think the opposite is true and that Nvidia is not a powerhouse CPU designer at all. They make extremely impressive GPUs certainly, but that does not automatically translate to great capabilities in CPUs. In terms of CPUs they have so far either used standard ARM designs and have attempted their own Project Denver custom architecture which are not bad but have not impressed CPU wise either. In this area Nvidia would need ARM - primarily for themselves.
> The other important point is Mali, which has a very obvious and natural synergy with Nvidia's wheelhouse. Another example of Nvidia making ARM better; Nvidia is the leader in graphics, this is no argument, so their ability to positively influence Mali (whether by actually improving it, or replacing it with something GeForce) may be beneficial to the OEMs who use it.
I know you're only entertaining the thought, but the image of Nvidia shipping HDL designs of Geforce IP to Samsung or Mediatek in the short term future seems completely alien to me. Things would need to change drastically at Nvidia for them to ever do this.
Certainly Nvidia has the capabilities to sell way better graphics to the ARM ecosystem, and very likely only one line of GPUs can survive, but it just seems extremely unlike Nvidia to ever license Geforce IP to their competitors.
I don't believe they ever closed a deal, but clearly Nvidia had some interest in becoming an IP vendor. Perhaps the terms were too onerous or the price too high.
Since Ampere and GPUs in general are structured nothing like a microprocessor, I doubt you'll find anyone who agrees with that.
On the Intel side, the process obstacles have been tragic, but they have plenty of hot products and plenty of x86 market share to lose, or in other words, plenty of time to recover CPU performance dominance.
Apple has pulled this off about 4 times because of their small market share and willingness to deprecate old hardware, software and the close control of the hardware they release.|
In the PC world - x86 will remain with us for a LONG time to come.
But that has really nothing to do with the architecture. They just spent more money on CPU R&D than their ARM competitors. They could have done the same thing with RISC-V, and if that's where the rest of the industry is going, they could be better off going there too. Especially for Mac, since they're about to do a transition anyway. They could benefit from putting it off for another year while they change the target architecture to something the rest of the market might not be expected to avoid in the future.
There is also no guarantee that their success is permanent. They might have done a better job than Qualcomm this year, but what happens tomorrow, if Google throws their hat into the ring, or AMD makes a strong play for the mobile market, or Intel gets their heads out of their butts, or China decides they want the crown and gives a design team an unlimited budget? There is value in the ability to switch easily in the event that someone else is the king of the mountain for a while.
> I don't doubt that they could transition - they've done it successfully 3 times already, even before the current ARM transition.
Just because they can do it doesn't mean it's free.
Really if Intel was shrewd, they'd recognize that they've lost Apple's business already and just sell them an x86 license, under some terms that Intel would care about and Apple wouldn't, like they can only put the chips in their own devices. Then Apple could save themselves the transition entirely and do another refresh with processors from Intel while they put their CPU team to task redesigning their own chips to be x86_64. It would give both Apple and Intel a chance to throw a punch at Nvidia (which neither of them like) while helping both of them. Apple by avoiding the Mac transition cost and Intel by maintaining the market share of their processor architecture and earning them whatever money they get from Apple for the license.
And it gives Intel a chance to win Apple's business back. All they have to do is design a better CPU than Apple does in-house, and Apple could start shipping them again without having to switch architectures. Which is also of value to Apple because it allows them to do just that if Intel does produce a better CPU than they do at any point in the future.
And you don't want to move to ARM if everybody else is moving away from it.
I likewise don't think destroying the ARM ecosystem is Nvidia's primary objective here - far from it. It might not even be a secondary objective. But I do think the ARM ecosystem will be slowly torn apart either as an innocent bystander or as a less prominent secondary objective (given real-world complexities, probably some combination of both). When Nvidia buys ARM, and they explicitly buy them to improve their competitive position, there's only one direction where I see the incentives going. Those will be against the open ARM ecosystem, which is a breeding ground for big and small competition against Nvidia.
Even if Nvidia's primary motives are relatively benign, I think they'll inevitably create a sitation where the ARM ecosystem can't continue existing in its current form. That's where the real tragedy will be.
They could, but that's more than just an easy money grab. Something Nvidia would already be familiar with from Tegra.
What seems more likely/risky would be nvidia starts charging a premium for a Mali replacement, or begins sandbagging Mali on the lower end of things. But Qualcomm already has Adreno to defend against that.
The latest Tegra SoC launched March of 2020.
On mobile SoC margins I guess that margins are low because there is a lot of competition - start cutting off IP to the competition and margins will rise.
I suspect that their focus will be on the server / automotive to start off with but the very fact that they can to any of this is troubling for me.
And a realistic estimate how long it will take to develop something that would be on par with today's Snapdragon, Exynos or Apple's chips is at least those 10 years. You need quite a bit more to have a high performance processor than just the instruction set.
The "just worked" chips are microcontrollers, something you may want to put in your toaster or fridge but not a SoC at the level of e.g. Snapdragon 835 (which is an old design, at that).
Also the Sipeed chips are mostly just unoptimized reference designs, they have a fairly poor performance.
Most people who talk and hype RISC-V don't realize this.
Alibaba recently said their XT910 was slightly faster than the A73. Since the first actual A73 launched in Q4 2016, that would imply they are at most 4 years behind.
SiFive's U8 design from last year claimed to have the same performance as A72 with 50% greater performance per watt and using half the die area. Consider how popular the Raspberry PI is with it's A72 cores. With those RISCV cores, they could drastically increase the cache size and even potentially add more PCIe lanes within the same die size and power limits.
Finding out new things takes much more time than re-implementing what is known to already work. As with other things, the 80/20 rule applies. ARM has caught up a few orders of magnitude in the past decade. RISCV can easily do the same and give the lack of royalties. Meanwhile, the collaboration helps to share costs and discoveries which might mean progress will be even faster.
I'm not saying Apple can drop-in RISC-V front-end to their silicon and call it a day, but you get the idea.
Sifive has a pretty decent chance at making performant chips within next few years.
RISC-V in high performance computing is years out even if big players like Samsung or Qualcomm decided to switch today. New silicon takes time to develop.
And Nvidia really couldn't care less about the $1-$10 RISC-V microcontrollers that companies like SiPeed or Gigadevice are churning out today (and even there ARM micros are outselling these by several orders of magnitude).
> A 25% gain over a horizon of four years is not bad for your average investment -- but this isn't an average investment.
> First, compared to the SP500, this underperforms over the same horizon (even compared to end of 2019 rather than the inflated prices right now).
> Second, ARM's sector (semiconductors) has performed far, far better in that time. The PHOX (Philadelphia Semiconductor Index) doubled in the same time period.
> And looking at AMD and NVIDIA, it feels as if ARM would have been in a position to benefit from the surrounding euphoria.
That's the "valuation drops" - relatively to the market, ARM has significantly underperformed, despite the business being actually healthy and on the rise.
That's like throwing pennies onto a pile of gold. NVIDIA makes billions of yearly revenue. ARM makes ~300 million. NVIDIA revenue is 60% of a GPU price. ARM margins in IoT/embedded/phone chips are thin-to-non-existent. If anything, NVIDIA will need to cut GPU spending to push ARM to the moon. And the announcement already suggest that this will happen.
That margins on $1 microcontroller are "thin-to-non-existent" is thus completely irrelevant - those are margins of the silicon manufacturer, not ARM's.
GPU profits should be able to cover their own R&D, especially given the obscene prices on the high end cards.
Console makers dictated that RDNA must use a forward-compatible version of the GCN ISA. While AMD might have wanted to make some changes of some ideas that turned out to be less-than-optimal, they cannot because they are stopped by the console makers paying the bills.
The problem is more with smaller companies that could be destroyed before they even get a chance to compete. Those can be bullied pretty easily by a company the size of Samsung.
Not the one to rule them all is the key to every innovation.
It must be probably weird for Apple technical staff to communicate so closely with one division of Samsung while fighting for market share with another division of the same company.
What markup does Apple pay for Samsung OLED displays compared to Samsung’s other OLED customers? I think this is highly relevant if you want to use it as an example. Because if the markup for Apple is 5x that of other buyers of Samsung OLED displays then you certainly can’t say Samsung is “happy” to sell them to Apple.
Same for nVidia-owned-ARM: if they’re happy to sell ARM licenses at 5x the previous price, then that will surely increase sales for nVidia’s own chips. I guess my overall point is: a sufficiently high asking price is equivalent to a refusal to sell.
obviously nobody but Samsung and their customers will know that information, and anyone who could reveal it is under NDA.
Apparently the prices are good enough that Apple doesn't go elsewhere.
A bidding war of course.
On the surface, it's capitalism at work. In reality, Samsung winds up in a no-lose situation. If Motorola wins, Samsung gets bigger margins due to the battle. If Samsung wins, they play "pass around the money" with their accountants, but their only actual costs are those of production.
I'd note that chaebol wouldn't exist in a free market. They rely on corruption of the Korean government.
If they gave away cash, that's a different story, it all depends.. if they were sitting on $1tn cash, and the spent $40bn, that's no biggie. I mean we went through COVID, what worse can come next?
Well, you’re now getting 50% of the dividend produced by the new combined entity. If the deal was correctly priced, your share of the Arm dividend should exactly replace the portion of your Nvidia dividend that you lost through dilution.
Nvidia is essentially telling us that they think their shares are currently richly valued. I agree with that.
They could also think their shares are valued accurately but believe the benefits of synthesis would increase the value.
To be honest, I don't think that NVidia/ARM will screw over their Cortex-M0 or Cortex-M0+ customers over. I'm more worried about the higher-end, whether or not NVidia will "play nice" with its bigger rivals (Apple, Intel, AMD) in the datacenter.
https://www.nxp.com/part/FS32R274VCK2VMM
https://www.digikey.com/product-detail/en/nxp-usa-inc/FS32R2...
The two related devkits list for $529 and $4,123: https://www.digikey.com/products/en/development-boards-kits-...
--
Those processors make quite a few reference to an "e200", which I think is the CPU architecture. I discovered that Digi-Key lists quite a few variants of this under Core Processor; and checking the datasheets of some random results suggests that they are indeed Power architecture parts.
https://www.digikey.com/products/en/integrated-circuits-ics/...
The cheapest option appears to be the $2.67@1000, up-to-48MHz SPC560D40L1B3E0X with 256KB ECC RAM.
Selecting everything >100MHz finds the $7.10@1000 SPC560D40L1B3E0X, an up-to-120MHz part that adds 1MB flash (128KB ECC RAM).
Restricting to >=200MHz finds the $13.32@500 SPC5742PK1AMLQ9R has which has dual cores at 200MHz, 384KB ECC RAM and 2.5MB flash, and notes core lock-step.
--
After discovering the purpose of the "view prices at" field, the landscape changes somewhat.
https://www.digikey.com/products/en/integrated-circuits-ics/...
The SPC574S64E3CEFAR (https://www.st.com/resource/en/datasheet/spc574s64e3.pdf) is 140MHz, has 1.5MB code + 64KB data flash and 96KB+32KB data RAM, and is available for $14.61 per 1ea.
The SPC5744PFK1AMLQ9 (https://www.nxp.com/docs/en/data-sheet/MPC5744P.pdf) is $20.55@1, 200MHz, 2.5MB ECC flash, 384KB ECC RAM, and has two cores that support lockstep.
The MPC5125YVN400 (https://www.nxp.com/docs/en/product-brief/MPC5125PB.pdf) is $29.72@1, 400MHz, supports DDR2@200MHz (only has 32KB onboard (S)RAM), and supports external flash. (I wonder if you could boot Linux on this thing?)
The use some desoldering braid to soak up the excess solder. It will remove all the bridges and leave perfect joints.
Not to mention starting a JV has nothing to do with perpetual IP access. You will still have to pay for it.
I obviously don't know that for sure but the idea that Apple would stake their future on something they don't have a legal ironclad position seems unlikely.
Just because Apple and nVidia has bad relationship at the moment regarding their GPUs is probably orthogonal to what they'll do with this new branch of nVidia, that is ARM.
True about frenemies the entire time that Apple was suing Samsung, it was using Samsung to manufacture many of its components.
Yea, they got fucked by Nvidia business practices multiple times. There's a saying about shame on you, shame on me, etc. Unless the entire Nvidia business unit also gets replaced in the same transaction, it doesn't matter how much of a faux separation of concerns they want to market.
Offer customers iOS apps and games on the next MacBook as a straight swap for Boot Camp and Parallels. Once they’ve moved everyone over to their own chips and brought back Rosetta and U/Bs they’re essentially free to replace whatever they like at the architecture level.
In their reveal I noticed that they only mentioned ARM binaries running in virtual environments. It makes sense if you don’t want to commit to supporting GNU tools natively on your devices (as it would mean sticking with an established ISA)
Apple is large enough that if they want to break from ARM in the future they can do so by forking an LLVM backend. That's not a large job if it's a very small change, but once it's been done once they have plenty of resources to provide ongoing support (like they do for webkit).
The dividends of doing so are potentially massive. Given that they've been able to make really large gains with the A series chips to date on mobile (not least because they've been able to offload tasks to dedicated co-processors that general-purpose ARM cores don't ship with), the rewards for having chips that are a generation ahead of other like-for-like computers will outweigh the cost of maintaining the software.
I guess they have some ISA flexibility (which is remarkable). But not much; each transition was still a very special set of circumstances and a huge hurdle, I'm sure.
As in, most of the work occurs in the low-level parts of the Operating system. After that the OS should abstract the differences away from User-space.
First of all: there's lots of software that's not the OS. The OS is the easy bit: everything else: grindy, grindy horrorstory. A lot of that code will be third-party. And if you think, "hey, we'll just recompile!", and you can actually get them to too - well, good luck, but performance will be abysmal in many cases. Lots and lots of libraries have hand-tuned code for specific architectures. Anything with vectorization - despite compilers being much better than they used to be - may see huge bog downs without hand tuning. That's not just speculation; you can look at software that's gets the vectorization treatment or was ported to arm from x86 poorly - perfomance falls off a cliff.
Then there's the JITs and interpreters, of which there are quite a few, and they're often hyper-tuned to the ISA's they run on. Also, they can't afford to run something like LLVM on every bit of output; that's way too slow. So even non-vectorized code suffers (you can look at some of the .net core ARM developments to get a feel for this, but the same goes for JS/Java etc). Webbrowsers are hyper-tuned. regexengines, packet filters, etc etc etc
Not to mention: just getting a compiler like LLVM to support a new ISA as optimally as x86 or ARM isn't a small feat.
Finally: at least at this point, until our AI overloads render that redundant - all this work takes expertise, but that expertise takes training, which isn't that easy on an ISA without hardware. That's why Apple's current transition is so easy: they already have the hardware; and the trained experts some with over a decade of experience on that ISA!. But if they really want to go their own route... well, that's tricky, because what are all those engineers going to play around on to learn how it works; what's fast, and what's bad?
All in all, it's no coincidence transitions like this take a long time, and that's for simple (aka well-prepared) transitions like the one's Apple's doing now. Saying they have ISA "flexibility", like they're somehow interchangeable is completely missing the point on how tricky on those details are, and how much they're going to matter on how achievable such a transition is. Apple doesn't have general ISA flexibility, it has a costly route from specifically x86 to specifically ARM, and nothing else.
Also likely the small tweaks they will want from time to time should be "easy" to follow internally, if you can orchestrate everything from top to bottom and back.
(+ there's the can of worms of target-specific MSRs being writable from user-space, Apple does this as part of APRR to flip the JIT region from RW- to R-X and vice-versa without going through a trip to the kernel. That also has the advantage that the state is modifiable per-thread)
Please, stop spreading nonsense. All of this is public knowledge.
The Neural Engine is a completely separate hardware block, and you have good reasons to have such an extension available on the CPU directly, to reduce latency for short-running tasks.
Is it possible AMX is implemented with the implementation-defined system registers and aliases of SYS/SYSL in the encoding space reserved for implementation-defined system instructions? Do you have the encodings for the AMX instructions?
Let me repeat this: part of the ARM architectural license says that you can't modify the ISA. You have to implement a whole subset (the manual says what's mandatory and what's optional), and only that. This is, as I've been saying, public knowledge. This is how it works. And there are very good reasons for this, like avoiding fragmentation and losing control of their own ISA.
And once again, stop spreading misinformation.
In the case of Arm, the guarantees provided back in 2016 were legally binding, which is why we’re here, four years and another acquisition later, with Nvidia now eager to demonstrate it is standing by those commitments.
Maybe in this particular instance they did learn something?
But how can we evaluate the whether that will continue?
What if ARM is not sold, and then (for whatever reason) stagnates, doesn't innovate, gets overtaken in some way, and enters gradual decline?
Perhaps that's unlikely, but prevent the sale, period is feels too absolute.
More like 30-40. https://en.wikipedia.org/wiki/Private_equity_in_the_1980s
Specifically about the Apple case,
After your tone, not certainly obligated to answer but will write one quickly...
Apple A13 adds AMX, a set of (mostly) AI acceleration instructions that are also useful for matrix math in general. The AMX configuration happens at the level of the AMX_CONFIG_EL1/EL12/EL2/EL21 registers, with AMX_STATE_T_EL1 and AMX_CONTEXT_EL1 being also present.
The list of instructions is at https://pastebin.ubuntu.com/p/xZmmVF7tS8/ (didn't bother to document it publicly at least at this point).
Hopefully that clears things up a bit,
And please don't ever do this again, thank you. (this also doesn't comply with the guidelines)
-- a member of the checkra1n team
And then that was that. We haven't seen Nvidia GPU again.
That said: AMD GPUs have also had their share of issues on MacBooks.
A sample instruction: 20 12 20 00... which doesn't in any stretch parse as a valid arm64 instruction in the Arm specification.
Edit: Some other AMX combinations off-hand:
00 10 20 00
21 12 20 00
20 12 20 00
40 10 20 00
And the compiler doesn't even come close to having as much information as the CPU has. Which basically means that most of the VLIW stuff just ends up needing to be broken up inside the CPU for good performance.
But what have we learned in these past 20 years?
* Computers will continue to become more parallel -- AMD Zen2 has 10 execution pipelines, supporting 4-way decode and 6-uop / clock tick dispatch per core, with somewhere close to 200 registers for renaming / reordering instructions. Future processors will be bigger and more parallel, Ice Lake is rumored to have over 300-renaming registers.
* We need assembly code that scales to all different processors of different sizes. Traditional assembly code is surprisingly good (!!!) at scaling, thanks to "dependency cutting" with instructions like "xor eax, eax".
* Compilers can understand dependency chains, "cut them up" and allow code to scale. The same code optimized for Intel Sandy Bridge (2011-era chips) will continue to be well-optimized for Intel Icelake (2021 era) ten years later, thanks to these dependency-cutting compilers.
I think a future VLIW chip can be made that takes advantage of these facts. But it wouldn't look like Itanium.
----------
EDIT: I feel like "xor eax, eax" and other such instructions for "dependency cutting" are wasting bits. There might be a better way for encoding the dependency graph rather than entire instructions.
Itanium's VLIW "packages" is too static.
I've discussed NVidia's Volta elsewhere, which has 6-bit dependency bitmasks on every instruction. That's the kind of "dependency graph" information that a compiler can provide very easily, and probably save a ton on power / decoding.
It's not only about reducing the need for figuring out dependencies at runtime, but you could also partly reduce the need for the (power hungry and hard to scale!) register file to communicate between instructions.
Unless your CPU has a means for profiling where your pipeline stalls are coming from, combined with dynamic recompilation/reoptimization similar to IBM's project DAISY or HP's Dynamo.
It's not going to do well as out-of-order CPUs that make instruction re-optimization decisions for every instruction, but I wouldn't rule out software-controlled dynamic re-optimization getting most of the performance benefits of out-of-order execution with a much smaller power budget, due to not re-doing those optimization calculations for every instruction. There are reasons most low-power implementations are in-order chips.
The compiler has nearly all of the information that the CPU has, and it has orders of magnitude more. At best, your CPU can think a couple dozen cycles ahead of what it is currently executing. The compiler can see the whole program, can analyze it using dozens of methodologies and models, and can optimize accordingly. Something like Link Time Optimization can be done trivially with a compiler, but it would take an army of engineers decades of work to be able to implement in hardware.
The 200-sized reorder buffer says otherwise.
Loads/stores can be reordered for 200+ different concurrent objects on modern Intel skylake (2015 through 2020) CPUs. And its about to get a bump to 300+ sized reorder buffers in Icelake.
Modern CPUs are designed to "think ahead" almost the entirety of DDR4 RAM Latency, allowing reordering of instructions to keep the CPU pipes as full as possible (at least, if the underlying assembly code has enough ILP to fill the pipelines while waiting for RAM).
> Something like Link Time Optimization can be done trivially with a compiler, but it would take an army of engineers decades of work to be able to implement in hardware.
You might be surprised at what the modern Branch predictor is doing.
If your "call rax" indirect call constantly calls the same location, the branch predictor will remember that location these days.
The CPU has something the compiler can never have.
Runtime information.
That's why VLIW works great for DSP which is 99.9 % fixed access patterns, while being bad for general purpose code.
Each machine instruction on NVidia Volta has the following information:
* Reuse Flags
* Wait Barrier Mask
* Read/Write barrier index (6-bit bitmask)
* Read Dependency barriers
* Stall Cycles (4-bit)
* Yield Flag (1-bit software hint: NVidia CU will select new warp, load-balancing the SMT resources of the compute unit)
Itanium's idea of VLIW was commingled with other ideas; in particular, the idea of a compiler static-scheduler to minimize hardware work at runtime.
To my eyes: the benefits of Itanium are implemented in NVidia's GPUs. The compiler for NVidia's compiler-scheduling flags has been made and is proven effective.
Itanium itself: the crazy "bundling" of instructions and such, seems too complex. The explicit bitmasks / barriers of NVidia Volta seems more straightforward and clear in describing the dependency graph of code (and therefore: the potential parallelism).
----------
Clearly, static-compilers marking what is, and what isn't, parallelizable, is useful. NVidia Volta+ architectures have proven this. Furthermore, compilers that can emit such information already exist. I do await the day when other architectures wake up to this fact.
Speculative execution is something you would want to do with the itanium as well, otherwise the machine is going to be stalling all the time waiting for branches/etc. Similarly, later itaniums went OoO (dynamically scheduled) because it turns out, the compiler can't know runtime state..
https://www.realworldtech.com/poulson/
Also while googling for that, ran across this:
https://news.ycombinator.com/item?id=21410976
PS: speculative execution is here to stay, it might be wrapped in more security domains and/or its going to just be one more nail in the business model of selling shared compute (something that was questionably from the beginning).
His explanation centred on the fact that Intel decided early on that Itanium would only ever be an ultra high end niche product and only built devices which Intel could demand very high prices for. This in turn meant that almost no one outside of the few companies who were supporting Itanium development and certainly not most of the people who were working on other compilers and similar developer tools at the time, had any interest in working on Itanium because they simply could not justify the expense of obtaining the hardware.
So all the organic open source activity that goes on for all the other platforms which are easily obtainable by pedestrian users simply did not go on for Itanium. Intel did not plan on that up front (though in hindsight it seemed obvious) and by the time that was widely recognised within the management team no one was willing to devote the sort of scale of resources that were required for serious development of developer tools on a floundering project.
ISTR that Intel & HP spent well over a $billion on VLIW compiler R&D, with crickets to show for it all.
How much are you suggesting should be spent this time for a markedly different result?
https://dl.acm.org/doi/book/10.5555/923366 https://dl.acm.org/doi/10.1145/349299.349318
and many, many others (it produced so many PhDs in 90s). And, needless to say, HP and Intel hired so many excellent researchers during the heydays of Itanium. So I don't know on what basis you think there wasn't enough investment. So I have no choice but to assume you're ignorant of the actual history here, both in academics and industry.
It turns out instruction scheduling can not overcome the challenge of variable memory and cache latency, and branch prediction, because all of those are dynamic and unpredictable, for "integer" application (i.e. bulk of the code running on the CPUs of your laptop and cell phones). And, predication, which was one of the "solutions" to overcome branch misprediction penalties, turns out to be not very efficient, and is limited in its application.
For integer applications, it turns out the instruction level parallelism isn't really the issue. It's about how to generate and maintain as many outstanding cache misses at a time. VLIW turns out to be insufficient and inefficient for that. Some minor attempts are addressing that through prefetches and more elaborate markings around load/store all failed to give good results.
For HPC type workload, it turns out data parallelism and thread-level parallelism are much more efficient way to improve the performance, and also makes ILP on a single instruction stream play only a very minor role - GPUs and ML accelerators demonstrate this very clearly.
As for the security and the speculative execution, speculative execution is not going anywhere. Naturally, there are many researches around this like:
https://ieeexplore.ieee.org/abstract/document/9138997 https://dl.acm.org/doi/abs/10.1145/3352460.3358306
and while it will take a while before the real pipeline implements ideas like above thus we may continue to see some smaller and smaller vulnerabilities as the industry collectively plays whack-a-mole game, I don't see a world where the top of the line general-purpose microprocessor giving up on speculative execution, as the performance gain is simply too big.
I have yet to meet any academics or industry processor architects or compiler engineer who think VLIW / Itanium is the way to move forward.
This is not to say putting as much work to the compiler is a bad idea, as nVidia has demonstrated. But what they are doing is not VLIW.
The vast majority of the code written for any given computer or smartphone doesn't have this level of sophistication and optimization though. I'd wager that for most code just changing the build target will indeed mostly just work.
It won't be painless but modern code tends to be so high level and abstracted (especially on smartphones) than the underlying ISA matters a lot less than in the past.
This isn't a question of precise instruction timing, it's a question of compilers being pretty bad at leveraging SIMD in general, even in 2020. Also, while I'm sure lots of projects have hand-tuned assembly, even higher level stuff like intrinsics help a lot, and need manual porting.
Also - why would Apple massively up and change uArchs? Even if they did decide to turn Apple Silicon into not-ARM, I'd wager it would look a lot more like ARM than for example, x86 does.
https://www.marketwatch.com/investing/stock/nvda/financials/...
Their only other option would be taking on debt.
That's telling us something about what they think about their share valuation right now.
All it tells you is that they think it's preferable to taking on debt, which in some sense is the position you always start from. Debt has a deadweight cost that you have to overcome.
But we can test your theory. You're saying they thought their shares were overvalued. The market's reaction was the opposite - announcing the deal bumped their share price 7.5%.
Perhaps from your perspective, a VLIW architecture that fixes these problems wouldn't necessarily be VLIW anymore. Which... could be true.
That's why today's CPUs can have 4-way decoders and 6-way dispatch (AMD Zen and Skylake), because they can "pick up more latent parallelism" that the compilers have given them many years ago.
"Classic" VLIW limits your potential parallelism to the ~3-wide bundles (in Itanium's case). Whoever makes the "next" VLIW CPU should allow a similar scaling over the years.
-----------
It was accidental: I doubt that anyone actually planned the x86 instruction set to be so effectively instruction-level parallel. Its something that was discovered over the years, and proven to be effective.
Yes: somehow more parallel than the explicitly parallel VLIW architecture. Its a bit of a hack, but if it works, why change things?
Its really too bad the dev boards are so expensive but I get you need a lot of layers to route that sort of BGA.
There are still new chips using the Cortex-M3 today. Microcontroller devs do NOT want to be changing their code that often.
New chips move the core to lower-and-cheaper process nodes (and lower the power consumption), while otherwise retaining the same overall specifications and compatibility.
No different than say, Cortex-M0 or M0+ in many regards (although ARM scales down to lower spec'd pieces).
questionably from the beginning
Agreed. If you look at what's the majority of compute loads (e.g. Instagram, Snap, Netflix, HPC) then that's (a) not particularly security critical, and (b) so big that the vendors can split their workload in security critical / not security critical, and rent fast machines for the former, and secure machines for the latter.I wonder which cloud provider is the first to offer this in a coherent way.
Since they're not branding it as ARM in any way, shape, or form, and they have a perpetual architectural license to the ISA, I suspect they could do pretty much what they please - as long as they don't call it ARM. Which they don't.
Even if not, then yes, I would imagine that a behemoth of IP design like Apple would probably have considered IP rights during that design...
Prior to that it was a publicly traded company, presumably with a diverse array of international shareholders.
There are more ARM chips sold each year than those of all its competitors together. Yet ARM's revenue is 300 million $.
Why? Because ARM lives from the ISA royalties, and their revenue on the cores they license is actually small.
With RISC-V on the rise, and west sanctions against china, RISC-V competition against ARM will only increase, and it is very hard to compete against something that's good / better and has lower costs (RISC-V royalties are "free").
I really have no idea why NVIDIA would adquire ARM. If they want a world-class CPU team for the data-center, ARM isn't that (Graviton, Apple Silicon, Fujitsu, etc. are built and designed by better teams). ARM cores are used by Qualcom and Samsung, but these aren't world-class and get beaten every gen by Apple Silicon. If they want ARM royalties, that's high risk business, and very low reward (there is little money to make there).
The only ok-ish cores ARM makes are embedded low-power cores (not mobile, but truly IoT < 1W embedded). Hard to imagine that an architecture like Volta or Ampere that perform well at 200-400W would perform well at the <1W envelope. No mobile phone in the world uses nvidia accelerators, and mobile phones are "supercomputers" when compared with the kind of devices ARM is "ok-ish" at.
So none of this makes sense to me, except if NVIDIA would want to "license" GPUs with ARM cores to IoT and low power devices like ARM does, but that sounds extremely far-fetched, because nvidia is super-far away from a product there, and also because the margins for those products are very very thin, and nvidia tends to like 40-60% margins. You just can have those when buying IoT chips for 0.12$. Its also hard to sell a GPU to these use cases because they often don't need it.
Graviton uses Neoverse CPU cores, which are designed by ARM. To say that ARM is not a world-class CPU team is unfair. Especially as Ampere just announced an 80 core datacenter SoC using Neoverse cores.
Revenue is not $300, we don't know what ARM's revenue is because it hasn't been published since 2016. And back then it was like $1.5 billion. $300 million was net income. Again, in 2016.
I think you've already been adequately corrected on your misconceptions about ARM's CPU design teams.
The latest Fujitsus HPC offering the A64FX is also ARM based though.[1][2] And it sounds as though this is replacing their SPARC64 in this role .
VLIW is a compromise product: its more parallel than a traditional CPU, but less parallel than SIMD/GPUs.
And modern CPUs have incredibly powerful SIMD engines: AVX2 and AVX512 are extremely fast and parallel. There are compilers that auto-vectorize code, as well as dedicated languages (such as ipsc) which work for SIMD.
Encoders, decoders, raytracers, and more have been rewritten for Intel AVX2 SIMD instructions, and then re-rewritten for GPUs. The will to find faster execution has always existed, but unfortunately, Itanium failed to perform as well as its competition.
GPUs do work like this – shaders recompiled all the time – so VLIW was used in GPUs (e.g. TeraScale). But on CPUs we have a world of optimized, "done" binaries.
I'm talking about Volta's ability to detect dependencies. Which is null: the core itself probably can't detect dependencies at all. Its entirely left up to the compiler (or at least... it seems to be the case).
AMD's GCN and RDNA architecture is still scanning for read/write hazards like any ol' pipelined architecture you learned in college. The NVidia Volta thing is new, and probably should be studied from a architectural point of view.
Yeah, its a GPU-feature on NVidia Volta. But its pretty obvious to me that this explicit dependency-barrier thing could be part of a future ISA, even one for traditional CPUs.
https://www.anandtech.com/show/5699/nvidia-geforce-gtx-680-r...
> NVIDIA has replaced Fermi’s complex scheduler with a far simpler scheduler that still uses scoreboarding and other methods for inter-warp scheduling, but moves the scheduling of instructions in a warp into NVIDIA’s compiler. In essence it’s a return to static scheduling
and I think this is describing more or less the same thing in Maxwell: https://github.com/NervanaSystems/maxas/wiki/Control-Codes
Hi! Counterexample here.
Especially because of the x86 oligopoly I would think that Arm is so much more important as an ecosystem.
https://fuse.wikichip.org/news/3099/centaur-unveils-its-new-...
IIRC you watch the "Rise of the Centaur" documentary they talk about the Intel lawsuit, and the corresponding counter suit that they won. Which makes the whole thing sound like MAD.
More interesting there is https://en.wikichip.org/wiki/zhaoxin/kaixian
In some ways, it actually frees them up a bit; as they don't have to reciprocate any efforts with Intel. Which they tried to leverage with their Padlock technology. Unfortunately, their marketshare limits any real practical usage of those benefits.
But, given the high market share of Arm in several markets allowing one firm the ability to use that market share to gain competitive advantage in related markets seems to me to be deeply problematic.
- They don't license because they can make a lot more money manufacturing the chips themselves.
- AMD also has the right to x86 because Intel originally allowed them to build x86 compatible chips (some customers insisted on a 'second source' for cpus) and following legal action and settlements between the two companies over the years there is now a comprehensive cross licensing agreement in place. [1]
- Note that AMD actually designed the 64 bit version x86 that is used in most laptops / desktops and servers these days.
[1] https://www.kitguru.net/components/cpu/anton-shilov/amd-clar...
IBM required it. It was their business MO to protect themselves from losing access to a technology or having the market be dictated by one company. Intel acquiesced so that they would be the architecture of the IBM PC-series.
Not that they would complain if AMD lost their license or ceased existing, they just don't seem to actively be trying to cancel the license at this point.
Only in countries with poor regulators like the states does it work like this.
In reality Arm commoditised in many markets CPUs by making reasonable designs available to all at reasonable cost, keeping control over the ISA and allowing firms to innovate in their implementations. You can have the same code running on a Raspberry PI, an iPhone and a 64 core Graviton2 server.
The Nvidia takeover threatens all this by giving control to a firm who could well 'unlevel' the playing field and even refuse to offer the latest IP to competitors.
RISC-V may provide a way out for firms unhappy with Nvidia but it could be a bumpy path. And its certainly not the case that Nvidia are paying $40bn to consolidate Mali with their own graphics IP.
It works like that in Japan: say hello to the many giant conglomerates that rule their economy top to bottom.
It works like that in South Korea: say hello to Samsung, roughly 12% of South Korea's GDP in a given year (Walmart by contrast is equal to 2.5% of US GDP, and that's crazy big).
It works like that in Germany: say hello to a parade of big old industrial giants that have dominated their economy for most of the past century and will continue to.
It works like that in China, openly so: they intentionally go out of their way to promote giant national champions at the expense of everyone else.
It works like that in France: their largest corporations and richest individuals are even larger in relation to their economy and national wealth than they are in the US (say hello to Arnault, Bettencourt, Pinault and the Wertheimers).
It works like that in Russia: say hello to the countless, directly state protected oligarchs. Threaten their interests, you die. Their approach is super simple.
It works like that in Italy and Spain, which are both dominated by old, large corporations and family interests. Which heavily explains their forever economic stagnation.
It even works like that in Switzerland: ever see how large their financial companies are in relation to the economy? Who do you think actually runs Switzerland? Their banks are comically outsized versus the size of the economy.
It completely works like that in Brazil and India.
It works like that across all of the Middle East, to a much greater degree than most anywhere else.
It works like that in second tier economies, including: Poland, Mexico, Argentina, Romania, Turkey, Thailand.
It works like that in poor countries, including: Vietnam, Indonesia, Philippines, Ukraine, South Africa, Pakistan, Bangladesh.
https://www.reuters.com/article/us-ilva-m-a-arcelormitta-eu/...
Might and reality is not right.
We need competition. We adore competition. And even the countries you quote many do have competition. You just do not innovate and of course one can try to stop and rest. But the works does not.
Be water not mountain my friend.
How exactly is this only like in the states?
You get instructions scheduled based on actual dynamically measured usage patterns, but you don't pay for dedicated circuits to do it, and you don't re-do those calculations in hardware for every single instruction executed.
It's not a guaranteed win, but I think it's worth exploring.
Morpheus: No, Neo. I'm trying to tell you that when you're ready, you won't need "call rax".
---
Compiler has access to optimizations that are at the higher level of abstraction than what CPU can do. For example, the compiler can eliminate the call completely (i.e. inline the function), or convert a dynamic dispatch into static (if it can prove that an object will always have a specific type at the call site), or decide where to favor small code over fast code (via profile-guided optimization), or even switch from non-optimized code (but with short start-up time) to optimized code mid-execution (tiered compilation in JITs), move computation outside loops (if it can prove that the result is the same in all iterations), and many other things...
My point was simply that the compiler may be in position to disprove the assumption that this call is in fact dynamic (it may actually be static) or that it has to be a call in the first place (and inline the function instead).
I'm certainly not arguing against branch predictors.
Intel did change sockets as a means to disallow socket-compatibility; forcing consumers into their architecture if they bought their motherboard, but that had no effect on AMD's development. AMD had purchased a significant share of DEC's engineering portfolio and, along with it, their employees. Those employees then developed the K7 (Athlon) architecture around some of the Alpha's technological advantages, which included HyperTransport, a multi-issue FPU (fixing one of the major issues AMD had struggled with and bringing them ahead of Intel), etc.
The problem with waiting for the patents to expire is you're always stuck 20 years behind.
Being 20 years behind isn't a particularly big deal if those 20 years are basically just SSE3, SSE4, AVX.
Uuuuh, presumably the case was about patents, right? I don't see how cleanroom-ing is fine with regard to patents.
Summary: Cyrix (and subsequently, VIA) have an implied license to Intel's patents; allowing them to develop x86 hardware.
How that differs from AMD: AMD and Intel have a full cross-license on technologies. This means, AMD can utilize Intel resources to integrate AVX-512, a ring-bus, etc (and vice versa). VIA can not. They must develop those technologies independently, in a compatible manner.
Oddly enough, integrated circuits have their own IP scheme. IP has copyright, patent right, trade secrets, trademarks, and mask works. You rarely learn about mask works, since they're so narrow.
https://en.wikipedia.org/wiki/Integrated_circuit_layout_desi...
I think a lot of this would hinge on that corner of the law.