I Think They [Anthropic] Are Lying to You [video]

I Think They [Anthropic] Are Lying to You [video](youtube.com)

61 points by salutis 2 hours ago | 43 comments

mikgp 38 minutes ago |

The one thing that gets me is - Boris must know what everyone is thinking when he says he merges 300 MR’s per day, but I think everyone knows it doesn’t mean what he’s implying - it just can’t. He can’t read 300 open source contributions per day to determine if they fit into the spirit of Claude Code. Even if he’d fully automated the testing and integration process. And 300 people per day submitting contributions?

He could mean a few other thingsc one would be, he has like 20 version tags and he merges 15 features into each version (does he explicitly say merge to main?)

The other thing he could mean is he has like a software engineering agent and that software engineering agent like loops through GitHub issues and his personal notepad and maybe uses a few different branches to test things out and build I dunno adversarially, running all sorts of experiments.

Which would be genuinely cool! But using Mr’s to say tweak bunch of variables back and forth isn’t what 300 MR’s implies.

But then the ultimate question is - it may be cool to fully automate a software engineering agent, and certainly is the type of research Anthropic and someone of Boris’ stature (and pay level) should be working on. But is it efficient?

I guess yes hems talked about this:

https://karozieminski.substack.com/p/boris-cherny-claude-cod...

orangebread 1 hour ago |

I think this guy is using AI differently than me. Since Opus 4.6 and GPT 5.3, I have been able to absolutely crush my coding work. Boris might be embellishing how he ONLY writes loops, but for the most part I am just handing off planning docs to Claude or GPT and they implement it with like 95% accuracy.

A lot of you don't want to hear it but this is a user issue.

reinitctxoffset 48 minutes ago | |

I read this a lot and it is just very foreign to me. I use AI systems in software work all day seven days a week and my job has become simultaneously more interesting and more difficult because I scale the ambition up until it's hard again.

Isn't anything else a surrender to irrelevance? I agree that many coding tasks that were previously effort intensive are now not effort intensive, but there's no ceiling I'm aware of on how correct and performant and economical and capable software can be short of saturating the hardware.

And the emergence of agentic intelligence at scale demands new regimes of performance and correctness and economy like maybe nothing else ever has.

I have an anecdote related to TUI flickering in that my TUI library had a flickering problem because it was doing more than 10k FPS, and so I had to lock the buffer swap to the vsync to stop it tearing.

AI coding didn't make more React too cheap to meter, it made notcurses bound into Trinity-inspired deterministic replay event substrate over io_uring possible.

https://youtu.be/YqgEtpJ8tGI?feature=shared

ryan_n 11 minutes ago | |

> A lot of you don't want to hear it but this is a user issue.

Hmm people say this all the time in these discussions but idk if I buy that it's really a "user issue". It's really, really not hard to "use" agentic ai. It literally involves instructing an llm to do things in natural language. Anyone who knows how to code and speak a language can do this. As you yourself seem to believe, even people who don't know how to code can do this. I just don't think it's possible that THAT many people are having an issue typing some words to instruct an llm to write some code. Maybe the issue is more the type of software you are working on vs the type of software that other people are working on. I don't know, I just don't think "skill issue" is really a valid argument here...

Edit: for the record, I think what Opus can produce is extremely impressive. But I still am not really close to letting write 100% of code I write. And I think that is true for a lot of people, not just me. It still generates (sometimes obvious) bugs. Until that stops, the statement "coding is solved" is objectively false, which (I think?) is largely the point of the video.

nitwit005 1 hour ago | |

He's discussing Anthropic struggling to fix an issue with their own product. He's not the one struggling.

Jeremy1026 10 minutes ago | |

This guy has been against AI from the very onset, and no matter what happens with AI going forward I think he'll always poopoo it.

mikgp 48 minutes ago | |

You have planning docs?

I am in no way surprised a sufficient waterfall method passed to Claude code could result in a completely accurate application. But most applications aren’t built via waterfall for all the reasons.

Also agents are just loops. So if you use Claude Code you are doing. Everything with a loop. So I do believe him but it’s a weird flex.

slopinthebag 33 minutes ago | |

If Anthropic has some of the best and most highly paid software engineers on the planet working on a simple program (terminal app) with virtually unlimited tokens for the strongest coding model, and they still ship a sloppy buggy mess, what does that say about the quality of code you are outputting?

A lot of you don't want to hear it, but you aren't doing better than Anthropic so unless your use case is ridiculously simple, Claude Code is the ceiling for what you're creating, and if what you're doing is at all complex, the ceiling is much much lower.

taurath 57 minutes ago | |

Sorry, but context rot is real, and I’d be curious how your code is playing out in the real world. Is it shipping? Is it a known product with stable docs? Is it greenfield?

Aspects of coding are faster certainly, but oh gosh can it get very wrong very fast when things go sideways, and with everyone using it, the chaos factor compounds into a near halt.

dools 51 minutes ago | | |

When you hand something off to Claude code, the harness is doing lots of different sessions it’s not a one shot.

bag_boy 1 hour ago | |

Can you give me your use case here? I have not gotten around to trying loops in Claude code but have started to notice the hype.

themafia 52 minutes ago | |

> just handing off planning docs to Claude or GPT and they implement it with like 95% accuracy.

Do you have any publicly available demonstrations of this claim?

> A lot of you don't want to hear it

That there are skill differences in the use of technology? On the contrary this knowledge makes me suspicious of undocumented claims like yours.

> this is a user issue.

Another claim I wish was quantified. With all the billions invested I assumed this would naturally come to exist. I may have just missed it. Any pointers?

calvinmorrison 47 minutes ago | | |

> Do you have any publicly available demonstrations of this claim?

Yeah I mean for example I wrote up a new audio mixer application for TDE using basically claude and just saying - hey rewrite the old ALSA one with Pulse/Pipewire.... its awesome. I dont know how it works.,

triyambakam 22 minutes ago | |

I am definitely hyped on it and have 10-20 Codex and Claude Code terminals open, but I do wonder what you're building and with who that you would say 95% accuracy. I get so frustrated with Codex inventing new ways to do something every time it compacts or start a new session.

"You're right, I shouldn't have done that" Ya think?!

thraway3837 1 hour ago | |

Yup. The only caveat I'd add is that I'm using an alternate account to agree with people who say that AI coding has been amazing, because there is a seemingly a good chunk of people who dislike it and it will be met with downvotes. Also because my real account has my real name in the profile along with projects I work on, and a simple search could reveal my pro-AI coding views and these same folks who downvote could also be a future interviewer.

I think the world changed. And it's changed for the good. AI is a tool, and we should not be afraid of this tool for the coding world. I am only speaking about coding, I'm not speaking about other uses of AI, just so that we're clear on the scope of what I mean by good change.

For the first time, I see people who had all these ideas finally bring them to reality and watch it blossom. They wanted to build something to share with their communities, but the walls were too high. Too much gatekeeping. Too much of thinking that programming was a task for the elite few and not for the masses. Along the way, we all forgot that we build tools for people. And having an additional tool help us make better tools for people is a win. Just below this comment, I see people talking about dementia, "lots more generated code, almost all of it garbage", "future where garbage software".

I think the only delusional ones are the idea that humans were better at coding. Have you never had to work on an older project? One that you did not have to start fresh on? Or did you come into either one and go "wow, this is perfect! everything is so beautiful!" Do you seriously consider your fresh project (that didn't use AI) to be the best most perfect beautiful code ever?

The fact is that nobody cares. People want to use good things and have fun with their lives. They're not worried about whether you wrote a method that parses some strings beautifully or did it with a one-liner. That never mattered, and I think a lot of you can't let go of that world view change and instead lash out at people who simply embrace that programming was simply a tool, not some elite special skill. And we're going back to those beliefs. It's done. It's over. Get over it.

scooby7430 43 minutes ago | | |

Agreed. I think the opinion on AI is split into two camps, people who's enjoyment from programming came from writing the code and people who like building things. It's really undeniable at this point that AI has changed the job and I really enjoy it now more than ever, I can come up with an idea, guide an LLM through the steps to build it and have it be a real thing faster than I ever could have imagined.

Yeah LLMs aren't perfect, there is back and forth along the way and if you just let it loose you are going to end up with slop but I feel like we can achieve better quality now in a shorter amount of time using the tool properly. I'm not sure if I am just naive but I am really excited about the possibilities now and have been spending more time than ever building what I want. I used to think that writing the code was the enjoyable part for me but I think it was just building things.

I empathize with people in the other camp who got into it for the love of the code and now that part of the job is being taken away but I think it would benefit them to be honest about LLMs and try and work out a path forward here rather than just "my function is better than an LLM one, LLMs are just slop machines"

zingababba 1 hour ago | | |

Well said. Most of my career I made a trade-off and that trade-off was that I would much rather spend my free time outside in nature than on implementing my wild ideas which I always recognized would take considerable time. I'd maybe take 1% of my ideas anywhere. Now I can play with ideas while I'm out on the trail and turn those into something I can test within a couple hours, it's the most fun I've had on computers since the very beginning.

bitwize 2 hours ago |

Pretty much the same point I made: https://news.ycombinator.com/item?id=48403908

Their apparent inability to get the basics right makes me severely doubt their claims of self-improving AI. The humans at Anthropic wouldn't know improvement if it landed on their lap and started twerking, and AI cannot do a job without strong human intervention into what the goals and guardrails actually are.

I'm kind of reminded of when Microsoft claimed it took a team of Ph.D.s to write a terminal application that updated at 60fps, and then Casey Muratori did it over a weekend. And this was before AI was writing code in earnest; when LLM-induced brainrot really sets in, civilization is in for a world of fresh hurt: lots more generated code, almost all of it garbage. And the promised AI crossover point where it becomes AGI, or indistinguishable from for software design purposes, recedes into the infinite future.

nhinck2 1 hour ago |

I mean you can go through Boris' history here on hn to see he is a liar.

antonvs 2 hours ago |

It goes beyond lying. It's kind of war, and they're the aggressor.

Everyone else needs to start treating them that way, or you're going to regret it once you realize what's actually happening.

phendrenad2 1 hour ago | |

Please please tell us so we're prepared. sad puppy eyes

ggm 1 hour ago |

We've reached peak stupidity when a supposedly reasoned case about "AI bad" has to be proffered .. in video.

If you want my attention tonight, surely then "put more effort in" applies here too?

I was a low bar target: I already think AI coding is a mistake. But I want to read about it. Not listen to it with megabits of associated video I don't want to watch either.

Tag as "rage bait" and move on I did not like, I did not subscribe.