Claude Is Not Your Architect. Stop Letting It Pretend

Claude Is Not Your Architect. Stop Letting It Pretend(hollandtech.net)

29 points by cdrnsf 1 hour ago | 12 comments

retrac 57 minutes ago |

For fun I've been vibe coding something I know well: toolchains. Maybe not the right thing to vibe code. But I can more or less judge the quality of the output.

When left to its own devices with the instructions "make an assembler for the architecture in ISA.md" -- well it picked Python as the implementation language. Tokens lifted through a bunch of regex. No expression parser! Oh dear. My first assembler was like that too, to be fair.

However, when I described the desired passes and their types:

    collectDefines :: [SourceLine] -> Either AsmError ([SourceLine], Map Text Text)
    
    runLitPool :: [SourceLine] -> Either AsmError ([SourceLine], [(Text, LitKey)])
    
    evalExpr :: Text -> Map Text Text -> Either AsmError Int

etc. It was almost one-shot. About 20 minutes until I was happy. Assembles all the test programs correctly. Code is mediocre in many places. But it would have taken me weeks to implement.

bluegatty 21 minutes ago | |

So where AI has deterministic inputs and outputs it is extremely good to the point I think that there's a theoretical issue around computational there.

Like - it can do the work for us.

It jives with post training and verifiable rewards.

The reason AI doesn't do well at 'architecture' is 1) are are bad at it and have given it a lot of mush and 2) we don't have good abstractions for it.

The result is - you stick to 'very strong conventions' and if you walk of that path you're risking a lot.

Toolchains are very deterministic, the AI can take it apart and re-assemble like Lego - and each level of the space is also deterministic. It's perfect for AI.

mlinhares 36 minutes ago | |

I keep telling people that they have to design and think about it first and then go to the tool, but they keep saying “Claude can plan too” and obviously it produces some shit that requires a lot of changes while when I get it to go I can almost always one shot the stuff I want because I am actually putting in the time to give it a detailed plan of what to do.

Even just saving me the time to deal with CI is worth it.

allthetime 28 minutes ago | | |

Effective planning with LLMs isn’t prompting “design me a system” - it’s asking “how would a system to accomplish x be designed” and then engaging in dialogue and research with the LLM as an assistant and critic - running outputs through other agents for further critique and refinement - asking for justifications of decisions you are not informed enough to evaluate properly yourself. It is entirely possible to develop strong systems outside of your current skill and knowledge with methods like this. When done properly your own knowledge should have grown to meet the product you end up with.

joe_mamba 39 minutes ago | |

>Code is mediocre in many places.

As if code written by devs at major corporations is flawless.

Nokia's Symbian OS took days to build. Days. With a D. Not hours.

One of our devs shipped code to prod with a memory leak thanks to a library that had "do not use this library in production because it causes a memory leak" written everywhere.

So I don't wanna hear about how poor AI code is when human code is dogshit too. Human laziness and stupidity can beat AI hallucinations.

bad_username 33 minutes ago |

I think the article has the correct message, but I disagree with this:

> It’s just incapable of the thing that makes a real architect valuable: saying “no.”

From my experience Claude is excellent at saying "no". It won't say "no" if the prompt doesn't call for it (it won't say "no" to your direct request to do something, usually). But it offers good critique and happily pushes back if you make it clear that that's a first class option.

brookst 23 minutes ago | |

Same here. And I find that inviting research and dissent makes it even stronger. “I’m thinking we need to model prompt assembly as a graph, with versioning for graph configs. Please do some research on best practices in this area and see if you think it makes sense for this app.”

laszlojamf 36 minutes ago |

I keep hearing that claude is supposedly so agreeable. This doesn't agree with my experience. Claude will often tell me that I'm wrong, and insist on its own solution being right even when I tell it it's wrong.

Waterluvian 34 minutes ago | |

I’ve been doing amateur game dev as a way to explore Claude and I’ve found it to be quite reasonable about when it agrees and disagrees.

It will tell me a suggested abstraction is probably overkill and just to make a component own the new thing I’m discussing.

What I’m missing from the loop is it later saying without directly prompting, “hey it’s time to revisit that abstraction idea.”

skybrian 35 minutes ago |

Sometimes it will make a mess, but a coding agent is also very useful during the cleanup phase.

Yes, that's assuming you take time to clean up now and then. If you don't, that's on you.

CPLX 29 minutes ago |

I agree with the article, but I feel like this is something that anyone who uses AI aggressively for a while picks up on pretty quickly.

The thing that I find Claude incredibly good at when I'm designing architecture is working more like a research assistant on briefing decisions. It has the ability to read the entire code base and draw some conclusions. It can pull from lots of best practices and the millions of blog posts about this or that pretty effortlessly, which would take me a lot more time.

And then if asked, it can do a really good job of laying out the landscape around decisions and walking through the trade-offs. Like the author of this post, I found that if you let it, it will certainly be happy to just come up with some architecture and run with it, often in ways that will paint you quite rapidly into a corner.

But if you ask it to present you with all the trade-offs and let you make the judgment calls, it's great for that too.

That's certainly how I use it. And I think, just like anything else, working with AI is a skill, and similar to working with libraries, SaaS providers, service providers, frameworks, or anything else that's a "helper." You learn how something that could work but will fail silently is a problem, or you learn how depending on a fly-by-night SaaS company for a key framework is different than depending on a well-populated open source project, etc.

In the same way, you learn that relying on Claude's judgment is a bad idea, while relying on Claude's ability to summarize, brief, and research can be incredibly efficient.

collectDefines :: [SourceLine] -> Either AsmError ([SourceLine], Map Text Text) runLitPool :: [SourceLine] -> Either AsmError ([SourceLine], [(Text, LitKey)]) evalExpr :: Text -> Map Text Text -> Either AsmError Int