Does coding with LLMs mean more microservices?

ben.page

42 points by jer0me 10 hours ago


iainmerrick - an hour ago

Like almost all of these articles, there's really nothing AI- or LLM-specific here at all. Modularization, microservices, monorepos etc have all been used in the past to help scale up software development for huge teams and complex systems.

The only new thing is that small teams using these new tools will run into problems that previously only affected much larger teams. The cadence is faster, sometimes a lot faster, but the architectural problems and solutions are the same.

It seems to me that existing good practices continue to work well. I haven't seen any radically new approaches to software design and development that only work with LLMs and wouldn't work without them. Are there any?

I've seen a few suggestions of using LLMs directly as the app logic, rather than using LLMs to write the code, but that doesn't seem scalable, at least not at current LLM prices, so I'd say it's unproven at best. And it's not really a new idea either; it's always been a classic startup trick to do some stuff manually until you have both the time and the necessity to automate it.

areys - 14 minutes ago

The context window framing is compelling but I think the more durable pressure is the "blast radius" problem — not whether the model can hold your whole codebase in working memory, but whether a confident-but-wrong agent can cause cascading failures across the whole system.

Even as context windows hit 1M+ tokens, you'd probably still want bounded rollback units. Clean module interfaces help less with the comprehension problem and more with the containment one. Whether those modules get a network boundary is mostly an ops tradeoff — the architectural principle is the same either way.

jensneuse - 39 minutes ago

One thing I find interesting is how GraphQL has evolved from an API technology for API consumers with "different needs" to an API technology for agents. What helped organizations scale GraphQL across multiple teams is Federation, a way to split one supergraph into multiple subgraphs. So, what works well to scale teams actually works equally well for agents. The core value you can get from Federation is a "coordination" layer that is deterministic. Now, what's interesting is that you can scale agentic software development pretty well when you have a deterministic layer where everyone involved can agree. I wrote more about this on our blog if anyone is interested: https://wundergraph.com/blog/graphql-api-layer-for-ai-agents

nikeee - 3 hours ago

What matters for LLMs is what matters for humans, which usually means DX. Most Microservice setups are extremely hard to debug across service boundaries, so I think in the future, we'll see more architectural decisions that make sense for LLMs to work with. Which will probably mean modular monoliths or something like that.

int_19h - 3 hours ago

That's an argument for components with well-defined contracts on their interfaces, but making them microservices just complicates debugging for the model.

It's also unclear whether tight coupling is actually a problem when you can refactor this fast.

tatrions - 4 hours ago

The bounded surface area insight is right, but the actual forcing function is context window size. Small codebase fits in context, LLM can reason end-to-end. You get the same containment with well-defined modules in a monolith if your tooling picks the right files to feed into the prompt.

Interesting corollary: as context windows keep growing (8k to 1M+ in two years), this architectural pressure should actually reverse. When a model can hold your whole monolith in working memory, you get all the blast radius containment without the operational overhead of separate services, billing accounts, and deployment pipelines.

veselin - 2 hours ago

I think this is a promise, probably also for spec driven development. You write the spec, the whole thing can be reimplemented in rust tomorrow. Make small modules or libraries.

One colleague describes monolith vs microservices as "the grass is greener of the other side".

In the end, having microservices is that that the release process becomes much harder. Every feature spans 3 services at least, with possible incompatibility between some of their versions. Precisely the work you cannot easily automate with LLMs.

Theaetetus - 2 hours ago

I don't think LLMs push us to use microservices as much as Borgers says they do. They don't avoid the problems microservices have always faced, and encapsulation is mostly independent from whether a boundary is a service-to-service boundary:

https://www.natemeyvis.com/agentic-coding-and-microservices/

siruwastaken - 4 hours ago

This seems like the idea of modularizing code, and using specific function sighatures for data exchange as an API is being re-invented by people using AI. Aren't we already mostly doing things this way, albeit via submodules in a monolith, due to the cognitive ctrain it puts on humans to understand the whole thing at any given time?

victorbjorklund - 2 hours ago

I think no. But I think it makes sense to break down your app into libraries etc

_pdp_ - 4 hours ago

This makes no sense. You can easily make a monolith and build all parts of it in isolation - i.e. modules, plugins, packages.

In fact, my argument is that there will be more monolith applications due to AI coding assistants, not less.

c1sc0 - 4 hours ago

Why microservices when small composable CLI tools seem a better fit for LLMs?

Kim_Bruning - 3 hours ago

A typical rant (composed from memory) goes something like this:

> "These AI types are all delusional. My job is secure. Sure your model can one-shot a small program in green field in 5 minutes with zero debugging. But make it a little larger and it starts to forget features, introduces more bugs than you can fix, and forget letting it loose on large legacy codebases"

What if that's not a diagnosis? What if we see that as an opportunity? O:-)

I'm not saying it needs to be microservices, but say you can constrain the blast radius of an AI going oops (compaction is a famous oops-surface, for instance); and say you can split the work up into self-contained blocks where you can test your i/o and side effects thoroughly...

... well, that's going to be interesting, isn't it?

Programming has always supposed to be about that: Structured programming, functions (preferably side-effect-less for this argument), classes&objects, other forms of modularization including -ok sure- microservices. I'm not sold on exactly the latter because it feels a bit too heavy for me. But ... something like?

novachen - 26 minutes ago

[dead]

claud_ia - 3 hours ago

[dead]

jeremie_strand - 10 hours ago

[dead]

benh2477 - 10 hours ago

[dead]