Tools 3 MIN READ

Curating Your Technical Stack for 2026

The age of bloated AI stacks is over. A pragmatic guide to selecting the small set of tools that will actually compound.

Gaurav Goel
Minimalist still-life arrangement of curated objects representing a refined toolset.

Walk into the engineering bay of any company that adopted AI early and you will see the same thing: a stack that has grown too fast. Three vector stores. Two orchestration frameworks. A handful of agent libraries that solved the same problem on different Tuesdays. The team is competent. The system is incoherent. This is not a tooling problem. It is a curation problem.

The first wave of generative AI rewarded breadth. Teams that tried everything found something that worked. The second wave punishes breadth. Maintenance cost compounds. Onboarding gets harder. New work slows down. The most productive engineering teams I have worked with in the past year share a single, unfashionable trait: their stack is smaller than their competitors’ and changes less often.

A stack worth keeping

A useful technical stack in 2026 has fewer parts than the ones being marketed to you. In practice, that means choosing one option in each of the following layers and resisting the urge to evaluate a second one until the first has demonstrably failed.

  • A model provider. One primary, one fallback. Stop ablating.
  • An orchestration layer. Whatever your senior engineer can debug at 2am.
  • A retrieval system. Pick the boring one with the longest production track record.
  • An evaluation harness. Not optional. The teams that ship reliably have one; the teams that do not, do not.
  • A deployment target. The same one you already use for the rest of your software.

That is the stack. Five layers, one choice per layer, swapped only with cause. The teams that operate this way move faster than the teams running comparative benchmarks every six weeks. Speed compounds.

We deleted four libraries last quarter. Velocity went up. Nobody noticed they were gone.

Engineering Lead, infrastructure team

The cost of optionality

The argument for a broader stack is usually framed as flexibility. We need to be ready, the reasoning goes, for the next breakthrough. We need to keep our options open. This sounds like wisdom. In practice it is a way of paying for capability you do not use.

Optionality is not free. Every additional dependency requires monitoring, patching, security review, onboarding documentation, and a person who knows how it behaves under load. A stack with eight competing tools is not eight times as flexible. It is eight times as expensive to operate, and the team’s collective attention is split across surfaces that, mostly, no one is touching.

The cure is not minimalism for its own sake. The cure is opinionation. Have an opinion about what you build with. Defend it. Revisit it on a schedule rather than every time a new framework launches. The teams that compound their advantage do so by making fewer, longer-lived choices — and then by doing more interesting work on top of them.

What to buy, what to build

The last principle is the oldest one. Buy the commodity layers. Build the layers that are the product. The stack is not where the differentiation lives. The differentiation lives in the workflow your team has designed on top of the stack — the prompts, the evaluation rubrics, the escalation policies, the data your customers cannot get elsewhere.

A curated stack does not slow you down. It is the only thing that lets you go fast for more than a quarter.


tooling stack pragmatism
Share

More from the Brief

All essays