Construction AI Brief
After I/O shipped the capability, the last ten days have been about control, cost and consequence: enterprise agent containment, near-frontier coding at a tenth of the price, and a sober RTPI warning that automation is outrunning the planning system.
PlanOps automates the planning tasks you’re reading about.
Start free
Today’s context: This brief covers the latest movements in AI tooling, adoption, and signals for construction teams. Read on for what matters and what to focus on.
The Royal Town Planning Institute has warned that the UK planning system is not equipped to handle the way AI, automation and advanced manufacturing are reshaping land use and employment. The core problem is predictive: while AI and advanced manufacturing raise the economic value of an area, automation is simultaneously cutting the number of conventional jobs - so local planning authorities can no longer reliably forecast future employment levels or the associated economic value they are meant to plan for. The guidance underpinning these judgements, the Economic Needs Assessment (ENA) that authorities use for land allocation, was last updated in 2019 and predates the current wave entirely. The RTPI is calling for better integration of plans and strategies, a government-led National Spatial Framework, and regional collaboration to support emerging industrial clusters such as AI, advanced manufacturing and data centres. Alongside the warning, the institute has published best-practice advice for planners working with AI.
For anyone in development, infrastructure or data-centre work, this is the policy story that actually touches the pipeline. Employment forecasts feed land allocation; land allocation shapes what gets consented and where; and data-centre demand - the physical backbone of the AI boom - is colliding with a framework that can't model it. It is also a useful counterweight to the vendor optimism that dominates most AI coverage: the constraint on AI's impact in the built environment increasingly isn't the technology, it's whether the planning and skills systems around it can keep pace.
Why it matters
If your work depends on local plans, employment-land designations or data-centre consents, expect more friction and less certainty until the guidance is refreshed. Engage early with planning authorities, evidence your own employment and economic-value assumptions rather than relying on theirs, and treat the RTPI's AI guidance as a signal of where scrutiny is heading.
Automate your programme admin. Get your evenings back.
This is the direct sequel to last week's point about MCP being asked to carry a governance load it wasn't built for. Anthropic has extended its Claude Managed Agents platform with two enterprise controls. Self-hosted sandboxes (in public beta) let the actual execution of an agent's tools run on infrastructure you control - your own environment or a managed provider such as Cloudflare, Daytona, Modal or Vercel - while Anthropic still handles orchestration, context and recovery. That gives you control over network policy, audit logging, runtime configuration and, crucially, data residency. MCP tunnels (in research preview) let agents reach private MCP servers - internal databases, APIs, ticketing systems, knowledge bases - without exposing them to the public internet: instead of opening inbound firewall rules, you run a lightweight gateway that makes an outbound encrypted connection.
The plain-English version: these are the features that let an agent run inside your security perimeter rather than behind a sandbox your security team takes six weeks to clear. For construction, where project data is commercially sensitive and increasingly subject to Building Safety Act record-keeping and golden-thread obligations, "where does the execution actually happen and who can see the data" is the question that decides whether an AI pilot ever reaches a live project. This is the infrastructure that makes a defensible answer possible.
Why it matters
If data residency or security review has been blocking an AI pilot, the containment options just improved materially. Make "execution location, audit logging and data residency" explicit requirements in any agent procurement - and note these are still beta/preview, so treat them as evaluation-stage, not production-hardened.
Computer-using agents in Microsoft Copilot Studio reached general availability (announced 13 May, rolling across all commercial Power Platform geographies including Europe, with government clouds following in the second half of 2026). The capability is the one most relevant to construction's back office: agents that operate websites and desktop applications through the user interface itself - using vision and reasoning to navigate live screens and adapt when layouts or fields shift - to automate processes that previously relied on brittle scripts or manual workarounds because the underlying systems had no API. The GA build ships with OpenAI's computer-use model and Claude Sonnet 4.5, Azure Key Vault for credential storage, Microsoft Purview audit logging, and configurable human-in-the-loop review.
Construction runs on exactly the kind of legacy, API-less software this targets - older accounting and cost systems, supplier portals, certification and compliance websites, plant-hire and procurement tools. The honest caveat is that UI-driven automation is inherently more fragile than an API integration and needs monitoring, but the credential-vaulting, audit-logging and human-in-the-loop features are precisely the governance scaffolding that makes it viable for regulated, auditable work. Note the recurring pattern across this week's releases: the headline isn't the model, it's the controls shipped around it.
Why it matters
Map the recurring admin tasks your team does inside systems that have no API - re-keying data between portals, downloading and filing certificates, chasing statuses. Those are now automatable with an audit trail. Start with one low-risk, well-defined task and keep a human approval gate.
Step back and the week tells one story. The capability question - can AI agents do useful, multi-step work? - was largely answered at I/O. The releases since have all been about the harder, more boring questions: where does the agent run, who can see the data, what gets logged, and who signs off? Anthropic's sandboxes and tunnels, Copilot Studio's credential vaulting and audit logging, even the human-in-the-loop defaults - these are governance features dressed as product launches. The RTPI's warning is the same lesson from the policy side: the technology has outrun the framework around it, and the gap is now the constraint.
For construction specifically, that is good news, because the sector's blockers were never really about capability - they were about assurance, auditability and data sensitivity on regulated projects. The tooling is finally being built to answer those questions. The teams that move fastest over the next quarter will be the ones who treat "show me the audit trail and the data-residency story" as the first question, not an afterthought - and who keep a human reviewing anything an agent produces before it touches a live project, a price or a programme.
Why it matters
Adoption is now gated by governance readiness, not model quality. Get your data-classification, approval-gate and audit-logging house in order, and you can move quickly when a pilot proves out - without a six-week security scramble each time.
50 free Intelligence Units. Set up your first project in under 20 minutes. No credit card needed.
Get 50 free Intelligence UnitsDaily practical AI insight for construction teams. What changed, why it matters, and what to ignore.
50 free Intelligence Units — automate your programme admin
We help construction teams turn AI into useful work, not noise. Understanding what’s changing in AI is the first step. Making it work on-site is the real difference.
A big month for UK construction AI starts this week. Digital Construction Week opens on Wednesday, Anthropic shipped a flagship with native multi-agent workflows on Friday, and the company is now valued at $965bn. A practical Monday-morning take on what's worth your time.
Found this useful? Share it.
Cursor's Composer 2.5, an agentic coding model built for long, tool-heavy IDE sessions (reading files, running terminal commands, editing across multiple files, executing tests and iterating), moved from launch promotion to standard pricing on 26 May - $0.50 input / $2.50 output per 1M tokens, up from the introductory $0.25 / $1.25. Even at the standard rate, it posts near-frontier coding benchmark scores at roughly a tenth of the API cost of Claude Opus 4.7 or GPT-5.5. It is built on the same open-source checkpoint as Composer 2 - Moonshot's Kimi K2.5 - post-trained on 25× more synthetic coding tasks, which is a neat illustration of how much capability you can now buy on top of an open base model.
The relevance for construction software teams is the same theme we keep returning to, now with sharper economics. Capable agentic coding is no longer a premium-priced bottleneck. For the kind of small, internal tools that pay off in this sector - a script to reconcile cost codes, an integration to pull data off your CMS or BIM stack, a one-off compliance-record extractor - a model at this price changes the build-vs-buy calculation. The frontier models still lead on the hardest, multi-step engineering, but most internal-tooling work doesn't need the frontier.
Why it matters
Re-run the build-vs-buy maths on the small automations you've been putting off because "we'd need a developer." At these prices, a thin internal tool built with an agentic coder is often cheaper than another SaaS subscription - provided you keep a human reviewing what it ships.
Digital Construction Week is next week, professional indemnity insurers are starting to write AI out of their policies, and LinkedIn has begun throttling the reach of AI-cadence posts. A practical, slightly less polished brief — by design.
Claude landed inside Bluebeam this week. Anthropic and Microsoft shipped the controls that let agents run inside your perimeter. The RTPI warned the planning system can't keep up, and some PI insurers started writing AI out of cover. Digital Construction Week is next Wednesday.