Home/Blog/AI-Native Development Platforms
Platform Engineering

AI-Native Development Platforms: The CI/CD Pipeline After Coding Agents

Coding agents make producing a plausible change cheap. The platform's new job is to make evidence mandatory: which specification authorized the work, which checks proved it, which identity created it, and which controlled path delivered it.

Generated code is a proposal, not a release

GitHub's coding-agent workflow already reflects the new shape of development: an agent works in its own environment, creates a pull request, and requests review. GitHub also documents that workflows from agent-authored pull requests require approval before they run. That boundary matters. The agent can produce change quickly; trusted automation and accountable reviewers decide whether the change deserves execution and delivery privileges.

An AI-native platform is therefore not an IDE with a chatbot. It is a software delivery control plane that assumes some authors are autonomous, fast, non-human, and occasionally wrong in convincing ways. It turns intent and evidence into machine-enforced promotion rules.

AI-assisted SDLC pipeline
01 IntentExecutable specAcceptance criteria, constraints, risk class, affected systems, and required evidence.
02 PlanScoped agent taskRepository, branch, tools, permissions, budget, and completion contract.
03 ChangeIsolated workspaceAgent identity, signed commits, dependency diff, and generated artifact inventory.
04 VerifyDeterministic checksBuild, lint, types, unit, integration, contract, and regression tests.
05 SecureRisk gatesCode scanning, dependency review, secrets, policy, and threat-model delta.
06 ReviewEvidence-aware PRHuman reviews intent, design, exceptions, failed attempts, and proof.
07 AttestProvenanceTrusted build signs artifact, materials, workflow identity, and verification results.
08 DeliverControlled promotionCanary, observability, rollback, approval, and post-deploy verification.

The specification becomes the first pipeline artifact

Human-written tickets often leave assumptions in the author's head. Coding agents need a more executable contract: expected behavior, forbidden behavior, interfaces, examples, non-functional constraints, migration requirements, and the tests that must exist. The pipeline should link every generated diff to that spec and reject changes that cannot demonstrate coverage of its acceptance criteria.

NIST's Secure Software Development Framework emphasizes preparing the organization, protecting software, producing well-secured software, and responding to vulnerabilities. Coding agents do not replace those practices; they increase the volume at which the platform must enforce them.

Intent evidenceSpec, task owner, risk class, permitted scope, and acceptance criteria.
Verification evidenceTests, scans, policy decisions, coverage, performance, and compatibility.
Origin evidenceAgent identity, model/tool versions, commits, dependencies, build, and attestation.
Runtime evidenceDeployment approval, canary metrics, incidents, rollback, and production outcome.

CI becomes an evidence graph

A green check is too coarse when changes arrive faster than humans can read them. The platform should preserve why each check exists, which requirement it covers, which artifact it evaluated, and whether the environment was trusted. Protected branches and rulesets can require reviews and status checks. Dependency review exposes risky package changes. Code scanning finds vulnerabilities. Artifact attestations establish build provenance. SLSA defines progressively stronger supply-chain guarantees.

The useful output is not “pipeline passed.” It is an evidence graph joining task, identity, commits, dependencies, tests, findings, exceptions, reviews, artifacts, attestations, deployment, and runtime signals.

StageCoding agent responsibilityPlatform gateHuman responsibility
IntentRestate requirements and identify ambiguity.Require spec, owner, scope, risk class, and evidence contract.Resolve tradeoffs and approve business intent.
ImplementationChange code, tests, docs, migrations, and dependency manifests.Isolated environment, least privilege, identity, budget, and full diff capture.Review architecture and consequential decisions.
VerificationRun checks, diagnose failures, and propose fixes.Independent trusted runners execute mandatory checks.Approve exceptions; never accept agent self-attestation alone.
SecurityExplain findings and remediate within scope.Code scanning, dependency review, secret scanning, policy, and risk scoring.Accept or reject residual risk.
Supply chainDeclare generated files, tools, and materials used.Trusted build produces signed artifact and provenance attestation.Set required provenance level and exception policy.
DeliveryPropose rollout and rollback plan.Environment protections, approvals, canary checks, and automated rollback.Authorize high-impact promotion and own outcome.

Agents should improve the pipeline, not bypass it

Agents are excellent at generating missing tests, explaining failures, proposing smaller diffs, updating documentation, and repairing policy violations. They should receive structured feedback from the pipeline and iterate inside the same constrained task. They should not be able to weaken the required checks, approve their own workflow run, dismiss findings, mint production credentials, or merge around protections.

Use short-lived OIDC credentials for trusted workflows instead of long-lived cloud secrets. Keep agent workspaces separate from deployment environments. Require trusted builders to create attestations. Treat any change to workflow definitions, permissions, rulesets, or security configuration as a higher-risk class.

What I would build

I would build a platform portal where every coding-agent task starts from a versioned spec and produces a delivery dossier. The dossier would contain the agent identity, task permissions, plan, diff, generated-file manifest, test mapping, scans, exceptions, reviews, provenance, rollout, and runtime result.

The central visual would be an evidence timeline for each change. A reviewer could inspect a failed test, trace it to the requirement it protects, see how the agent responded, and verify that the final artifact came from the approved commit and trusted workflow.

The design principle

When code generation becomes abundant, trust must come from constraints and evidence rather than typing effort. The AI-native platform does not ask whether a human or agent wrote the code. It asks whether the change satisfied the same explicit contract, passed independent verification, preserved provenance, and entered production through a controlled path.