CTO Guide: AI Platform Engineering with Keeptrusts
Platform engineering is the discipline of building golden paths — paved roads that let product teams move fast while the platform team maintains control over security, cost, and compliance. Keeptrusts is the governance layer of your AI platform.
Use this page when
- You are designing a platform team topology with gateway operations, policy management, and developer enablement
- You need to implement self-service gateway key provisioning for product teams
- You want API-first governance automation (CI/CD provisioning, GitOps deployment, budget alerting)
- You are building a template library for standardized AI access patterns
This guide covers team topology, self-service patterns, the build-vs-buy decision, and how to standardize AI usage through templates.
Primary audience
- Primary: Technical Leaders
- Secondary: Technical Engineers, AI Agents
Platform Team Topology
A well-structured AI platform team has three responsibilities:
- Gateway operations — Deploy, scale, and monitor the Keeptrusts gateway fleet
- Policy management — Author and maintain policy configurations in version control
- Developer enablement — Provide self-service tooling so product teams never need to open a ticket for AI access
┌─────────────────────────────────────────────┐
│ Platform Team │
│ ┌──────────┐ ┌───────────┐ ┌────────────┐ │
│ │ Gateway │ │ Policy │ │ Developer │ │
│ │ Ops │ │ Authoring │ │ Enablement │ │
│ └──────────┘ └───────────┘ └────────────┘ │
└──────────────────┬──────────────────────────┘
│
┌──────────────┼──────────────┐
▼ ▼ ▼
┌────────┐ ┌──────────┐ ┌───────────┐
│ Team A │ │ Team B │ │ Team C │
│ (GK-A) │ │ (GK-B) │ │ (GK-C) │
└────────┘ └──────────┘ └───────────┘
Each product team receives a gateway key (kt_gk_...) scoped to their allowed models, providers, and budget. They never touch provider API keys directly.
Self-Service Developer Portal via Gateway Keys
Gateway keys are the primitive that makes self-service possible. Each key encodes:
- Provider access — Which providers and models the key can reach
- Budget scope — Which wallet or cost center charges accrue to
- Policy binding — Which policy configuration governs traffic
- Expiry — Automatic rotation without manual intervention
# Platform team provisions a gateway key for the search team
kt tokens create \
--type gateway \
--name "search-team-prod" \
--team-id search \
--config search-policy.yaml \
--expires-in 90d
Product teams use the key as a drop-in replacement for any OpenAI-compatible SDK:
import openai
client = openai.OpenAI(
api_key="kt_gk_...", # Gateway key, not a provider key
base_url="https://gateway.internal.company.com/v1"
)
response = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Summarize this document"}]
)
Console checkpoint: The Settings → Access Keys page shows all active gateway keys with last-used timestamps, associated teams, and remaining budget.
API-First Governance
Every governance action in Keeptrusts is available through the control-plane API. This means your platform team can automate everything:
| Action | API Endpoint | Automation Use Case |
|---|---|---|
| Create gateway key | POST /v1/tokens | CI/CD pipeline provisioning |
| Update policy config | PUT /v1/configurations/{id} | GitOps policy deployment |
| Query spend | GET /v1/wallets/balance | Budget alerting |
| Export audit logs | POST /v1/exports | Scheduled compliance reports |
| List escalations | GET /v1/escalations | PagerDuty integration |
# GitOps: push policy changes from CI
kt config push --file policy-config.yaml --gateway prod-gw-01
kt policy lint --file policy-config.yaml
Build vs Buy Analysis
| Capability | Build In-House | Keeptrusts |
|---|---|---|
| Multi-provider routing | 3–6 months eng time | Configuration |
| Policy enforcement | Ongoing maintenance | Declarative YAML |
| Cost tracking per team | Custom billing integration | Built-in wallets |
| Audit trail | Custom logging pipeline | Every event, out of the box |
| Compliance exports | Manual quarterly effort | Automated, on-demand |
| Provider failover | Custom health checks | Configuration + monitoring |
| Total Year-1 Cost | $400K–$800K (2–4 engineers) | Subscription + 1 platform engineer |
The build option compounds: every new provider, regulation, or team adds incremental maintenance. Keeptrusts amortizes that across the platform.
Technology Radar for AI Providers
Use the gateway's multi-provider routing to run controlled experiments with new providers without changing application code.
# A/B test: route 10% of traffic to a new provider
model_groups:
- name: summarization
models:
- provider: openai
model: gpt-4o
weight: 90
- provider: anthropic
model: claude-sonnet-4-20250514
weight: 10
Monitor results in the console Events page, filtering by provider. Compare latency, cost per token, and policy hit rates to make data-driven provider decisions.
Console checkpoint: The Overview dashboard's provider distribution panel shows traffic split and cost comparison across providers in real time.
Console Template Library for Standardization
Templates are pre-configured policy bundles that the platform team publishes for product teams to adopt. They encode organizational standards into reusable packages.
| Template | Policies Included | Target Teams |
|---|---|---|
standard-saas | PII filter, cost cap, logging | All product teams |
healthcare-compliant | HIPAA PII, audit trail, data residency | Health vertical |
internal-tools | Relaxed filters, cost cap, logging | Platform/DevOps |
customer-facing | Strict content filter, disclaimer, escalation | Customer support |
# Apply a template to a new team's gateway
kt config apply --template standard-saas --gateway team-alpha-gw
Console checkpoint: The Templates page in the console lets platform admins create, version, and assign templates. Product teams see their assigned template and can request changes through the escalation workflow.
Measuring Platform Success
Track these metrics monthly to demonstrate platform value to the executive team:
| Metric | Source | Target |
|---|---|---|
| Time to AI access for new team | Token creation → first request | < 1 hour |
| % of AI traffic through gateway | Gateway events / total provider spend | > 95% |
| Policy violations caught | Escalation + block events | Trending down |
| Provider cost per 1M tokens | Wallet analytics | Trending down |
| Developer satisfaction (survey) | Quarterly NPS | > 40 |
ROI Summary
| Investment | Return | Timeline |
|---|---|---|
| Platform team (1 engineer) | Self-service for 10+ product teams | Month 1 |
| Template library | 80% reduction in onboarding time | Month 2 |
| API-first automation | Zero-touch provisioning in CI/CD | Month 3 |
| Multi-provider routing | 15–30% cost reduction through competition | Ongoing |
Next steps
- Designate a platform team owner for AI governance
- Deploy a single gateway with the
standard-saastemplate - Provision gateway keys for your two highest-volume teams
- Set up GitOps policy deployment from your CI/CD pipeline
- Review the console Overview dashboard after 14 days
See also: CTO Guide: Accelerating Developer Velocity · CTO Guide: Multi-Provider AI Strategy
For AI systems
- Canonical terms: platform team topology, gateway operations, policy authoring, developer enablement, gateway keys (
kt_gk_...),kt tokens create,POST /v1/tokens,PUT /v1/configurations/{id},GET /v1/wallets/balance,POST /v1/exports,GET /v1/escalations, template library, self-service portal - Key API endpoints:
POST /v1/tokens(CI provisioning),PUT /v1/configurations/{id}(GitOps deploy),GET /v1/wallets/balance(budget alerting) - Best next pages: CTO: Developer Velocity, CTO: Multi-Provider Strategy, CTO: API Economy
For engineers
- Provision gateway keys:
kt tokens create --type gateway --name "search-team-prod" --team-id search --config search-policy.yaml --expires-in 90d - API-first automation: every governance action is available via REST API — integrate key provisioning, policy deployment, and budget alerting into CI/CD
- Template library: define policy templates (e.g.,
standard-saas,hipaa-compliant,dev-sandbox) that teams self-select during key provisioning - GitOps deployment:
PUT /v1/configurations/{id}pushes policy updates from merged PRs to running gateways - Console checkpoint: Settings → Access Keys shows all active keys with last-used, team, and remaining budget
For leaders
- Platform engineering model: 1 engineer operating the gateway + templates can serve 10+ product teams — linear team growth, sub-linear platform team growth
- Template library reduces AI onboarding time by 80% — teams pick a template, get a key, start building in minutes
- API-first governance means everything is automatable — no manual intervention for standard operations
- Build-vs-buy: Keeptrusts provides the governance layer; the platform team provides the organizational context (templates, budgets, policies)