Cloud Providers: Offering Governed AI as a Platform Service

Cloud providers that offer hosted AI to customers are not shipping one internal assistant. They are operating a platform surface where every tenant expects different retention, residency, tool, and approval rules. The commercial promise is flexible AI access. The operational challenge is making that flexibility governable without building a custom control plane for every customer or market segment.

Keeptrusts helps because it turns governance into part of the platform service contract. Tool Validation constrains what tools a route can expose, Agent Firewall governs tool actions and kill switches, Data Routing Policy filters model targets using provider metadata, and Regulated Execution gives stricter workloads a dedicated operating lane. Combined with Multi-Gateway and Data Residency, that lets a provider sell governed AI as an explicit service feature instead of a soft promise.

Use this page when

You operate a managed AI platform, internal developer platform, or hosted agent service.
You need customer-facing controls for tools, provider routing, residency, and reviewability.
You want a reusable governance model across commercial, enterprise, and regulated service tiers.

Primary audience

Primary: Technical Leaders
Secondary: Platform Engineers, Governance reviewers

The problem

Platform providers sit in a difficult middle layer. Customers want speed and flexible model choice, but they also want guarantees: no training on their traffic, a specific retention window, region-specific processing, and confidence that agents cannot perform unexpected actions. If those guarantees exist only in product marketing or internal runbooks, the provider is effectively asking customers to trust service behavior they cannot independently verify.

Tool-enabled agent routes make the problem harder. A hosted AI platform may offer retrieval, ticket creation, export, or workflow automation as part of the product. Without declared-tool governance, a route can drift over time. A team adds one more tool for convenience, then another team reuses the same route, and before long the service is exposing capabilities the original customer approval never covered.

Regional and sector differences also matter. A public-sector tenant may need local-only processing and zero retention. A commercial tenant may accept a broader provider pool. If the platform handles those cases with undocumented exceptions instead of a formal control surface, operations and sales eventually collide: the product cannot prove it is honoring the commitment it sold.

The solution

The right model is to make governance a first-class service primitive. Start with tool declarations. Tool Validation keeps the service honest about what a route can request. It does not validate every tool argument itself, but it does stop undeclared tools and ensures schemas compile, which is the right baseline for a multi-tenant platform.

Then govern the actions. Agent Firewall gives you exact-match allow and deny lists, per-request action caps, simple per-action rate limits, and kill-switch behavior when suspicious patterns or PII appear in tool arguments. That is especially useful for hosted agent platforms where customers expect guardrails around automation, not just around model text.

Provider choice should be explicit as well. Data Routing Policy filters targets according to declared data_policy metadata such as zero retention, no training, retention windows, local-only processing, and tokenized-input support. That creates a real boundary between service tiers instead of a loose internal convention.

Finally, split the runtime by tier when needed. Govern AI Agents and Regulated Execution are useful because they encourage a platform-service design where sensitive workloads are not merely “flagged.” They run through a stricter lane with its own routing and evidence model.

Implementation

This example shows a governed agent route for a platform-service tier that requires zero retention, declared tools, and strict action caps.

pack:
  name: governed-platform-service
  version: 1.0.0
  enabled: true

providers:
  targets:
    - id: regional-zdr
      provider: openai
      model: gpt-5.4-mini-mini
      secret_key_ref:
        env: OPENAI_API_KEY
      data_policy:
        zero_data_retention: true
        training_opt_out: true
        retention_days: 0
        in_memory_only: true
        sanitized: true
        accepts_tokenized_input: true
        allow_internet_egress: false
        local_only_processing: true

policies:
  chain:
    - tool-validation
    - agent-firewall
    - data-routing-policy
    - audit-logger

policy:
  tool-validation:
    declared_tools:
      - knowledge_search
      - case_create
      - export_csv
    schemas:
      knowledge_search:
        type: object
        properties:
          query:
            type: string
        required:
          - query
      case_create:
        type: object
        properties:
          title:
            type: string
        required:
          - title
    allow_undeclared: false

  agent-firewall:
    allowed_tools:
      - knowledge_search
      - case_create
      - export_csv
    blocked_tools:
      - delete_tenant
    max_actions_per_window: 3
    max_actions_per_session: 10
    rate_limits:
      export_csv: 1
    kill_switches:
      halt_on_suspicious_pattern: true
      halt_on_pii_in_action: true

  data-routing-policy:
    require_zero_data_retention: true
    require_no_training: true
    max_retention_days: 0
    require_in_memory_only: true
    sanitize_before_provider: true
    tokenize_sensitive_fields: true
    allow_internet_egress: false
    local_only_processing: true
    on_no_compliant_provider: block
    log_provider_selection: true

  audit-logger: {}

This kind of pack works well as a premium or regulated service lane. The important point is that the service tier is defined by enforceable runtime behavior, not by a policy PDF. If you need a looser commercial route, create a separate pack with a deliberate change in routing requirements. Do not ask the platform runtime to make those differences implicitly.

Operationally, pair this with regional deployment choices. Multi-Gateway lets teams keep service topology aligned with geography, and Data Residency helps frame how routing commitments should map to actual infrastructure and provider metadata.

Results and impact

The biggest result is credibility. Sales and solution teams can describe AI governance in service-level terms because the platform has a concrete runtime model behind the claim. Engineering benefits too: new managed routes inherit a known boundary instead of starting from an empty policy file.

This also improves customer onboarding. Rather than debating every tool and provider option from scratch, the provider can present a small number of supported service profiles with documented controls. That reduces exception handling and makes compliance conversations much more concrete.

Finally, the evidence model improves. When customers ask why a request failed or which provider was eligible, the platform can answer from decision events and routing logs instead of reverse-engineering behavior from application traces.

Key takeaways

A governed AI platform service needs enforceable runtime controls, not only contractual language.
Tool Validation and Agent Firewall are strong complements for hosted agent routes.
Data Routing Policy is what turns provider promises into an actual filter.
Use separate packs and lanes for regulated or premium service tiers instead of mixing them with default routes.
Regional topology and evidence collection are part of the product, not only of operations.

Cloud Providers: Offering Governed AI as a Platform Service

Use this page when​

Primary audience​

The problem​

The solution​

Implementation​

Results and impact​

Key takeaways​

Next steps​