Provider-Portable AI Architecture

Your path to AI-powered compliance without vendor lock-in. Use Amazon Bedrock today, Google Vertex AI tomorrow, Azure AI Foundry next week, or all at once.

5 Provider Adapters
1 Unified API
3 Routing Strategies
Zero Vendor Lock-in

The Strategic Challenge

You face a fundamental tension in AI adoption for regulatory compliance.

Immediate Value Delivery

You need to leverage cutting-edge AI capabilities for compliance automation, risk assessment, and regulatory intelligence today. Waiting means falling behind competitors and increasing regulatory exposure.

🔒

Strategic Flexibility

You require the freedom to choose the best provider for each workload, switch providers as pricing and capabilities evolve, and avoid lock-in that constrains future technology decisions.

🛡

Governance Requirements

Your organization demands consistent security, privacy, and audit controls regardless of which AI provider processes requests. Compliance cannot be an afterthought.

💰

Cost Optimization

You want to route workloads to the most cost-effective provider without sacrificing quality. Different tasks demand different models, and your architecture should support intelligent routing.

The Model Gateway Architecture

Your applications never communicate directly with foundation model providers. All AI interactions flow through a Model Gateway you own and control.

Your Applications
Compliance Automation
Risk Assessment
Regulatory Monitor
RAG Copilot
Model Gateway
Unified API Contract • Intent-Based Routing • Policy Engine • Prompt Registry • Safety Controls • Observability
Providers
Amazon Bedrock
Google Vertex AI
Azure AI Foundry
OpenAI Direct
Private / Ollama

Why This Pattern Works

The Model Gateway acts as an anti-corruption layer between your business logic and external AI providers. This separation delivers three strategic advantages:

  • Provider Independence: Switch providers through configuration changes, not application rewrites
  • Centralized Governance: Apply consistent security, privacy, and audit controls across all AI interactions
  • Optimized Economics: Route each workload to the most cost-effective option automatically

Without Gateway | With Gateway
Provider-specific code in apps | One API, any provider
Scattered governance | Centralized controls
Expensive provider switching | Configuration-based routing
Manual cost optimization | Intelligent auto-routing
Fragmented observability | Unified tracing and logs

Key Capabilities

RegRiskIQ delivers enterprise-grade AI governance through these integrated components.

🔌

Intent-Based Routing

Your applications specify what they need (regulatory analysis, risk scoring, document extraction) rather than which model to use. The gateway selects the optimal provider based on cost, latency, quality, and policy requirements.
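
As an illustration, an application call might carry only the intent and its constraints; the client, endpoint, and field names in this sketch are assumptions, not the shipped gateway API.

# Illustrative sketch only: endpoint and field names are assumptions, not the real API.
import httpx

async def ask_gateway(question: str) -> dict:
    async with httpx.AsyncClient(base_url="https://gateway.internal") as client:
        response = await client.post("/v1/completions", json={
            "intent": "regulatory_analysis",  # what the app needs, not which model
            "input": question,
            "constraints": {"max_latency_ms": 3000, "data_classification": "internal"},
        })
        response.raise_for_status()
        return response.json()  # the gateway chose the provider, model, and prompt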

📑

Prompt Registry IP ASSET

Your prompts encode institutional knowledge—they're intellectual property. The registry provides version control, team collaboration, A/B testing, and complete audit trails. Structured prompts reduce token usage by 60-76% while protecting your competitive advantage.

🛡

Policy Engine 9D ABAC

OPA-powered 9-dimensional attribute-based access control—beyond industry-standard 3D RBAC. Controls WHO uses AI (identity), WHAT they access (resources), HOW (actions), WHERE/WHEN (environment), WHY (purpose), WHOSE data (customer PII), aggregation levels (AML thresholds), cross-LOB data sharing, and break-glass for regulatory emergencies. Policy-as-code with full audit trails.

🔎

Provider Adapters

Each foundation model provider integrates through a dedicated adapter that normalizes request formats, response structures, error codes, and authentication patterns. Adding new providers requires only a new adapter.
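
A minimal sketch of what that adapter contract can look like; class and field names below are illustrative assumptions (the BaseModelProvider interface described under Implementation Status tracks more capabilities than shown here).

# Sketch of an adapter contract; names and fields are illustrative assumptions.
from abc import ABC, abstractmethod
from dataclasses import dataclass

@dataclass
class GatewayRequest:
    intent: str
    prompt: str
    max_tokens: int = 1024

@dataclass
class GatewayResponse:
    text: str
    input_tokens: int
    output_tokens: int
    cost_usd: float
    provider: str

class BaseModelProvider(ABC):
    """Normalizes request/response shapes, errors, and auth for one provider."""

    @abstractmethod
    async def complete(self, request: GatewayRequest) -> GatewayResponse: ...

class BedrockAdapter(BaseModelProvider):
    async def complete(self, request: GatewayRequest) -> GatewayResponse:
        # Translate GatewayRequest into the provider API shape, call it, and map the
        # provider-specific response, errors, and usage back onto GatewayResponse.
        raise NotImplementedError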

📊

Observability

OpenTelemetry instrumentation provides end-to-end visibility across gateway, adapters, and providers. Track token usage, costs, latencies, and error rates per tenant, per provider, and per use case.
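
For example, a gateway-side span can carry token and cost attributes as in this sketch; the attribute names and the adapter call are assumptions, and only the OpenTelemetry API usage itself is standard.

# Sketch of gateway-side instrumentation; attribute names are assumptions.
from opentelemetry import trace

tracer = trace.get_tracer("model_gateway")

def invoke_with_tracing(adapter, request):
    with tracer.start_as_current_span("model_gateway.invoke") as span:
        span.set_attribute("ai.intent", request.intent)
        span.set_attribute("ai.provider", adapter.name)
        result = adapter.complete_sync(request)  # hypothetical synchronous adapter call
        span.set_attribute("ai.tokens.input", result.input_tokens)
        span.set_attribute("ai.tokens.output", result.output_tokens)
        span.set_attribute("ai.cost_usd", result.cost_usd)
        return result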

📚

RAG Independence

Your retrieval pipeline operates independently from model providers. Switch inference providers without re-indexing document stores or modifying retrieval logic. Your knowledge base stays portable.

💻

Everything as Code GitOps

Routing rules, policies, prompts, and provider configs live in Git. PR reviews for AI changes. Full audit trail. Rollback any configuration. CI/CD pipelines for your AI infrastructure. Environment parity from code.

🔄

Continuous Learning RLHF

Production feedback loops that improve AI performance over time. Thumbs up/down captures user satisfaction. Analytics identify false positive patterns. Automatic threshold tuning reduces noise. Every interaction teaches the system.

Routing Strategies

Strategy | Optimizes For | Use Case
Cost | Minimize spend while meeting quality thresholds | High-volume, non-critical workloads
Performance | Minimize latency for interactive experiences | Real-time compliance Q&A
Quality | Maximize output quality for critical decisions | Regulatory filing review
Hybrid | Balance all factors dynamically | Default for most workloads
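
One way to read the hybrid strategy is as a weighted score over normalized cost, latency, and quality estimates for each candidate provider; the weights and fields below are a hedged sketch, not the production routing algorithm.

# Hedged sketch of hybrid routing: weights and candidate fields are assumptions.
from dataclasses import dataclass

@dataclass
class Candidate:
    provider: str
    est_cost: float      # normalized 0..1 (lower is better)
    est_latency: float   # normalized 0..1 (lower is better)
    quality: float       # normalized 0..1 (higher is better)

WEIGHTS = {"cost": 0.3, "latency": 0.3, "quality": 0.4}

def pick_provider(candidates: list[Candidate]) -> Candidate:
    def score(c: Candidate) -> float:
        return (WEIGHTS["quality"] * c.quality
                - WEIGHTS["cost"] * c.est_cost
                - WEIGHTS["latency"] * c.est_latency)
    return max(candidates, key=score)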

Architectural Value Proposition

What the provider-portable architecture enables for your organization.

  • Unified: Single API for all providers
  • Isolated: Provider-specific code contained in adapters
  • Flexible: Route by cost, performance, or quality
  • Observable: OpenTelemetry tracing across all requests

Strategic Advantages

Freedom of Choice

Evaluate and adopt new providers without application changes. Your business logic stays stable while AI capabilities evolve.

Optimized Economics

Route each workload to the most cost-effective option. Use premium models where quality matters, economical models where speed is sufficient.

Consistent Governance

Apply uniform security, privacy, and audit controls across all AI interactions. Meet regulatory requirements once, regardless of provider.

Future-Proofing

Architectural readiness for emerging models and providers. When the next breakthrough arrives, you adopt it through configuration.

The Hyperscalers Agree

AWS, Azure, and Google each publish reference architectures for this exact pattern—because they're competing to be YOUR abstraction layer.

AWS Reference Architecture

"Multi-Provider Generative AI Gateway" — Official AWS guidance for routing to Azure, OpenAI, and other providers through an AWS-hosted LiteLLM gateway on ECS/EKS.

AWS Solutions Library

Azure API Management

"AI Gateway" with native Bedrock support — Microsoft's answer: use Azure APIM to govern AWS Bedrock and non-Microsoft AI providers from your Azure control plane.

Microsoft Learn

Google Vertex AI

Model Garden with multi-provider serving — Google's unified platform supporting Anthropic Claude, Meta Llama, and partner models alongside Gemini.

Google Cloud
💡

The Strategic Takeaway

Each hyperscaler wants to be your gateway to all the others. Our architecture gives you this pattern without the platform lock-in—your gateway runs where YOU choose, not where your cloud vendor prefers.

Enterprise Authority Management

OPA-powered governance that goes far beyond standard access control—9 dimensions of context-aware policy enforcement for regulated financial services.

9-Dimensional ABAC vs Industry Standard

Industry Standard (3D RBAC)

  • ✓ Subject (Who)
  • ✓ Resource (What)
  • ✓ Action (Read/Write/Delete)

Coarse-grained, context-blind

Our Implementation (9D ABAC)

  • ✓ Subject, Resource, Action (base)
  • ✓ Environment (trading floor, VPN, device)
  • ✓ Purpose (operations, audit, reg reporting)
  • ✓ Data Subject (customer tier, consent)
  • ✓ Aggregation (AML thresholds, PII masking)
  • ✓ Cross-LOB (wealth → retail data sharing)
  • ✓ Emergency (reg exam, fraud investigation)

Context-aware, purpose-driven

Real-World Banking Scenarios

📊

Trading Floor Analyst

Scenario: Analyst needs AI-generated market insights during trading hours

9D Decision: Environment=trading_floor + time=market_hours + role=analyst → real-time data, no PII, audit logged

environment.location == "trading_floor" → market_data_only
🔍

AML Compliance Officer

Scenario: AI flags suspicious transaction pattern, needs customer history

9D Decision: Purpose=AML_investigation + SAR_filed → full transaction history, SSN masked unless SAR threshold met

purpose.type == "aml" && sar_threshold_met → full_pii
🏛

Regulatory Examiner Access

Scenario: OCC examiner requests AI model documentation and decisions

9D Decision: Emergency=reg_exam + examiner_credentials_verified → full model access, all decisions, complete audit trail

emergency.type == "regulatory_exam" → full_transparency
💲

Cross-LOB Data Request

Scenario: Wealth management AI needs retail banking transaction data for holistic customer view

9D Decision: Cross_LOB=wealth→retail + approved_use_case + data_sharing_agreement → aggregated view only, no raw transactions

cross_lob.approved && dsa_active → aggregated_view

Policy-as-Code (OPA Rego) - Banking GRC

package banking_grc.ai_access  # package name is illustrative

import rego.v1

# Helpers referenced below (customer_pii_fields, all_fields, time_in_market_hours,
# approved_cross_lob_use_case, data_sharing_agreement_active) are assumed to be
# defined elsewhere in the policy bundle.

# AML investigation grants elevated access with audit
allowed_fields contains field if {
    input.purpose.type == "aml_investigation"
    input.user.role == "compliance_officer"
    input.case.sar_filed == true
    some field in customer_pii_fields  # Full PII for filed SAR
}

# Trading floor restricts to market data only
allowed_fields contains field if {
    input.environment.location == "trading_floor"
    time_in_market_hours(input.environment.timestamp)
    some field in {"ticker", "price", "volume", "sentiment_score"}
    not field in customer_pii_fields
}

# Cross-LOB data sharing requires an approved use case
cross_lob_access_allowed if {
    input.cross_lob.source_lob != input.cross_lob.requester_lob
    approved_cross_lob_use_case(input.purpose.type)
    data_sharing_agreement_active(input.cross_lob.source_lob, input.cross_lob.requester_lob)
}

# Regulatory exam break-glass with mandatory audit
allowed_fields contains field if {
    input.emergency.type == "regulatory_exam"
    input.emergency.examiner_credentials_verified == true
    some field in all_fields  # Full transparency for regulators
    # Mandatory: log_regulatory_access(input)
}

Every policy decision is auditable, version-controlled, and reviewable through standard PR workflows. Integrates with existing NIST 800-53 AC and AU control families.
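
At request time, the gateway can ask OPA for the allowed field set before anything is forwarded to a provider. The sketch below assumes an OPA sidecar on localhost:8181 and the package path shown above; both are deployment-specific assumptions.

# Sketch: asking an OPA sidecar for allowed fields; URL and package path are assumptions.
import httpx

OPA_URL = "http://localhost:8181/v1/data/banking_grc/ai_access/allowed_fields"

def allowed_fields(request_context: dict) -> set[str]:
    response = httpx.post(OPA_URL, json={"input": request_context}, timeout=2.0)
    response.raise_for_status()
    return set(response.json().get("result", []))

# Example: a compliance officer working a filed SAR
fields = allowed_fields({
    "purpose": {"type": "aml_investigation"},
    "user": {"role": "compliance_officer"},
    "case": {"sar_filed": True},
})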

AI Governance Program

63 purpose-built AI controls across 14 domains—designed to complement your existing controls and satisfy emerging AI regulations.

Multi-Framework Integration

🏛
NIST 800-53
143 controls (MODERATE)
Your existing baseline
🤖
NIST AI RMF
Govern, Map, Measure, Manage
AI-specific risk management
🌎
EU AI Act
Risk classification + requirements
Global operations ready
📋
ISO 42001
AI management system
Certification pathway

71% of our AI controls map directly to or extend existing NIST 800-53 controls, so the program builds on your current control investments rather than duplicating them.

63 Controls Across 14 Domains

  • GO: Governance & Leadership (5 controls)
  • RM: Risk Management (5 controls)
  • RO: Regulatory Oversight (5 controls)
  • LC: Lifecycle Management (5 controls)
  • SE: Security (5 controls)
  • RS: Responsible AI (5 controls: bias, fairness)
  • GA: General Assessment (4 controls)
  • PR: Privacy (4 controls)
  • AA: Assessment & Assurance (5 controls)
  • OM: Operational Management (4 controls)
  • TP: Third-Party Management (4 controls)
  • CO: Communications (4 controls)
  • IM: Incident Management (4 controls)
  • PL: Planning (4 controls)

Governance Per AI Use Case

Each AI use case gets its own governance envelope—risk classification, applicable controls, evidence requirements, and monitoring metrics.

🎯

Risk Classification

EU AI Act tiers: Minimal, Limited, High, Unacceptable. Controls scale with risk.

📋

Control Mapping

Auto-mapped to NIST 800-53, AI RMF, ISO 42001. Gap analysis against your existing controls.

📈

Continuous Monitoring

Real-time dashboards: bias drift (RS-5), model performance (OM-1), explainability (RS-4).

Sample Control Mappings

AI Control | Description | Maps To | Evidence
GO-1 | AI Governance System Implementation | NIST PM-1, ISO 42001 5.1 | Charter, RACI, meeting minutes
RS-5 | Fairness & Bias Management | NIST AI RMF MEASURE 2.10, EU AI Act Art. 10 | Bias testing reports, demographic parity metrics
RS-4 | Explainability Requirements | EU AI Act Art. 13, ISO 42001 8.4 | SHAP/LIME outputs, decision audit logs
OM-1 | Performance Monitoring | NIST CA-7, ISO 42001 9.1 | Model accuracy dashboards, drift alerts
TP-2 | Third-Party Model Validation | NIST SA-9, EU AI Act Art. 28 | Vendor assessments, model cards

Continuous Improvement Architecture

Production AI systems that learn and improve from every interaction—closing the loop from deployment to enhancement.

The Learning Loop

👤 User Interaction → 🤖 AI Response → 👍👎 Feedback → 📈 Analytics → Tuning

↻ Continuous cycle improves accuracy over time

👍

Contextual Feedback

Thumbs up/down on every AI response. Star ratings for detailed assessment. Context captured: user role, intent, provider used, latency.

📈

Analytics Dashboard

Satisfaction trends by provider, intent, and persona. False positive rates. Time-series visualization. Executive reporting exports.

🎯

False Positive Detection

Pattern analysis identifies alerts consistently marked "not helpful". Thresholds auto-tune. Alert fatigue eliminated.

🧠

Reinforcement Signals

Thumbs up/down feeds into provider selection. High-satisfaction providers get preferred routing. Poor performers downweighted.

🔔

Intelligent Notifications

Severity-based routing: critical → Slack + email. Rate limiting. Quiet hours. Weekly digests. SLA tracking.

🔄

BPMN Workflows

Feedback triggers improvement processes. Critical bugs → 2-hour SLA. UX issues → design review. Structured escalation.

Feedback Service Architecture

# Feedback logged for every AI interaction
await feedback_client.log_copilot_event({
    "event_type": "insight_feedback",
    "persona": current_user.role,
    "user_feedback": "helpful",  # or "not_helpful"
    "provider": selected_provider,
    "latency_ms": response_time,
    "governance_control": "OM-1",  # Links to AI governance
})

# Pattern analysis → threshold tuning
# (illustrative: analyze_alert_patterns() is assumed to return per-control false positive rates)
patterns = await feedback_client.analyze_alert_patterns()
for control_id, false_positive_rate in patterns.items():
    if false_positive_rate > 0.3:
        adjust_alert_threshold(control_id, direction="less_sensitive")

Own Your AI Future

This isn't just a pattern to implement. It's strategic infrastructure you should own.

"The cloud era rewarded efficiency. The AI era rewards sovereignty."

— Deloitte Tech Trends 2026

💰

Negotiate from Strength

When you own your gateway, switching providers is a configuration change—not a migration project. Vendors know this. Your procurement leverage is fundamentally different.

🔒

Protect Your IP

Your prompts encode institutional knowledge—regulatory interpretations, risk frameworks, compliance logic. These are living knowledge assets. They belong in YOUR systems, version-controlled and auditable.

📈

Control Your Economics

Route expensive tasks to cost-effective models. Cache responses. Batch requests. These optimizations compound. When you own the gateway, the savings flow to you—not to a platform vendor.

The Build vs. Buy Framework

✓ Buy from Vendors When:

  • Commodity tasks (note-taking, basic Q&A)
  • Speed to market within a quarter
  • Compliance checkboxes, not competitive edge
  • No proprietary data or logic involved

★ Build/Own When:

  • AI drives competitive differentiation
  • Proprietary data, prompts, or logic
  • Multi-provider flexibility is strategic
  • Long-term cost control matters

⚡ Lightweight Infrastructure, Heavy Leverage

YOUR GATEWAY
Lightweight

Routing, policy, orchestration. Standard compute. No GPUs. Scales horizontally.

PROVIDER INFERENCE
Heavy (They Pay)

GPU clusters, TPUs, model hosting. You pay per-token, not per-GPU-hour.

Owning the gateway is feasible because you're building orchestration, not GPU infrastructure.

🎯

Our Recommendation

For regulated financial services, AI infrastructure is strategic infrastructure. The hyperscalers want to be YOUR gateway because control equals leverage. We recommend you own this layer—deployed in your environment, integrated with your governance, evolving with your needs.

Ready for the Agentic Future

96% of IT leaders plan to expand AI agents in 2025. Your gateway is the control plane.

The Shift Happening Now

66%

of enterprise AI implementations now use multi-agent architectures rather than single-model approaches.

Source: Arcade.dev Agentic Framework Adoption Report 2025

Why Governance is Critical

75%

of technology leaders list governance as their primary concern when deploying agentic AI.

Gartner projects 40% of agent projects will fail by 2027 due to inadequate controls.

Gateway as Agent Control Plane

When agents orchestrate other agents, every model call flows through your gateway. This gives you:

🤖

Agent Identity & Authorization

Each agent authenticates separately. Apply different policies, rate limits, and model access per agent type.

💰

Per-Agent Cost Allocation

Track token usage and costs per agent. Identify runaway agents before they impact budgets.

🛡

Tool Use Governance

Control which agents can invoke which tools. Enforce approval workflows for sensitive operations.

🔍

Complete Audit Trail

Every agent-to-model interaction logged. Trace decisions back through the agent chain for compliance.
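
A minimal sketch of what per-agent tool governance could look like at the gateway; the agent names, allow-list shape, and audit sink are all illustrative assumptions.

# Sketch of per-agent tool governance; every name here is illustrative.
import json

AGENT_TOOL_ALLOWLIST = {
    "regulatory-monitor-agent": {"search_regulations", "summarize_document"},
    "remediation-agent": {"create_ticket"},  # sensitive tools sit behind approval workflows
}

def audit_log(event: dict) -> None:
    # Stand-in sink; in practice the gateway's audit store receives this record.
    print(json.dumps(event))

def authorize_tool_call(agent_id: str, tool_name: str) -> bool:
    allowed = tool_name in AGENT_TOOL_ALLOWLIST.get(agent_id, set())
    # Every decision is logged so the agent chain can be reconstructed later.
    audit_log({"agent_id": agent_id, "tool": tool_name, "allowed": allowed})
    return allowed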

🚀

Future-Proofing Your Investment

By 2028, Gartner predicts 58% of business functions will have AI agents managing at least one process daily. The gateway you build today becomes the governance layer for your agentic future. The architecture is ready—the agent orchestration capabilities plug directly into the existing routing and policy infrastructure.

RegRiskIQ Implementation Status

Core infrastructure complete. Phase 2 enhancements in the backlog.

What's Been Built

Core Model Abstraction Layer

  • BaseModelProvider abstract interface with unified contract
  • 8 model capabilities tracked (chat, embeddings, function calling, streaming, vision, code generation, structured output, audio)
  • Built-in metrics: cost, latency, token usage, error rates

5 Provider Implementations

  • OpenAI: GPT-5, o3, o4-mini, GPT-4.1, embeddings
  • Azure OpenAI: Enterprise deployments, Managed Identity
  • AWS Bedrock: Nova 2, Claude 4.5, Llama 4, Mistral Large 3
  • Google Vertex: Gemini 3 Flash/Pro, Model Garden
  • Ollama: Local models, zero API cost

Intelligent Routing Engine

  • 8 routing strategies with configurable weights
  • Quality scoring for 30+ models
  • Automatic fallback chains for high availability
  • Response caching (TTL-based)

Phase 2 Enhancements

Provider Equivalence Testing HIGH

  • Golden evaluation datasets (50+ queries per intent)
  • Batch evaluation runner with quality metrics
  • Provider comparison reports with go/no-go recommendations

Business Value: Enables confident provider switching with quality assurance


Multi-Tenant Gateway MEDIUM

  • Tenant ID extraction from JWT claims
  • Per-tenant rate limiting and provider preferences
  • Tenant isolation for compliance requirements

Business Value: Enterprise-ready multi-tenancy for SaaS deployment


Data Residency Routing MEDIUM

  • Region-based provider filtering
  • Per-tenant allowed_regions configuration
  • Audit logging of residency decisions

Business Value: GDPR/data sovereignty compliance for regulated industries


Architectural Trade-offs

Provider portability is real. But it requires intentional design and ongoing investment.

What We Navigate

Provider Differences Are Real

Tool calling semantics, JSON output reliability, token limits, streaming formats, and content safety features vary across providers. Our adapter layer handles this complexity so your applications stay clean.

Gateway Adds Latency

Every abstraction has overhead. The gateway layer adds processing time for routing, policy evaluation, and request normalization. For latency-sensitive workloads, this impact must be measured and optimized for your specific use cases.

Testing Requires Investment

Proving portability demands evaluation harnesses, golden datasets, and quality metrics across providers. We build this infrastructure as a first-class capability.

How We Mitigate

Adapter Test Harness

Each adapter includes a compatibility test suite that validates behavior against provider-specific edge cases. New provider integrations pass this harness before production.

Performance Budgets

Gateway components are designed with latency budgets in mind. We instrument each stage (policy evaluation, prompt resolution, routing) with OpenTelemetry tracing to identify and address bottlenecks. Specific targets are established during implementation based on measured baselines.

Continuous Evaluation

Automated quality regression tests run against all providers weekly. You get scorecards showing "can I switch to provider X" based on real data.

Implementation Roadmap

A structured approach to implementing the Provider-Portable AI Architecture in RegRiskIQ.

Phase 1: Foundation (Weeks 1-3)
Phase 2: Quality Assurance (Weeks 4-6)
Phase 3: Governance (Weeks 7-10)

Provider-Portable AI Architecture Implementation

Enable RegRiskIQ to leverage multiple foundation model providers without vendor lock-in

#9114 Epic

This epic delivers a complete provider-portable architecture enabling RegRiskIQ to use AWS Bedrock, Google Vertex AI, Azure AI Foundry, OpenAI, and Ollama interchangeably. The architecture separates AI consumption from AI provision through a Model Gateway pattern with centralized governance, observability, and quality assurance.

Centralized Prompt Registry

Version-controlled prompt management with provider-specific variants

#9115 Phase 1

Establish a centralized registry for managing AI prompts as versioned, deployable artifacts. This enables A/B testing, rollback capability, and provider-specific optimizations.

Create prompt_registry database schema
#9122
Acceptance Criteria (BDD)

Given a new RegRiskIQ database migration

When the migration is applied

Then a prompt_registry table exists with columns: id, name, version, template, provider_variants, created_at, is_active

And the version column has a unique constraint with name

And provider_variants is a JSONB column supporting arbitrary provider keys
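
A hedged SQLAlchemy sketch of that table, matching the columns and constraint named in the criteria; the ORM mapping and Postgres dialect are assumptions, and the migration itself remains the authoritative definition.

# Sketch of the prompt_registry table per the criteria above; ORM details are assumptions.
from sqlalchemy import Boolean, Column, DateTime, Integer, String, Text, UniqueConstraint, func
from sqlalchemy.dialects.postgresql import JSONB
from sqlalchemy.orm import declarative_base

Base = declarative_base()

class PromptRegistry(Base):
    __tablename__ = "prompt_registry"
    __table_args__ = (UniqueConstraint("name", "version"),)

    id = Column(Integer, primary_key=True)
    name = Column(String(255), nullable=False)
    version = Column(Integer, nullable=False)
    template = Column(Text, nullable=False)
    provider_variants = Column(JSONB, nullable=False, default=dict)  # arbitrary provider keys
    created_at = Column(DateTime(timezone=True), server_default=func.now())
    is_active = Column(Boolean, nullable=False, default=False)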

Implement Prompt Registry CRUD API
#9123
Acceptance Criteria (BDD)

Given an authenticated API request

When I POST to /api/prompts with a valid prompt template

Then a new prompt is created with version 1

And the response includes the prompt ID and version

Given an existing prompt "regulatory_analysis"

When I PUT to /api/prompts/regulatory_analysis with updated content

Then a new version is created (immutable versioning)

And the previous version remains accessible

Add provider-specific prompt variants
#9124
Acceptance Criteria (BDD)

Given a prompt with provider_variants for "openai" and "anthropic"

When the ModelManager requests the prompt for provider "anthropic"

Then the anthropic-specific variant is returned

And Claude-specific syntax (Human:/Assistant:) is applied

Given a prompt without a variant for provider "bedrock"

When the ModelManager requests the prompt for provider "bedrock"

Then the default template is returned

Implement prompt version rollback
#9125
Acceptance Criteria (BDD)

Given a prompt "risk_scoring" with versions 1, 2, and 3 (active)

When I POST to /api/prompts/risk_scoring/rollback with version=2

Then version 2 becomes the active version

And an audit log entry is created with the rollback details

And subsequent AI requests use version 2

Task-Intent Routing

Route AI requests based on task intent, not just cost/performance

#9116 Phase 1

Extend the ModelManager to understand task intent (regulatory_analysis, risk_scoring, etc.) and route to the optimal provider/prompt combination based on intent-specific requirements.

Define TaskIntent enum
#9126
Acceptance Criteria (BDD)

Given the model_manager.py module

When I import TaskIntent

Then the enum contains: REGULATORY_ANALYSIS, RISK_SCORING, POLICY_SEARCH, COMPLIANCE_CHECK, DOCUMENT_SUMMARIZATION

And each intent has a string value matching its lowercase name
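
The criteria translate almost directly into code; this sketch assumes a str-valued Enum so each member's value matches its lowercase name.

# Direct reading of the acceptance criteria; exact placement in model_manager.py is assumed.
from enum import Enum

class TaskIntent(str, Enum):
    REGULATORY_ANALYSIS = "regulatory_analysis"
    RISK_SCORING = "risk_scoring"
    POLICY_SEARCH = "policy_search"
    COMPLIANCE_CHECK = "compliance_check"
    DOCUMENT_SUMMARIZATION = "document_summarization"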

Extend TaskContext with intent fields
#9127
Acceptance Criteria (BDD)

Given a TaskContext dataclass

When I create a new TaskContext

Then I can specify intent, data_classification, compliance_framework, and tenant_id

And all new fields are optional with sensible defaults

And existing code using TaskContext continues to work without modification
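
A sketch of the extended dataclass, with every new field optional so existing call sites keep working; the field types and the placeholder for existing fields are assumptions.

# Sketch of the extended TaskContext; pre-existing fields are represented by the comment only.
from dataclasses import dataclass
from typing import Optional

@dataclass
class TaskContext:
    # ... existing fields stay exactly as they are today ...
    intent: Optional["TaskIntent"] = None
    data_classification: Optional[str] = None
    compliance_framework: Optional[str] = None
    tenant_id: Optional[str] = None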

Implement intent-to-prompt mapping
#9128
Acceptance Criteria (BDD)

Given a prompt registered with intent "regulatory_analysis"

When an AI request is made with TaskIntent.REGULATORY_ANALYSIS

Then the ModelManager automatically selects the correct prompt

And the provider-specific variant is applied based on the selected provider

Log routing decisions for audit
#9129
Acceptance Criteria (BDD)

Given any AI request processed by the ModelManager

When a provider and prompt are selected

Then an audit log entry is created with: timestamp, intent, selected_provider, selected_model, prompt_id, prompt_version, routing_strategy, tenant_id

And the audit log is queryable for compliance reporting

AI Observability Dashboard

Unified metrics visibility across all AI providers

#9117 Phase 1

Create a unified dashboard exposing AI metrics including provider performance, cost tracking, quality scores, and integration with existing Trust Architecture confidence metrics.

Aggregate ModelManager metrics
#9130
Acceptance Criteria (BDD)

Given the AI Models Service is running

When I GET /api/ai/metrics

Then I receive per-provider statistics: request_count, success_rate, avg_latency_ms, total_cost_usd, error_count

And metrics are available for all 6 providers: openai, azure, bedrock, vertex, ollama, anthropic

Integrate Trust Architecture confidence metrics
#9131
Acceptance Criteria (BDD)

Given the Trust Architecture service is processing requests

When I view the AI Observability Dashboard

Then I see TRAQ confidence scores aggregated by provider

And I see rejection rate due to low confidence thresholds

And I see citation accuracy metrics per intent

Create Grafana AI dashboard
#9132
Acceptance Criteria (BDD)

Given Grafana is deployed with the RegRiskIQ observability stack

When I navigate to the "AI Provider Performance" dashboard

Then I see panels for: latency (p50, p95, p99), cost per provider, success rate, requests per minute

And I can filter by time range, provider, and intent

Provider Equivalence Testing

Validate that switching providers maintains output quality

#9118 Phase 2

Build an evaluation harness with golden datasets to prove that switching from Provider A to Provider B does not degrade output quality beyond acceptable thresholds.

Curate golden evaluation datasets
#9133
Acceptance Criteria (BDD)

Given the evaluation_datasets table in the database

When I query for datasets by intent

Then I find at least 50 queries for each TaskIntent

And each query has a human-validated expected output

And queries are representative of production traffic patterns

Implement batch evaluation runner
#9134
Acceptance Criteria (BDD)

Given a golden dataset for "regulatory_analysis"

When I run: python -m evaluation.runner --intent regulatory_analysis --providers openai,bedrock

Then all queries are executed against both providers

And outputs are stored with provider, latency, cost, and raw response

And a comparison report is generated

Calculate quality metrics
#9135
Acceptance Criteria (BDD)

Given evaluation results from multiple providers

When quality metrics are calculated

Then semantic similarity score is computed using embedding cosine similarity

And citation accuracy is measured against expected sources

And factual consistency is evaluated using NLI models
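
The semantic similarity metric, for instance, reduces to cosine similarity over embedding vectors; the embed() function in this sketch is a stand-in for whatever embedding provider the evaluation harness is configured with.

# Cosine similarity between expected and actual outputs; embed() is a stand-in assumption.
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def semantic_similarity(expected: str, actual: str, embed) -> float:
    # embed() returns a vector for a string; the harness supplies the real implementation
    return cosine_similarity(np.asarray(embed(expected)), np.asarray(embed(actual)))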

Generate provider comparison reports
#9136
Acceptance Criteria (BDD)

Given completed evaluation runs for Provider A and Provider B

When I generate a comparison report

Then the report shows per-intent quality scores for both providers

And a go/no-go recommendation is provided based on >90% equivalence threshold

And specific failing cases are highlighted for review

End-to-End AI Tracing

OpenTelemetry-based trace propagation across all AI services

#9119 Phase 2

Implement distributed tracing that correlates requests from the UI through the API Gateway, ModelManager, and Provider Adapters, enabling full visibility into AI request lifecycles.

Add trace_id to ModelManager
#9137
Acceptance Criteria (BDD)

Given an AI request with an incoming trace context header

When the ModelManager processes the request

Then a child span is created under the parent trace

And the span includes attributes: ai.intent, ai.provider, ai.model, ai.tokens.input, ai.tokens.output

Correlate traces with audit logs
#9138
Acceptance Criteria (BDD)

Given a trace_id from an OpenTelemetry span

When I query the rag_query_audit table

Then I can find the corresponding audit entry by trace_id

And the audit entry includes the full request/response context

Add tracing to provider adapters
#9139
Acceptance Criteria (BDD)

Given a request routed to the Bedrock adapter

When the adapter calls the AWS Bedrock API

Then a child span is created with: provider.name, provider.region, provider.model, provider.latency_ms, provider.cost_usd

And errors are recorded as span events with full exception details

Tenant-Aware Gateway

Multi-tenant support with per-tenant rate limits and preferences

#9120 Phase 3

Enable multi-tenant operation of the Model Gateway with tenant isolation, per-tenant rate limiting, and tenant-specific provider preferences. Conditional on multi-tenant deployment requirements.

Add tenant_id to request context
#9140
Acceptance Criteria (BDD)

Given an authenticated request with a JWT containing tenant_id claim

When the request reaches the ModelManager

Then tenant_id is extracted and added to TaskContext

And all downstream operations are scoped to that tenant

Implement per-tenant rate limiting
#9141
Acceptance Criteria (BDD)

Given tenant "acme" has a rate limit of 100 requests/minute

When tenant "acme" sends their 101st request in a minute

Then a 429 Too Many Requests response is returned

And the response includes Retry-After header

And other tenants are not affected
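
A fixed-window counter keyed by tenant is enough to express the behavior in these criteria; this sketch is illustrative, and the production implementation (for example a Redis-backed sliding window) is an open design choice.

# Sketch of per-tenant fixed-window rate limiting; limits and storage are assumptions.
import time
from collections import defaultdict

LIMIT_PER_MINUTE = 100
_windows: dict[str, tuple[int, int]] = defaultdict(lambda: (0, 0))  # tenant -> (window, count)

def check_rate_limit(tenant_id: str) -> tuple[bool, int]:
    """Returns (allowed, retry_after_seconds); the caller emits 429 + Retry-After."""
    now = int(time.time())
    window = now // 60
    current_window, count = _windows[tenant_id]
    if current_window != window:
        current_window, count = window, 0  # new minute, counter resets per tenant
    if count >= LIMIT_PER_MINUTE:
        return False, 60 - (now % 60)
    _windows[tenant_id] = (current_window, count + 1)
    return True, 0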

Add tenant provider preferences
#9142
Acceptance Criteria (BDD)

Given tenant "acme" has preferred_providers: ["bedrock", "azure"]

When the routing engine selects a provider for tenant "acme"

Then only bedrock and azure are considered

And openai and ollama are excluded from routing decisions

Data Residency Routing

Route requests based on data residency and compliance requirements

#9121 Phase 3

Implement simple, configuration-driven routing rules to ensure data residency compliance. Providers in disallowed regions are automatically excluded from routing decisions.

Add allowed_regions to tenant config
#9143
Acceptance Criteria (BDD)

Given a tenant configuration schema

When I configure tenant "eu_bank" with allowed_regions: ["eu-west-1", "eu-central-1"]

Then the configuration is validated and stored

And the routing engine can query allowed regions for any tenant

Filter providers by region
#9144
Acceptance Criteria (BDD)

Given tenant "eu_bank" with allowed_regions: ["eu-west-1"]

And provider "bedrock" is configured for region "us-east-1"

When the routing engine selects providers

Then "bedrock" is excluded from eligible providers

And only EU-region providers are considered
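
The filtering itself stays small; in this sketch the provider-to-region mapping and function name are illustrative assumptions.

# Sketch: drop providers whose configured region is outside the tenant's allow-list.
def eligible_providers(providers: dict[str, str], allowed_regions: list[str]) -> list[str]:
    """providers maps provider name -> configured region, e.g. {"bedrock": "us-east-1"}."""
    return [name for name, region in providers.items() if region in allowed_regions]

# Example from the criteria above: bedrock in us-east-1 is excluded for eu_bank
eligible = eligible_providers({"bedrock": "us-east-1", "vertex": "eu-west-1"}, ["eu-west-1"])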

Log data residency decisions
#9145
Acceptance Criteria (BDD)

Given a routing decision that excludes providers due to data residency

When the decision is logged

Then the audit log includes: tenant_id, allowed_regions, excluded_providers, selected_provider, reason

And compliance officers can query all residency-based routing decisions

Ready to Move Forward?

Your path to provider-portable AI compliance starts with a structured engagement.

1. Architecture Review with Technical Teams
2. Pilot Deployment Scope Definition
3. Provider & Policy Configuration
4. Production Deployment