# Architecture

This document describes the internal architecture of AAP Bridge.

## Overview

AAP Bridge follows an ETL (Extract, Transform, Load) architecture, surfaced as the `export`, `transform`, and `import` commands, with state management for checkpointing and idempotency.
```mermaid
flowchart TB
    subgraph CLI["CLI Layer"]
        prep[prep]
        export[export]
        transform[transform]
        import_cmd[import]
        cleanup_cli[cleanup]
    end

    subgraph Migration["Migration Layer"]
        SchemaComparator[Schema Comparator]
        Exporter[Exporter]
        Transformer[Transformer]
        Importer[Importer]
        Cleanup[Cleanup]
    end

    subgraph Client["Client Layer"]
        AAPSource[AAPSourceClient]
        AAPTarget[AAPTargetClient]
    end

    subgraph State["State Management"]
        MigrationState[MigrationState]
        IDMapping[IDMapping]
        Progress[Progress]
        PostgreSQL[(PostgreSQL)]
    end

    prep --> SchemaComparator
    export --> Exporter
    transform --> Transformer
    import_cmd --> Importer
    cleanup_cli --> Cleanup

    SchemaComparator --> AAPSource
    Exporter --> AAPSource
    Importer --> AAPTarget
    Cleanup --> AAPTarget

    AAPSource --> MigrationState
    AAPTarget --> MigrationState

    MigrationState --> IDMapping
    MigrationState --> Progress
    MigrationState --> PostgreSQL
```
## Directory Structure

```text
src/aap_migration/
├── cli/                     # Command-line interface
│   ├── main.py              # Entry point, command groups
│   ├── menu.py              # Interactive menu
│   ├── commands/            # Individual commands
│   │   ├── prep.py
│   │   ├── export_import.py
│   │   ├── cleanup.py
│   │   └── ...
│   └── utils.py             # CLI utilities
├── client/                  # HTTP clients
│   ├── aap_source_client.py
│   ├── aap_target_client.py
│   ├── vault_client.py
│   └── bulk_operations.py
├── migration/               # Core ETL logic
│   ├── coordinator.py       # Orchestration
│   ├── exporter.py          # Export logic
│   ├── transformer.py       # Transform logic
│   ├── importer.py          # Import logic
│   └── state.py             # State management
├── schema/                  # Schema handling
│   ├── comparator.py
│   └── models.py
├── validation/              # Validation logic
├── reporting/               # Progress and reports
│   ├── live_progress.py
│   └── report.py
├── config.py                # Configuration
├── resources.py             # Resource registry
└── utils/                   # Utilities
    ├── logging.py
    └── idempotency.py
```
## Key Components

### Client Layer

#### AAPSourceClient

HTTP client for the source AAP (2.3/2.4):

- Base URL: `/api/v2/`
- Handles pagination
- Rate limiting and retries
- Authentication via token
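Pagination drives everything downstream, so it is worth seeing the shape of the loop. A minimal sketch, assuming `httpx` and AWX-style `next`/`results` response envelopes; the class name, method, and Bearer-token header are illustrative, not the real client:

```python
import httpx


class SourceClientSketch:
    """Illustrative only: paginated GET against /api/v2/ with token auth."""

    def __init__(self, base_url: str, token: str) -> None:
        self._client = httpx.AsyncClient(
            base_url=base_url,
            headers={"Authorization": f"Bearer {token}"},  # assumed auth scheme
            timeout=30.0,
        )

    async def paginate(self, endpoint: str):
        # AWX-style pagination: each page carries a `next` URL until exhausted.
        url = f"/api/v2/{endpoint}"
        while url:
            response = await self._client.get(url)
            response.raise_for_status()
            page = response.json()
            for item in page["results"]:
                yield item
            url = page.get("next")
```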
#### AAPTargetClient

HTTP client for the target AAP (2.6+):

- Base URL: `/api/controller/v2/` (Platform Gateway)
- Bulk operation support
- Same retry/rate limiting logic
### Migration Layer

#### Exporter

Exports resources from source AAP:

```python
from collections.abc import AsyncGenerator


class ResourceExporter:
    async def export_resources(
        self,
        resource_type: str,
        endpoint: str,
        page_size: int = 100,
    ) -> AsyncGenerator[dict, None]:
        """Paginate through all resources."""
        ...
```

Resource-specific exporters inherit from `ResourceExporter`:

- `OrganizationExporter`
- `InventoryExporter`
- `HostExporter`
- etc.
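Because `export_resources` is an async generator, callers consume it with `async for` and never hold a full dataset in memory. A hedged usage sketch (the function below is invented for illustration):

```python
import json


async def preview_hosts(exporter: ResourceExporter) -> None:
    # Stream records from the source; only the current page is in memory.
    async for host in exporter.export_resources("hosts", "hosts/", page_size=100):
        print(json.dumps(host)[:80])
```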
#### Transformer

Transforms data between AAP versions:

```python
from typing import Any


class DataTransformer:
    DEPENDENCIES = {"organization": "organizations"}

    async def transform(
        self,
        data: dict[str, Any],
        state: MigrationState,
    ) -> dict[str, Any]:
        """Apply transformations."""
        ...
```
Transformations include (sketched after this list):
- Field renames
- Type conversions
- Dependency resolution
- Default value injection
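A hedged sketch of the first three steps; every field name and default below is invented for illustration, not taken from the real transform tables:

```python
from typing import Any

DEPRECATED = {"custom_virtualenv"}             # hypothetical deprecated field
FIELD_RENAMES = {"summary_fields": "summary"}  # hypothetical rename
DEFAULTS = {"description": ""}                 # hypothetical default


def apply_basic_transforms(data: dict[str, Any]) -> dict[str, Any]:
    # Drop deprecated fields, rename changed ones, then inject defaults.
    out = {k: v for k, v in data.items() if k not in DEPRECATED}
    for old, new in FIELD_RENAMES.items():
        if old in out:
            out[new] = out.pop(old)
    for key, value in DEFAULTS.items():
        out.setdefault(key, value)
    return out
```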
#### Importer

Imports resources to target AAP:

```python
from typing import Any


class ResourceImporter:
    async def import_resource(
        self,
        resource_type: str,
        source_id: int,
        data: dict[str, Any],
    ) -> dict[str, Any] | None:
        """Import single resource."""
        ...
```
Special handling covers (see the sketch after this list):
- Bulk operations (hosts)
- Conflict resolution
- ID mapping
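A hedged sketch of the idempotent import loop this enables. The `MigrationState` methods match the interface shown under State Management; the wrapper function and the `"id"` key are assumptions:

```python
from typing import Any


async def import_one(
    importer: ResourceImporter,
    state: MigrationState,
    resource_type: str,
    source_id: int,
    data: dict[str, Any],
) -> None:
    # Skip work a previous (possibly interrupted) run already finished.
    if state.is_migrated(resource_type, source_id):
        return
    created = await importer.import_resource(resource_type, source_id, data)
    if created is not None:
        # Record the source->target mapping so dependents can resolve FKs.
        state.mark_completed(resource_type, source_id, created["id"])
```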
### State Management

#### MigrationState

Central state manager backed by PostgreSQL:

```python
class MigrationState:
    def mark_completed(
        self,
        resource_type: str,
        source_id: int,
        target_id: int,
    ) -> None: ...

    def get_mapped_id(
        self,
        resource_type: str,
        source_id: int,
    ) -> int | None: ...

    def is_migrated(
        self,
        resource_type: str,
        source_id: int,
    ) -> bool: ...
```
#### Database Schema

```sql
-- ID mappings
CREATE TABLE id_mapping (
    resource_type VARCHAR(100),
    source_id INTEGER,
    target_id INTEGER,
    source_name VARCHAR(512),
    PRIMARY KEY (resource_type, source_id)
);

-- Migration progress
CREATE TABLE migration_progress (
    resource_type VARCHAR(100),
    source_id INTEGER,
    status VARCHAR(50),
    error_message TEXT,
    updated_at TIMESTAMP
);
```
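The composite primary key on `(resource_type, source_id)` means `get_mapped_id` can be a single indexed read. A hedged sketch of that lookup, assuming psycopg 3 as the driver (the real state layer may differ):

```python
import psycopg


def get_mapped_id(conn: psycopg.Connection, resource_type: str, source_id: int) -> int | None:
    # One indexed read on the (resource_type, source_id) primary key.
    row = conn.execute(
        "SELECT target_id FROM id_mapping WHERE resource_type = %s AND source_id = %s",
        (resource_type, source_id),
    ).fetchone()
    return row[0] if row else None
```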
### Resource Registry

Central registry of all resource types:

```python
RESOURCE_REGISTRY = {
    "organizations": ResourceTypeInfo(
        name="organizations",
        endpoint="organizations/",
        migration_order=20,
        cleanup_order=100,
        has_exporter=True,
        has_importer=True,
    ),
    ...
}
```
Controls (ordering sketched after this list):
- Migration order (dependencies first)
- Cleanup order (dependents first)
- Batch sizes
- Bulk API usage
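Both orders fall out of the registry directly. A hedged sketch of how a coordinator could derive them; the attribute names match the entry above, while the sorting helpers are assumptions:

```python
def migration_sequence(registry: dict[str, "ResourceTypeInfo"]) -> list[str]:
    # Dependencies first: lower migration_order runs earlier.
    return sorted(registry, key=lambda name: registry[name].migration_order)


def cleanup_sequence(registry: dict[str, "ResourceTypeInfo"]) -> list[str]:
    # Dependents first: lower cleanup_order is cleaned up earlier.
    return sorted(registry, key=lambda name: registry[name].cleanup_order)
```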
## Data Flow

### Export Flow

```text
Source AAP
    │
    ▼
AAPSourceClient.get()
    │
    ▼
ResourceExporter.export_resources()
    │
    ▼
File Writer (split by records-per-file)
    │
    ▼
exports/{resource_type}/{resource_type}_XXXX.json
```
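A hedged sketch of the file-splitting writer at the end of this flow; the helper, chunk size, and JSON layout are illustrative, while the `exports/{resource_type}/{resource_type}_XXXX.json` naming comes from the diagram above:

```python
import json
from collections.abc import AsyncIterator
from pathlib import Path


async def write_export_files(
    records: AsyncIterator[dict],
    resource_type: str,
    records_per_file: int = 1000,  # illustrative default
) -> None:
    out_dir = Path("exports") / resource_type
    out_dir.mkdir(parents=True, exist_ok=True)
    buffer: list[dict] = []
    file_index = 0

    async for record in records:
        buffer.append(record)
        if len(buffer) >= records_per_file:
            # Flush a full chunk as {resource_type}_XXXX.json.
            path = out_dir / f"{resource_type}_{file_index:04d}.json"
            path.write_text(json.dumps(buffer))
            buffer, file_index = [], file_index + 1

    if buffer:  # flush the final partial chunk
        path = out_dir / f"{resource_type}_{file_index:04d}.json"
        path.write_text(json.dumps(buffer))
```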
### Transform Flow

```text
exports/{resource_type}/*.json
    │
    ▼
DataTransformer.transform()
    │
    ├── Remove deprecated fields
    ├── Rename changed fields
    ├── Add default values
    └── Resolve dependencies
    │
    ▼
transformed/{resource_type}/*.json
```
### Import Flow

```text
transformed/{resource_type}/*.json
    │
    ▼
ResourceImporter.import_resource()
    │
    ├── Check if already migrated (state)
    ├── Resolve FK dependencies (ID mapping)
    ├── Create/Update resource
    └── Record mapping
    │
    ▼
Target AAP (via AAPTargetClient)
```
## Extension Points

### Adding Resource Types

New resource types are registered in `RESOURCE_REGISTRY` (see Resource Registry above) with an endpoint, migration/cleanup ordering, and exporter/importer flags.
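A hedged example entry for a hypothetical new type; the field values are invented, and only the `ResourceTypeInfo` fields shown under Resource Registry are taken from this document:

```python
# Illustrative only: register a hypothetical "labels" resource type.
RESOURCE_REGISTRY["labels"] = ResourceTypeInfo(
    name="labels",
    endpoint="labels/",
    migration_order=30,   # after organizations (20), which labels depend on
    cleanup_order=90,     # cleaned up before organizations (100)
    has_exporter=True,
    has_importer=True,
)
```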
### Custom Transformers

Create a custom transformer for complex transformations:

```python
from typing import Any


class CustomTransformer(DataTransformer):
    DEPENDENCIES = {"organization": "organizations"}
    REQUIRED_DEPENDENCIES = {"organization"}

    async def transform(
        self,
        data: dict[str, Any],
        state: MigrationState,
    ) -> dict[str, Any]:
        data = await super().transform(data, state)
        # Custom logic here
        return data
```
### Custom Importers

Override import behavior:

```python
from typing import Any


class CustomImporter(ResourceImporter):
    async def import_resource(
        self,
        resource_type: str,
        source_id: int,
        data: dict[str, Any],
    ) -> dict[str, Any] | None:
        # Custom pre-processing
        ...
        return await super().import_resource(resource_type, source_id, data)
```
## Performance Considerations

### Concurrency

- Configurable via `max_concurrent`
- Uses `asyncio.Semaphore` for limiting (sketched below)
- Default: 10 concurrent requests
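A minimal sketch of the semaphore pattern; only `asyncio.Semaphore` and the default of 10 come from this document:

```python
import asyncio

MAX_CONCURRENT = 10  # document default; configurable via max_concurrent
semaphore = asyncio.Semaphore(MAX_CONCURRENT)


async def limited(coro_factory):
    # At most MAX_CONCURRENT coroutines pass this point at once.
    async with semaphore:
        return await coro_factory()
```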
### Bulk Operations

- Hosts: 200 per bulk request
- Uses the `/bulk/host_create` endpoint (sketched below)
- Significantly faster than individual creates
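A hedged sketch of the batching logic; the endpoint and batch size come from the list above, while the client method and payload shape are assumptions:

```python
from typing import Any


async def bulk_create_hosts(client: "AAPTargetClient", hosts: list[dict[str, Any]]) -> None:
    BATCH = 200  # documented bulk request size for hosts
    for start in range(0, len(hosts), BATCH):
        batch = hosts[start:start + BATCH]
        # Assumed client method and payload; the real AAPTargetClient may differ.
        await client.post("/bulk/host_create", json={"hosts": batch})
```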
### Rate Limiting

- Configurable requests per second
- Exponential backoff on 429 responses (sketched below)
- Respects `Retry-After` headers
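A hedged sketch of the 429 handling; the retry count and base delay are invented, and `httpx` is an assumption:

```python
import asyncio
import httpx


async def get_with_backoff(
    client: httpx.AsyncClient, url: str, retries: int = 5
) -> httpx.Response:
    delay = 1.0  # illustrative base delay in seconds
    for _ in range(retries):
        response = await client.get(url)
        if response.status_code != 429:
            return response
        # Prefer the server's Retry-After hint; otherwise back off exponentially.
        retry_after = response.headers.get("Retry-After")
        await asyncio.sleep(float(retry_after) if retry_after else delay)
        delay *= 2
    return response
```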
### Memory Management
- Streaming exports (generator-based)
- File splitting for large datasets
- Batched imports