How Oblix Works¶

Oblix is an AI model orchestration SDK that intelligently switches between edge (local) and cloud-based models based on connectivity, system resources, and other factors. This page explains the core architecture and concepts that power Oblix.

Core Architecture¶

Oblix Architecture Diagram

Oblix consists of several key components:

Client Interface - The main entry point (OblixClient) that developers interact with
Model Management - Handles various model types from different providers
Agent System - Monitors system state and makes orchestration decisions
Session Management - Maintains persistent chat sessions
Orchestration Engine - Manages the execution flow between edge and cloud models

Intelligent Orchestration¶

Oblix's intelligent orchestration system works by:

Monitoring Resources - Tracking CPU, memory, and GPU utilization
Checking Connectivity - Monitoring network quality and availability
Applying Policies - Using configurable policies to make orchestration decisions
Executing Actions - Dynamically switching between models based on current conditions

Example Orchestration Scenarios¶

Scenario	Resource State	Connectivity	Orchestration Decision
Offline work	Available	Disconnected	Switch to edge Ollama model
Limited bandwidth	Available	Degraded	Use smaller cloud model or edge model
Resource constrained	Constrained	Optimal	Switch to cloud model
Optimal conditions	Available	Optimal	Prefer edge model for speed

Agent System¶

Agents are pluggable components that monitor specific aspects of the system to enable orchestration:

ResourceMonitor¶

Tracks system resources including: - CPU utilization and load - Memory availability - GPU availability and utilization - Real-time GPU metrics on macOS (utilization percentage and memory usage)

Based on resource metrics, it recommends the optimal execution target (local CPU, local GPU, or cloud).

ConnectivityAgent¶

Monitors network connectivity including: - Connection type (wifi, ethernet, etc.) - Latency and packet loss - Available bandwidth - Connection stability

The connectivity agent adapts to changing network conditions, ensuring reliability even in challenging environments.

Model Abstraction¶

Oblix provides a unified interface for working with different model types:

Ollama Models - Edge models running locally
OpenAI Models - Cloud models from OpenAI (GPT series)
Claude Models - Cloud models from Anthropic (Claude series)

Each model implementation handles: - Authentication and API communication - Token counting and management - Performance metrics collection - Error handling and retries

Session Management¶

Oblix includes a built-in session management system that:

Maintains conversation history
Provides persistence across application restarts
Enables multi-session workflows
Handles context management for stateful conversations

Orchestration Flow¶

When you execute a prompt with Oblix, here's what happens behind the scenes:

Pre-execution Checks - All registered agents perform checks and make recommendations
Orchestration Decision - The system selects the optimal model based on agent recommendations
Context Retrieval - If in a session, relevant context is loaded
Execution - The prompt is sent to the selected model for processing
Metrics Collection - Performance metrics are collected during execution
Response Handling - The response is returned along with metadata
Session Update - If in a session, the interaction is saved

Configuration System¶

Oblix stores configuration persistently, including:

Model configurations
API keys (securely)
Custom thresholds
User preferences

The configuration system provides portability and simplifies repeated use.

Optimizing for Developer Experience¶

Oblix abstracts away the complexity of working with multiple models, allowing developers to:

Write code once that works across changing environments
Gracefully handle offline scenarios
Optimize for performance and cost
Maintain conversation context seamlessly

This design ensures that AI applications built with Oblix are more resilient, adaptive, and user-friendly than traditional approaches.