MagicAF’s extensibility comes from its trait-based design. Every major component is accessed through an async trait, and every trait can be implemented by your application.
Infrastructure Traits
These traits define the fundamental AI building blocks.
EmbeddingService
Produces dense vector embeddings from text input.
#[async_trait]
pub trait EmbeddingService: Send + Sync {
    /// Embed a batch of input strings, returning one vector per input.
    async fn embed(&self, inputs: &[String]) -> Result<Vec<Vec<f32>>>;

    /// Embed a single string.
    async fn embed_single(&self, input: &str) -> Result<Vec<f32>>;

    /// Verify the upstream service is reachable.
    async fn health_check(&self) -> Result<()>;
}
Shipped implementation: LocalEmbeddingService — calls any OpenAI-compatible /v1/embeddings endpoint.
You might implement this for: ONNX Runtime, CoreML, TensorFlow Lite, gRPC endpoints, or in-process inference.
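A custom backend only needs those three methods. Here is a minimal sketch, assuming a hypothetical in-process model handle (MyLocalModel, with a synchronous encode method) and the crate's Result alias:

use async_trait::async_trait;

/// Hypothetical in-process backend; `MyLocalModel` and its `encode`
/// method are placeholders for your own inference handle.
pub struct InProcessEmbeddingService {
    model: MyLocalModel,
}

#[async_trait]
impl EmbeddingService for InProcessEmbeddingService {
    async fn embed(&self, inputs: &[String]) -> Result<Vec<Vec<f32>>> {
        // Reuse the single-string path for each input.
        let mut vectors = Vec::with_capacity(inputs.len());
        for input in inputs {
            vectors.push(self.embed_single(input).await?);
        }
        Ok(vectors)
    }

    async fn embed_single(&self, input: &str) -> Result<Vec<f32>> {
        // Placeholder: assumes `encode` is synchronous and infallible.
        Ok(self.model.encode(input))
    }

    async fn health_check(&self) -> Result<()> {
        // In-process inference has no upstream service to probe.
        Ok(())
    }
}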
VectorStore
Stores, searches, and manages dense vectors with JSON payloads.
#[async_trait]
pub trait VectorStore: Send + Sync {
    /// Index a batch of embeddings with payloads.
    async fn index(
        &self,
        collection: &str,
        embeddings: Vec<Vec<f32>>,
        payloads: Vec<serde_json::Value>,
    ) -> Result<()>;

    /// Nearest-neighbor search.
    async fn search(
        &self,
        collection: &str,
        query_vector: Vec<f32>,
        limit: usize,
        filter: Option<serde_json::Value>,
    ) -> Result<Vec<SearchResult>>;

    /// Delete all vectors for a given entity ID.
    async fn delete_by_entity(
        &self,
        collection: &str,
        entity_id: Uuid,
    ) -> Result<()>;

    /// Ensure a collection exists with the given vector dimensions.
    async fn ensure_collection(
        &self,
        collection: &str,
        vector_size: usize,
    ) -> Result<()>;
}
Shipped implementations:
QdrantVectorStore — Qdrant REST API
InMemoryVectorStore — zero-dependency, in-process store
You might implement this for: Milvus, Weaviate, pgvector, FAISS, or any custom backend.
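Whichever backend you target, callers drive it through the same four methods. A caller-side sketch, assuming the crate's Result alias and that SearchResult exposes a score field:

use serde_json::json;

async fn index_and_search(store: &impl VectorStore) -> Result<()> {
    // Make sure the collection exists for 384-dimensional vectors.
    store.ensure_collection("docs", 384).await?;

    // Index one embedding with its JSON payload.
    store
        .index("docs", vec![vec![0.1_f32; 384]], vec![json!({ "title": "hello" })])
        .await?;

    // Top-5 nearest neighbors, no payload filter.
    let hits = store.search("docs", vec![0.1_f32; 384], 5, None).await?;
    for hit in &hits {
        println!("score = {}", hit.score); // assumed field on SearchResult
    }
    Ok(())
}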
LlmService
Sends chat completion requests to a language model.
#[async_trait]
pub trait LlmService: Send + Sync {
    /// Structured chat completion request.
    async fn chat(&self, request: ChatRequest) -> Result<ChatResponse>;

    /// Convenience: turn a prompt into generated text.
    async fn generate(&self, prompt: &str, config: GenerationConfig) -> Result<String>;

    /// Verify the upstream service is reachable.
    async fn health_check(&self) -> Result<()>;
}
Shipped implementation: LocalLlmService — calls any OpenAI-compatible /v1/chat/completions endpoint.
You might implement this for: gRPC model servers, in-process inference (llama.cpp bindings), or custom APIs.
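The skeleton below shows where a custom transport would plug in; the transport-specific bodies are left as todo!, and the ChatRequest, ChatResponse, and GenerationConfig shapes come from the crate:

use async_trait::async_trait;

/// Skeleton for a hypothetical custom model server client.
pub struct MyModelClient {
    endpoint: String, // assumed field: wherever your server lives
}

#[async_trait]
impl LlmService for MyModelClient {
    async fn chat(&self, request: ChatRequest) -> Result<ChatResponse> {
        // Transport-specific: forward `request` over gRPC, HTTP, etc.
        todo!("send request to {}", self.endpoint)
    }

    async fn generate(&self, prompt: &str, config: GenerationConfig) -> Result<String> {
        // Typically built on `chat`: wrap the prompt in a one-message
        // ChatRequest and extract the reply text.
        todo!()
    }

    async fn health_check(&self) -> Result<()> {
        todo!("ping {}", self.endpoint)
    }
}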
Adapter Traits
These traits are the domain extension seam — where your application-specific logic plugs in.
EvidenceFormatter
Converts vector search results into a text block for the LLM prompt.
#[async_trait]
pub trait EvidenceFormatter: Send + Sync {
    async fn format_evidence(&self, results: &[SearchResult]) -> Result<String>;
}
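As a sketch, a formatter that renders each hit as a numbered line might look like this (it assumes SearchResult exposes score and payload fields; check those against the actual type):

use async_trait::async_trait;

/// Renders hits as "1. (0.92) {payload}" lines.
pub struct NumberedEvidenceFormatter;

#[async_trait]
impl EvidenceFormatter for NumberedEvidenceFormatter {
    async fn format_evidence(&self, results: &[SearchResult]) -> Result<String> {
        let mut block = String::new();
        for (i, hit) in results.iter().enumerate() {
            // assumed fields: `score: f32`, `payload: serde_json::Value`
            block.push_str(&format!("{}. ({:.2}) {}\n", i + 1, hit.score, hit.payload));
        }
        Ok(block)
    }
}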
PromptBuilder
Assembles the final prompt from the user query and formatted evidence.
#[async_trait]
pub trait PromptBuilder: Send + Sync {
    async fn build_prompt(&self, query: &str, evidence: &str) -> Result<String>;
}
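A concrete builder is typically just a template. A minimal sketch:

use async_trait::async_trait;

/// Wraps the query and evidence in a plain instruction template.
pub struct QaPromptBuilder;

#[async_trait]
impl PromptBuilder for QaPromptBuilder {
    async fn build_prompt(&self, query: &str, evidence: &str) -> Result<String> {
        Ok(format!(
            "Answer the question using only the evidence below.\n\n\
             Evidence:\n{evidence}\n\nQuestion: {query}\nAnswer:"
        ))
    }
}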
ResultParser<T>
Parses raw LLM output into a strongly-typed domain result.
#[async_trait]
pub trait ResultParser<T>: Send + Sync {
    async fn parse_result(&self, raw_output: &str) -> Result<T>;
}
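If your prompt asks the model to reply with JSON, the parser can deserialize directly into a domain type. A sketch with a made-up Verdict type, assuming the crate's error type converts from serde_json::Error:

use async_trait::async_trait;
use serde::Deserialize;

/// Example domain type the LLM is asked to emit as JSON.
#[derive(Deserialize)]
pub struct Verdict {
    pub label: String,
    pub confidence: f32,
}

pub struct JsonVerdictParser;

#[async_trait]
impl ResultParser<Verdict> for JsonVerdictParser {
    async fn parse_result(&self, raw_output: &str) -> Result<Verdict> {
        // Trim whitespace the model may add around the JSON.
        Ok(serde_json::from_str(raw_output.trim())?)
    }
}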
Thread Safety
All traits require Send + Sync, so implementations can be shared across threads and used concurrently from Tokio tasks. MagicAF is fully tokio-native and uses #[async_trait] throughout.
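As a sketch of what that enables (assuming the crate's Result alias), a service behind an Arc can serve many tasks at once:

use std::sync::Arc;

/// Fan batches out across Tokio tasks; `Send + Sync` lets every task
/// use the same service through the Arc.
async fn embed_concurrently(
    service: Arc<dyn EmbeddingService>,
    batches: Vec<Vec<String>>,
) -> Result<Vec<Vec<Vec<f32>>>> {
    let handles: Vec<_> = batches
        .into_iter()
        .map(|batch| {
            let svc = Arc::clone(&service);
            tokio::spawn(async move { svc.embed(&batch).await })
        })
        .collect();

    let mut results = Vec::with_capacity(handles.len());
    for handle in handles {
        // For brevity, propagate task panics with expect.
        results.push(handle.await.expect("task panicked")?);
    }
    Ok(results)
}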
Trait Composition
The RAGWorkflow is generic over all traits. This means the compiler statically dispatches all calls — zero runtime overhead from the trait-based design:
// All type parameters are resolved at compile time
let workflow: RAGWorkflow<
    LocalEmbeddingService,
    QdrantVectorStore,
    LocalLlmService,
    DefaultEvidenceFormatter,
    DefaultPromptBuilder,
    RawResultParser,
    String,
> = RAGWorkflow::builder()
    // ...
    .build()?;
In practice, Rust infers all type parameters from the builder calls — you never need to write them out.