Call Routing

ActualLab.Rpc supports flexible call routing, enabling scenarios like:

  • Sharding – Route calls to specific servers based on a shard key
  • Load balancing – Distribute calls across multiple backend servers
  • Affinity routing – Route calls based on user ID, entity ID, or other attributes
  • Dynamic topology – Handle server additions/removals without client restarts

Core Concepts

RouterFactory

The RouterFactory is the entry point for custom routing. It's configured via RpcOutboundCallOptions:

cs
services.AddSingleton(_ => RpcOutboundCallOptions.Default with {
    RouterFactory = methodDef => args => {
        // Return RpcPeerRef based on method and arguments
        return RpcPeerRef.Default;
    }
});

The factory receives an RpcMethodDef and returns a function that maps ArgumentList to RpcPeerRef. This two-level design allows you to:

  1. Inspect the method definition once (outer function)
  2. Make per-call routing decisions based on arguments (inner function)
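
To make the split concrete, here's a hypothetical sketch that resolves everything derivable from the method definition in the outer function and keeps the per-call inner function cheap (IShardedService and GetShardPeerRef are illustration-only assumptions, not library APIs):

cs
services.AddSingleton(_ => RpcOutboundCallOptions.Default with {
    RouterFactory = methodDef => {
        // Outer function: runs once per method, so do the type checks here
        if (!typeof(IShardedService).IsAssignableFrom(methodDef.Service.Type))
            return _ => RpcPeerRef.Default; // Non-sharded services: no per-call work at all

        // Inner function: runs on every call, so keep it cheap
        return args => {
            var shardKey = args.Get<string>(0); // Assumes the first argument is the shard key
            return GetShardPeerRef(shardKey);   // Hypothetical helper returning a cached RpcPeerRef
        };
    }
});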

RpcPeerRef

RpcPeerRef identifies the target peer for a call:

| Type | Description |
|---|---|
| RpcPeerRef.Default | The default remote peer (for single-server scenarios) |
| RpcPeerRef.Local | Execute locally (bypass RPC) |
| RpcPeerRef.Loopback | In-process loopback (for testing) |
| Custom RpcPeerRef | Your own peer reference with routing state |
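
Custom peer refs and RpcPeerRef.Local combine naturally: a router can short-circuit calls whose target lives on the current host. A hypothetical sketch (GetShardHostId, GetRemotePeerRef, and currentHostId are illustration-only helpers):

cs
Func<RpcMethodDef, Func<ArgumentList, RpcPeerRef>> routerFactory =
    methodDef => args => {
        var targetHostId = GetShardHostId(args); // Hypothetical: maps arguments to a host ID
        return targetHostId == currentHostId
            ? RpcPeerRef.Local                   // Same host: bypass RPC entirely
            : GetRemotePeerRef(targetHostId);    // Hypothetical: returns a cached remote peer ref
    };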

RpcRouteState

RpcRouteState enables dynamic rerouting when the target peer changes:

cs
public class MyPeerRef : RpcPeerRef
{
    public MyPeerRef(string targetId)
    {
        HostInfo = targetId;
        RouteState = new RpcRouteState();

        // Start monitoring for topology changes.
        // WaitForTopologyChange() is a placeholder for your own topology watcher.
        _ = Task.Run(async () => {
            await WaitForTopologyChange();
            RouteState.MarkChanged(); // Triggers rerouting
        });
    }
}

When RouteState.MarkChanged() is called:

  1. Active calls on this peer receive RpcRerouteException
  2. The RPC interceptor catches the exception
  3. After a delay (ReroutingDelays), the call is rerouted via RouterFactory

RpcRerouteException

RpcRerouteException signals that a call must be rerouted to a different peer. It's thrown automatically when:

  • RouteState.MarkChanged() is called on the peer's RpcPeerRef
  • An inbound call arrives at a server that's no longer responsible for the shard/entity

cs
// Throwing manually (e.g., in a service method)
throw RpcRerouteException.MustReroute("Target server changed");
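
For instance, a server-side method in a sharded service might guard against calls that arrive after shard ownership has moved. A minimal hypothetical sketch (OwnsShard and the method shape are assumptions, not sample code):

cs
public Task Post(Chat_Post command, CancellationToken cancellationToken)
{
    // Hypothetical ownership check: this host may have lost the shard after routing happened
    if (!OwnsShard(command.ChatId))
        throw RpcRerouteException.MustReroute("This host no longer owns the shard.");

    // ... normal processing
    return Task.CompletedTask;
}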

Simple Example: Hash-Based Routing

This example from the MultiServerRpc sample routes chat calls based on chat ID hash:

cs
const int serverCount = 2;
var serverUrls = Enumerable.Range(0, serverCount)
    .Select(i => $"http://localhost:{22222 + i}/")
    .ToArray();
var clientPeerRefs = serverUrls
    .Select(url => RpcPeerRef.NewClient(url))
    .ToArray();

services.AddSingleton(_ => RpcOutboundCallOptions.Default with {
    RouterFactory = methodDef => args => {
        if (methodDef.Service.Type == typeof(IChat)) {
            var arg0Type = args.GetType(0);
            int hash;
            if (arg0Type == typeof(string))
                hash = args.Get<string>(0).GetXxHash3();
            else if (arg0Type == typeof(Chat_Post))
                hash = args.Get<Chat_Post>(0).ChatId.GetXxHash3();
            else
                throw new NotSupportedException("Can't route this call.");

            return clientPeerRefs[hash.PositiveModulo(serverCount)];
        }
        return RpcPeerRef.Default;
    }
});

Key points:

  • Routes IChat calls based on the first argument (chat ID or command)
  • Uses GetXxHash3() for a stable hash that doesn't change between runs (unlike string.GetHashCode())
  • Falls back to RpcPeerRef.Default for other services
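
With this router registered, routing stays invisible to callers; every call carrying the same chat ID lands on the same server. A sketch (the IChat method shapes are assumed for illustration):

cs
var chat = services.GetRequiredService<IChat>();

// Both calls hash "chat-42", so both go to the same backend server
await chat.Post(new Chat_Post("chat-42", "Hello!"), CancellationToken.None); // Routed by the command's ChatId
var text = await chat.GetTail("chat-42", CancellationToken.None);            // Hypothetical method; routed by the chat ID string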

Advanced Example: Dynamic Mesh Routing

The MeshRpc sample demonstrates dynamic routing with automatic rerouting when topology changes.

Custom PeerRef with RouteState

cs
public sealed class RpcShardPeerRef : RpcPeerRef
{
    private static readonly ConcurrentDictionary<ShardRef, LazySlim<ShardRef, RpcShardPeerRef>> Cache = new();

    public ShardRef ShardRef { get; }
    public string HostId { get; }

    public static RpcShardPeerRef Get(ShardRef shardRef)
        => Cache.GetOrAdd(shardRef,
            static key => LazySlim.New(key, k => new RpcShardPeerRef(k))).Value;

    private RpcShardPeerRef(ShardRef shardRef)
    {
        var meshState = MeshState.State.Value;
        ShardRef = shardRef;
        HostId = meshState.GetShardHost(shardRef)?.Id ?? "null";
        HostInfo = $"{shardRef}-v{meshState.Version}->{HostId}";

        // Enable rerouting
        RouteState = new RpcRouteState();

        // Monitor for topology changes
        _ = Task.Run(async () => {
            var computed = MeshState.State.Computed;
            // Wait until this host is removed from the mesh
            await computed.When(x => !x.HostById.ContainsKey(HostId), CancellationToken.None);

            // Drop the stale cache entry and trigger rerouting
            Cache.TryRemove(ShardRef, out _);
            RouteState.MarkChanged();
        });
    }
}

RouterFactory with Type-Based Routing

cs
public Func<ArgumentList, RpcPeerRef> RouterFactory(RpcMethodDef methodDef)
    => args => {
        if (args.Length == 0)
            return RpcPeerRef.Local;

        var arg0Type = args.GetType(0);

        // Route by HostRef
        if (arg0Type == typeof(HostRef))
            return RpcHostPeerRef.Get(args.Get<HostRef>(0));
        if (typeof(IHasHostRef).IsAssignableFrom(arg0Type))
            return RpcHostPeerRef.Get(args.Get<IHasHostRef>(0).HostRef);

        // Route by ShardRef
        if (arg0Type == typeof(ShardRef))
            return RpcShardPeerRef.Get(args.Get<ShardRef>(0));
        if (typeof(IHasShardRef).IsAssignableFrom(arg0Type))
            return RpcShardPeerRef.Get(args.Get<IHasShardRef>(0).ShardRef);

        // Route by the value of the first argument (hashed into a ShardRef)
        if (arg0Type == typeof(int))
            return RpcShardPeerRef.Get(ShardRef.New(args.Get<int>(0)));

        // Fallback: route by a hash of the first argument, whatever its type
        return RpcShardPeerRef.Get(ShardRef.New(args.GetUntyped(0)));
    };
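
The IHasHostRef / IHasShardRef checks let commands carry their own routing key, so new command types route correctly without touching the router. A minimal sketch of such a contract (the exact shape in the sample may differ):

cs
// Implemented by commands that know which shard should handle them
public interface IHasShardRef
{
    ShardRef ShardRef { get; }
}

// Hypothetical command: picked up by the IHasShardRef branch above
public sealed record Item_Update(ShardRef ShardRef, string ItemId, string Value) : IHasShardRef;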

Rerouting Flow

When a peer's route state changes, the following sequence occurs:

  1. RouteState.MarkChanged() signals the change and cancels RpcRouteState.ChangedToken.
  2. Active calls on the affected peer receive RpcRerouteException.
  3. The RPC interceptor catches the exception and waits for the next delay from ReroutingDelays.
  4. The call runs through RouterFactory again, which resolves a (possibly different) RpcPeerRef, and is re-sent to that peer.

Local Execution Mode

When a call routes to the local server (via RpcPeerRef.Local or a custom peer ref pointing to the current host), RpcLocalExecutionMode controls how local execution coordinates with rerouting signals.

This is only relevant for distributed services (RpcServiceMode.Distributed). Non-distributed services ignore this setting.

RpcLocalExecutionMode Values

| Mode | LocalExecutionAwaiter | Rerouting Check | Cancellation Token | Use Case |
|---|---|---|---|---|
| Unconstrained | Not awaited | None | Original token | Non-distributed services, simple calls |
| ConstrainedEntry | Awaited once | At entry point only | Original token | Compute services, where late reroutes are acceptable |
| Constrained | Awaited | At entry + during execution | Linked to ChangedToken | Long-running calls that must abort on reroute |

How It Works

When a call executes locally with RpcRouteState:

  1. Unconstrained: Executes immediately without coordination. Use for calls where rerouting mid-execution is acceptable.

  2. ConstrainedEntry: Waits for LocalExecutionAwaiter before starting. If the route changed while waiting, throws RpcRerouteException. Once execution starts, it won't be interrupted.

  3. Constrained: Same as ConstrainedEntry, plus the cancellation token is linked to RpcRouteState.ChangedToken. If the route changes during execution, the call is cancelled and rerouted. This is the most defensive mode.

Default Modes

The default mode depends on both the service mode and method type:

| Service Mode | Method Type | Default Mode | Rationale |
|---|---|---|---|
| Distributed | Regular methods | Constrained | Commands may have side effects; must abort if route changes |
| Distributed | Compute methods | ConstrainedEntry | Read-only; safe to complete locally even if route changes |
| Non-distributed | Any | Unconstrained | No rerouting concerns |

Why compute methods use ConstrainedEntry:

Compute methods (methods returning Task<T> on IComputeService) are read-only operations that produce cached computed values. If a compute method starts executing locally and the route changes mid-execution:

  • The client (i.e., another server requesting the value) notices the topology change and terminates the peer responsible for the call; this reroutes all of that peer's open calls and invalidates any compute method calls awaiting invalidation.
  • So at worst, a redundant computation runs on the "wrong" (old) server.

Using Constrained would merely cancel that computation, which isn't much better than the rerouting that happens anyway.

Why regular distributed methods use Constrained:

Regular methods on distributed services often perform commands with side effects (database writes, state mutations). If a route changes mid-execution:

  • The operation might complete on a server no longer responsible for the data
  • This could cause inconsistencies in a sharded system
  • Aborting and rerouting ensures the correct server handles the operation

Resolution order:

  1. Method-level [RpcMethod(LocalExecutionMode = ...)] attribute
  2. Service-level configuration via HasLocalExecutionMode()
  3. Auto-default based on service mode and method type

Configuration

Configure at the service level:

cs
services.AddRpc()
    .AddDistributed<IMyService, MyServiceImpl>()
    .HasLocalExecutionMode(RpcLocalExecutionMode.ConstrainedEntry);

Override at the method level using RpcMethodAttribute:

cs
public interface IMyService : IRpcService
{
    // Use Unconstrained for this specific fast method
    [RpcMethod(LocalExecutionMode = RpcLocalExecutionMode.Unconstrained)]
    Task<int> GetCount(string key, CancellationToken ct);

    // Use full Constrained for this long-running method
    [RpcMethod(LocalExecutionMode = RpcLocalExecutionMode.Constrained)]
    Task<Report> GenerateReport(ReportRequest request, CancellationToken ct);
}

When to Use Each Mode

  • Unconstrained: For fast, idempotent operations where executing on the "wrong" server temporarily is acceptable. Also used internally for NoWait calls and system calls.

  • ConstrainedEntry: For compute methods or operations that are safe to complete locally even if the route changes mid-execution. The result may be discarded, but no side effects occur.

  • Constrained: For operations with side effects (database writes, external API calls) that must not complete on a server that's no longer responsible for the data. This ensures consistency during topology changes.

Configuration

Rerouting Delays

Configure delays between rerouting attempts via RpcOutboundCallOptions:

cs
services.AddSingleton(_ => RpcOutboundCallOptions.Default with {
    ReroutingDelays = RetryDelaySeq.Exp(0.1, 5), // 0.1s to 5s exponential backoff
});

The delay sequence uses exponential backoff to avoid overwhelming the system during topology changes.

Host URL Resolution

When using custom RpcPeerRef types, configure how to resolve the actual host URL:

cs
services.Configure<RpcWebSocketClientOptions>(o => {
    o.HostUrlResolver = peer => {
        if (peer.Ref is IMyMeshPeerRef meshPeerRef) {
            // GetHostById is your own lookup into the current mesh topology
            var host = GetHostById(meshPeerRef.HostId);
            return host?.Url ?? "";
        }
        return peer.Ref.HostInfo;
    };
});
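
IMyMeshPeerRef above stands for your own peer-ref contract rather than a library type; a minimal sketch of what it might look like:

cs
// Hypothetical contract exposing the state the URL resolver needs
public interface IMyMeshPeerRef
{
    string HostId { get; }
}

// A custom peer ref implementing it (construction and caching omitted)
public sealed class MyMeshPeerRef : RpcPeerRef, IMyMeshPeerRef
{
    public string HostId { get; init; } = "";
}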

Connection Kind Detection

Detect whether a peer reference points to a local or remote peer:

cs
services.Configure<RpcPeerOptions>(o => {
    o.ConnectionKindDetector = peerRef => {
        // currentHostId = this process' own host ID (resolved from your mesh state)
        if (peerRef is MyShardPeerRef shardPeerRef)
            return shardPeerRef.HostId == currentHostId
                ? RpcPeerConnectionKind.Local
                : RpcPeerConnectionKind.Remote;

        return peerRef.ConnectionKind;
    };
});

Best Practices

  1. Cache PeerRefs – Create and reuse RpcPeerRef instances for the same routing key. The MeshRpc sample uses ConcurrentDictionary with LazySlim for thread-safe caching.

  2. Use stable hash functions – Use GetXxHash3() or similar hashes that don't change between runs. string.GetHashCode() is randomized per process, so it can't be used for routing.

  3. Handle topology changes gracefully – Use RpcRouteState to automatically reroute when servers come and go.

  4. Monitor rerouting – Rerouting is logged at Warning level. High rerouting rates may indicate topology instability.

  5. Consider local execution – Return RpcPeerRef.Local when the call can be handled by the current server to avoid network overhead.