PreferredMaxLatency - Go SDK

PreferredMaxLatency type definition

The Go SDK and docs are currently in beta. Report issues on GitHub.

Preferred maximum latency, in seconds. The value can be a single number, which applies to p50, or an object with percentile-specific cutoffs. Endpoints above the threshold(s) may still be used, but are deprioritized in routing. When fallback models are configured, a fallback model that meets the threshold may be used instead of the primary model.
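To make "deprioritized, not excluded" concrete, here is a minimal sketch of one way such an ordering could work. It is not the service's routing logic, which this page does not document. The `Endpoint` type, its fields, and the `deprioritize` helper are hypothetical names introduced only for illustration.

```go
package main

import (
	"fmt"
	"sort"
)

// Endpoint is a hypothetical record used only for this illustration;
// it is not an SDK type.
type Endpoint struct {
	Name       string
	P50Seconds float64
}

// deprioritize returns a copy of endpoints in which those at or under maxP50
// come first. Endpoints above the cutoff are kept and moved to the back.
// The stable sort preserves the original order within each group.
func deprioritize(endpoints []Endpoint, maxP50 float64) []Endpoint {
	out := append([]Endpoint(nil), endpoints...)
	sort.SliceStable(out, func(i, j int) bool {
		return out[i].P50Seconds <= maxP50 && out[j].P50Seconds > maxP50
	})
	return out
}

func main() {
	eps := []Endpoint{{"a", 4.0}, {"b", 1.2}, {"c", 6.5}, {"d", 0.8}}
	for _, e := range deprioritize(eps, 2.0) {
		fmt.Println(e.Name, e.P50Seconds)
	}
}
```

With a cutoff of 2.0 seconds, endpoints `b` and `d` are listed ahead of `a` and `c`, but all four remain candidates.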

Supported Types

preferredMaxLatency := components.CreatePreferredMaxLatencyNumber(float64(/* value here */))

PercentileLatencyCutoffs

preferredMaxLatency := components.CreatePreferredMaxLatencyPercentileLatencyCutoffs(components.PercentileLatencyCutoffs{/* values here */})

preferredMaxLatency := components.CreatePreferredMaxLatencyAny(any(/* value here */))

Union Discrimination

Use the Type field to determine which variant is active, then access the corresponding field:

switch preferredMaxLatency.Type {
case components.PreferredMaxLatencyTypeNumber:
	// preferredMaxLatency.Number is populated
case components.PreferredMaxLatencyTypePercentileLatencyCutoffs:
	// preferredMaxLatency.PercentileLatencyCutoffs is populated
case components.PreferredMaxLatencyTypeAny:
	// preferredMaxLatency.Any is populated
}
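The discrimination pattern above can be tried without the SDK using local stand-in types. The sketch below mirrors the names on this page, but the type definitions are assumptions: the pointer fields, the string values of the type constants, and the `P50`/`P90` fields of `PercentileLatencyCutoffs` are illustrative and may differ from the real `components` package. The `describe` helper is hypothetical.

```go
package main

import "fmt"

// Stand-in for the SDK's discriminator type. The string values are assumed.
type PreferredMaxLatencyType string

const (
	PreferredMaxLatencyTypeNumber                   PreferredMaxLatencyType = "number"
	PreferredMaxLatencyTypePercentileLatencyCutoffs PreferredMaxLatencyType = "PercentileLatencyCutoffs"
	PreferredMaxLatencyTypeAny                      PreferredMaxLatencyType = "any"
)

// Stand-in struct; the P50 and P90 fields are assumed for illustration.
type PercentileLatencyCutoffs struct {
	P50 *float64
	P90 *float64
}

// Stand-in union: exactly one variant field is expected to be set,
// and Type records which one.
type PreferredMaxLatency struct {
	Number                   *float64
	PercentileLatencyCutoffs *PercentileLatencyCutoffs
	Any                      any
	Type                     PreferredMaxLatencyType
}

func CreatePreferredMaxLatencyNumber(n float64) PreferredMaxLatency {
	return PreferredMaxLatency{Number: &n, Type: PreferredMaxLatencyTypeNumber}
}

func CreatePreferredMaxLatencyPercentileLatencyCutoffs(c PercentileLatencyCutoffs) PreferredMaxLatency {
	return PreferredMaxLatency{
		PercentileLatencyCutoffs: &c,
		Type:                     PreferredMaxLatencyTypePercentileLatencyCutoffs,
	}
}

// describe (hypothetical helper) switches on Type and reads only the
// matching field, guarding against nil pointers before dereferencing.
func describe(p PreferredMaxLatency) string {
	switch p.Type {
	case PreferredMaxLatencyTypeNumber:
		if p.Number == nil {
			return "number variant without a value"
		}
		return fmt.Sprintf("p50 cutoff of %g seconds", *p.Number)
	case PreferredMaxLatencyTypePercentileLatencyCutoffs:
		c := p.PercentileLatencyCutoffs
		if c == nil {
			return "cutoffs variant without a value"
		}
		s := "percentile cutoffs:"
		if c.P50 != nil {
			s += fmt.Sprintf(" p50=%gs", *c.P50)
		}
		if c.P90 != nil {
			s += fmt.Sprintf(" p90=%gs", *c.P90)
		}
		return s
	case PreferredMaxLatencyTypeAny:
		return fmt.Sprintf("untyped value: %v", p.Any)
	default:
		return "no variant set"
	}
}

func main() {
	fmt.Println(describe(CreatePreferredMaxLatencyNumber(2.5)))
	p90 := 5.0
	fmt.Println(describe(CreatePreferredMaxLatencyPercentileLatencyCutoffs(
		PercentileLatencyCutoffs{P90: &p90})))
	fmt.Println(describe(PreferredMaxLatency{}))
}
```

Checking for nil before dereferencing is a defensive choice for this sketch; it also gives the zero value of the union a well-defined result through the `default` branch.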