PublicEndpoint - Go SDK
PublicEndpoint type definition
The Go SDK and docs are currently in beta. Report issues on GitHub.
Information about a specific model endpoint
Fields
| Field | Type | Required | Description | Example |
|---|---|---|---|---|
| Name | string | ✔️ | N/A | |
| ModelID | string | ✔️ | The unique identifier for the model (permaslug) | openai/gpt-4 |
| ModelName | string | ✔️ | N/A | |
| ContextLength | int64 | ✔️ | N/A | |
| Pricing | components.Pricing | ✔️ | N/A | |
| ProviderName | components.ProviderName | ✔️ | N/A | OpenAI |
| Tag | string | ✔️ | N/A | |
| Quantization | *components.PublicEndpointQuantization | ✔️ | N/A | fp16 |
| MaxCompletionTokens | int64 | ✔️ | N/A | |
| MaxPromptTokens | int64 | ✔️ | N/A | |
| SupportedParameters | []components.Parameter | ✔️ | N/A | |
| Status | *components.EndpointStatus | ➖ | N/A | 0 |
| UptimeLast30m | float64 | ✔️ | N/A | |
| UptimeLast5m | float64 | ✔️ | Uptime percentage over the last 5 minutes, calculated as successful requests / (successful + error requests) * 100. Rate-limited requests are excluded. Returns null if insufficient data. | |
| UptimeLast1d | float64 | ✔️ | Uptime percentage over the last 1 day, calculated as successful requests / (successful + error requests) * 100. Rate-limited requests are excluded. Returns null if insufficient data. | |
| SupportsImplicitCaching | bool | ✔️ | N/A | |
| LatencyLast30m | *components.PercentileStats | ✔️ | Latency percentiles in milliseconds over the last 30 minutes. Latency measures time to first token. Only visible when authenticated with an API key or cookie; returns null for unauthenticated requests. | {"p50": 25.5, "p75": 35.2, "p90": 48.7, "p99": 85.3} |
| ThroughputLast30m | *components.PercentileStats | ✔️ | N/A | {"p50": 25.5, "p75": 35.2, "p90": 48.7, "p99": 85.3} |
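
The uptime formula given for `UptimeLast5m` and `UptimeLast1d` can be illustrated with a minimal sketch. The helper `uptimePercent` is hypothetical and not part of the SDK; it assumes that rate-limited requests have already been excluded from both counts, and that "insufficient data" means no counted requests at all (the service's actual threshold is not documented here).

```go
package main

import "fmt"

// uptimePercent is a hypothetical helper (not an SDK function) that mirrors
// the documented formula: successful / (successful + error) * 100.
// Rate-limited requests are assumed to be excluded from both counts.
// It returns nil when there are no counted requests, which is this sketch's
// stand-in for the documented "null if insufficient data" case.
func uptimePercent(successes, failures int64) *float64 {
	total := successes + failures
	if total == 0 {
		return nil
	}
	pct := float64(successes) / float64(total) * 100
	return &pct
}

func main() {
	// Three successful requests and one error in the window.
	if pct := uptimePercent(3, 1); pct != nil {
		fmt.Printf("uptime: %.1f%%\n", *pct)
	}

	// No counted requests: treat the value as unavailable rather than 0%.
	if uptimePercent(0, 0) == nil {
		fmt.Println("uptime: insufficient data")
	}
}
```

Returning a pointer keeps "no data" distinct from a genuine 0% uptime, which a plain `float64` cannot express.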