| Interface | Description |
|---|---|
| ServingEndpointsDataPlaneService |
Serving endpoints DataPlane provides a set of operations to interact with data plane endpoints
for Serving endpoints service.
|
| ServingEndpointsService |
The Serving Endpoints API allows you to create, update, and delete model serving endpoints.
|
| Enum | Description |
|---|---|
| AiGatewayGuardrailPiiBehaviorBehavior | |
| AiGatewayRateLimitKey | |
| AiGatewayRateLimitRenewalPeriod | |
| AmazonBedrockConfigBedrockProvider | |
| ChatMessageRole |
The role of the message.
|
| EmbeddingsV1ResponseEmbeddingElementObject |
This will always be 'embedding'.
|
| EndpointStateConfigUpdate | |
| EndpointStateReady | |
| ExternalFunctionRequestHttpMethod | |
| ExternalModelProvider | |
| QueryEndpointResponseObject |
The type of object returned by the __external/foundation model__ serving endpoint, one of
[text_completion, chat.completion, list (of embeddings)].
|
| RateLimitKey | |
| RateLimitRenewalPeriod | |
| ServedModelInputWorkloadType |
Please keep this in sync with with workload types in InferenceEndpointEntities.scala
|
| ServedModelStateDeployment | |
| ServingEndpointDetailedPermissionLevel | |
| ServingEndpointPermissionLevel |
Permission level
|
| ServingModelWorkloadType |
Please keep this in sync with with workload types in InferenceEndpointEntities.scala
|
Copyright © 2025. All rights reserved.