Enable best-effort sticky routing via the `X-Session-Id` request header. Requests carrying the same header value land on the same pod, improving KV cache reuse for agentic workloads. Requests without the header are routed at random. Affinity is NOT durable: scaling, rollouts, restarts, or readiness-probe transitions will remap sessions to different pods. Do not use for irreplaceable in-pod state.
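As a rough client-side illustration of the header described above, the sketch below attaches one `X-Session-Id` value to every request in an agent session. The endpoint URL, the payload shape, and the helper names `new_session_id` and `build_request` are assumptions made for this example and are not part of the platform API. Only the header name comes from the description. The requests are built but not sent.

```python
import json
import urllib.request
import uuid

# Hypothetical endpoint: substitute the URL of your own deployment.
ENDPOINT = "https://example.invalid/v1/completions"


def new_session_id():
    """Return an opaque ID to reuse for every request in one agent session."""
    return uuid.uuid4().hex


def build_request(payload, session_id=None):
    """Build (but do not send) a POST request, optionally with sticky routing."""
    headers = {"Content-Type": "application/json"}
    if session_id is not None:
        # Same header value on every request -> best-effort routing to the same pod.
        headers["X-Session-Id"] = session_id
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(ENDPOINT, data=body, headers=headers, method="POST")


# Two turns of the same session share one ID; the payload shape is illustrative.
session_id = new_session_id()
first_turn = build_request({"prompt": "step 1"}, session_id)
second_turn = build_request({"prompt": "step 2"}, session_id)
```

Because affinity is best-effort, a client built this way should still tolerate a cold cache on any request: the session ID is a performance hint, not a guarantee that in-pod state survives.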

    from platform_api_python_client.models.create_inference_v3_deployment_request import CreateInferenceV3DeploymentRequest

    # TODO update the JSON string below
    json_str = "{}"
    # create an instance of CreateInferenceV3DeploymentRequest from a JSON string
    create_inference_v3_deployment_request_instance = CreateInferenceV3DeploymentRequest.from_json(json_str)
    # print the JSON string representation of the object
    # (to_json is an instance method, so call it on the instance, not the class)
    print(create_inference_v3_deployment_request_instance.to_json())

    # convert the object into a dict
    create_inference_v3_deployment_request_dict = create_inference_v3_deployment_request_instance.to_dict()
    # create an instance of CreateInferenceV3DeploymentRequest from a dict
    create_inference_v3_deployment_request_from_dict = CreateInferenceV3DeploymentRequest.from_dict(create_inference_v3_deployment_request_dict)