You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enable best-effort sticky routing via the `X-Session-Id` request header. Requests carrying the same header value land on the same pod, improving KV cache reuse for agentic workloads. Requests without the header are routed at random. Affinity is NOT durable: scaling, rollouts, restarts, or readiness-probe transitions will remap sessions to different pods. Do not use for irreplaceable in-pod state.
fromplatform_api_python_client.models.get_inference_v3_deployment_responseimportGetInferenceV3DeploymentResponse# TODO update the JSON string belowjson="{}"# create an instance of GetInferenceV3DeploymentResponse from a JSON stringget_inference_v3_deployment_response_instance=GetInferenceV3DeploymentResponse.from_json(json)
# print the JSON string representation of the objectprint(GetInferenceV3DeploymentResponse.to_json())
# convert the object into a dictget_inference_v3_deployment_response_dict=get_inference_v3_deployment_response_instance.to_dict()
# create an instance of GetInferenceV3DeploymentResponse from a dictget_inference_v3_deployment_response_from_dict=GetInferenceV3DeploymentResponse.from_dict(get_inference_v3_deployment_response_dict)