You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enable best-effort sticky routing via the `X-Session-Id` request header. Requests carrying the same header value land on the same pod, improving KV cache reuse for agentic workloads. Requests without the header are routed at random. Affinity is NOT durable: scaling, rollouts, restarts, or readiness-probe transitions will remap sessions to different pods. Do not use for irreplaceable in-pod state.
[optional] [default to False]
Example
fromplatform_api_python_client.models.get_c_serve_v3_deployment_responseimportGetCServeV3DeploymentResponse# TODO update the JSON string belowjson="{}"# create an instance of GetCServeV3DeploymentResponse from a JSON stringget_c_serve_v3_deployment_response_instance=GetCServeV3DeploymentResponse.from_json(json)
# print the JSON string representation of the objectprint(GetCServeV3DeploymentResponse.to_json())
# convert the object into a dictget_c_serve_v3_deployment_response_dict=get_c_serve_v3_deployment_response_instance.to_dict()
# create an instance of GetCServeV3DeploymentResponse from a dictget_c_serve_v3_deployment_response_from_dict=GetCServeV3DeploymentResponse.from_dict(get_c_serve_v3_deployment_response_dict)