Fixing - gateway: Invalid : error cause: gceSync: generic::invalid_argument

 

Bump into this error here and this is because to deploy inference gateway, we need to configure subnet proxy as defined here: https://cloud.google.com/kubernetes-engine/docs/how-to/deploying-gateways#configure_a_proxy-only_subnet

So basically you need to create a new proxy-subnet in a specific region say australia-southeast2. Once you have done that you can create gateway and httproute resources. 

When you create your gke in autopilot mode, the gateway API is enabled by default. But you still need to create a subnet. 

Gateway: Invalid : error cause: gceSync: generic::invalid_argument: Insert: Invalid value for field 'resource.target': 'regions/australia-southeast2/targetHttpProxies/gkegw1-ctc9-default-inference-gateway-3ehzmvfui5bd'. An active proxy-only subnetwork is required in the same region and VPC as the forwarding rule. ===== Route/default/gateway-route: (1.) error cause: gceSync: generic::invalid_argument: Insert: Invalid value for field 'resource.target': 'regions/australia-southeast2/targetHttpProxies/gkegw1-ctc9-default-inference-gateway-3ehzmvfui5bd'. An active proxy-only subnetwork is required in the same region and VPC as the forwarding rule., (2.) error cause: reference-not-found: Error GWCER104: HTTPRoute "default/gateway-route" is misconfigured, err: failed to validate inference pool extension ref: InferencePool {default/vllm-llama3.1-8b-instruct-ips-7b6e0b8f /v1, Resource=services} not found.





Comments

Popular posts from this blog

gemini cli getting file not defined error

NodeJS: Error: spawn EINVAL in window for node version 20.20 and 18.20

vllm : Failed to infer device type