I have a running cluster with the following version: GitVersion:"v1.24.2".
What is the fix for the CNI changes in this version on running clusters that now fail with errors about the --network-plugin flag? I can't find any way to repair a running cluster. Two of my clusters are down because of this and I can't figure out how to fix them.
I have tried changing the file /var/lib/kubelet/kubeadm-flags.env to the following, but it still doesn't help:
KUBELET_KUBEADM_ARGS="--pod-infra-container-image=k8s.gcr.io/pause:3.5"
KUBELET_NETWORK_ARGS=''
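For reference, here is a minimal sketch of stripping the removed flags from that file instead of editing it by hand. The function name `strip_removed_flags` is hypothetical, and the flag list (--network-plugin, --cni-bin-dir, --cni-conf-dir) is an assumption based on the flags that were dropped together with dockershim in 1.24; adjust it to whatever the kubelet actually reports as unknown.

```shell
# Hypothetical helper: reads kubeadm-flags.env style text on stdin and
# writes it to stdout with the assumed-removed flags (and any whitespace
# that follows them) deleted. All other flags are left untouched.
strip_removed_flags() {
  sed -E 's/--(network-plugin|cni-bin-dir|cni-conf-dir)=[^" ]*[[:space:]]*//g'
}

# Possible usage (as root, after taking a backup):
#   cp /var/lib/kubelet/kubeadm-flags.env /var/lib/kubelet/kubeadm-flags.env.bak
#   strip_removed_flags < /var/lib/kubelet/kubeadm-flags.env.bak > /var/lib/kubelet/kubeadm-flags.env
#   systemctl restart kubelet
```

This only removes flags; it does not add anything the kubelet may still need, such as a container runtime endpoint.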
Related to the issue https://github.com/kubernetes/website/issues/33640
The documentation has been updated and merged, but what about clusters that are already running? What can be done?
Additional Details
OK, maybe I was not clear in my earlier question and explanation.
We had an earlier version of Kubernetes which was upgraded to 1.24.2, and we see the same behaviour across 2 clusters. When I say clusters, these are for now 2 virtual machines, each behaving as a cluster of its own, hosted on-premise. We deploy containers that connect to Azure as self-hosted app gateways on-premise.
Issue: after upgrading to the current version I see the errors below in the kubelet logs, and the kubelet doesn't seem to be running or active.
kubectl get pods
The connection to the server localhost:8080 was refused - did you specify the right host or port?
Errors in the kubelet logs
kubelet[18280]: Error: failed to parse kubelet flag: unknown flag: --network-plugin
systemd[1]: kubelet.service: main process exited, code=exited, status=1/FAILURE
I tried removing the --network-plugin flag from /var/lib/kubelet/kubeadm-flags.env, but no luck. After restarting the kubelet I now also see the error below in the kubelet logs:
{ 0 }. Err: connection error: desc = "transport: Error while dialing dial unix: missing address". Reconnecting...
Jul 01 15:14:49 kubelet[10297]: Error: failed to run Kubelet: unable to determine runtime API version: rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing dial unix: missing address"
kubelet[10252]: --runtime-request-timeout duration Timeout of all runtime requests except long running request - pull, logs, exec and attach. When timeout exceeded, kubelet will cancel the request, throw out an error and retry later. (default 2m0s) (DEPRECATED: This parameter should be set via the config file specified by the Kubelet's --config flag. See https://kubernetes.io/docs/tasks/administer-cluster/kubelet-config-file/ for more information.)
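The "dial unix: missing address" message suggests the kubelet was started with an empty container runtime endpoint, which is plausible here because the edited kubeadm-flags.env above no longer carries a --container-runtime-endpoint flag and 1.24 no longer ships dockershim. A minimal sketch for checking which CRI socket, if any, exists on the node follows. The function name `find_cri_socket` is hypothetical, and the candidate paths in the usage comment are common defaults for containerd, CRI-O, and cri-dockerd that are assumed here, not taken from the logs.

```shell
# Hypothetical helper: prints the first argument that is an existing
# unix socket, formatted as a unix:// endpoint, and returns 0.
# Returns 1 and prints nothing if none of the candidates is a socket.
find_cri_socket() {
  for sock in "$@"; do
    if [ -S "$sock" ]; then
      printf 'unix://%s\n' "$sock"
      return 0
    fi
  done
  return 1
}

# Possible usage (candidate paths are assumptions):
#   find_cri_socket /run/containerd/containerd.sock \
#                   /var/run/crio/crio.sock \
#                   /var/run/cri-dockerd.sock
```

If a socket is found, its endpoint could be passed to the kubelet with --container-runtime-endpoint inside KUBELET_KUBEADM_ARGS. If none is found, the node probably relied on Docker through dockershim and needs a CRI-compatible runtime installed before the kubelet can start.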
The GitHub issue linked above discusses the same problem, but mostly in terms of the documentation changes for flags that are no longer used. Is there a fix available for running clusters that have already been upgraded to the latest version?
systemctl cat kubelet.service?
ps auwwwx | grep kubelet?
You seem to be asking where that command line flag is coming from, but we're not at your computer to know.
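Along the lines of those two commands, a small sketch for finding which file still sets the flag. The function name `find_flag_sources` is hypothetical, and the paths in the usage comment are the usual kubeadm and systemd locations, assumed here rather than confirmed for this machine.

```shell
# Hypothetical helper: find_flag_sources FLAG PATH...
# Prints the name of every file under the given paths that contains
# FLAG as a literal string. Paths that do not exist are skipped.
find_flag_sources() {
  flag=$1
  shift
  for path in "$@"; do
    [ -e "$path" ] || continue
    # -r recurse, -l list file names only, -F treat the flag literally
    grep -rlF -e "$flag" "$path" || true
  done
  return 0
}

# Possible usage (paths are assumptions):
#   find_flag_sources --network-plugin \
#       /var/lib/kubelet/kubeadm-flags.env \
#       /etc/systemd/system/kubelet.service.d \
#       /usr/lib/systemd/system/kubelet.service.d \
#       /etc/default/kubelet /etc/sysconfig/kubelet
```

Any file it lists is a candidate for editing; after changing a systemd drop-in, run systemctl daemon-reload before restarting the kubelet.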