
EndpointSlice with Pods without an existing Node #110639

Merged
merged 3 commits into kubernetes:master on Jun 22, 2022

Conversation

@aojea (Member) commented Jun 17, 2022

/kind bug
/kind cleanup
/kind documentation

What this PR does / why we need it:

endpointslices: node missing on Pod scenario

When a Pod references a Node that doesn't exist in the local
informer cache, the previous behavior was to return an error so the
sync would be retried later, and to stop processing.
However, this can cause scenarios where a missing node leaves a
Slice stuck: it can neither reflect other changes nor be created.
It also doesn't respect the publishNotReadyAddresses option on
Services, which considers it acceptable to publish pod addresses
that are known to not be ready.

The new behavior keeps retrying the problematic Service, but it
keeps processing the updates, reflecting the current state in the
EndpointSlice. If publishNotReadyAddresses is set, a missing
node on a Pod is not treated as an error.
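
A minimal sketch of that decision, assuming a hypothetical helper and NodeGetter interface rather than the controller's actual internals:

	package endpointslice

	import (
		"fmt"

		v1 "k8s.io/api/core/v1"
	)

	// NodeGetter is an illustrative stand-in for the node informer lister.
	type NodeGetter interface {
		Get(name string) (*v1.Node, error)
	}

	// nodeForPod sketches the new behavior: a missing Node is only an error
	// when the Service does not publish not-ready addresses.
	func nodeForPod(pod *v1.Pod, svc *v1.Service, nodes NodeGetter) (*v1.Node, error) {
		node, err := nodes.Get(pod.Spec.NodeName)
		if err != nil {
			if !svc.Spec.PublishNotReadyAddresses {
				// Surface the error so the Service keeps being retried,
				// while the caller continues processing the other pods.
				return nil, fmt.Errorf("node %q for pod %q not found: %w",
					pod.Spec.NodeName, pod.Name, err)
			}
			// publishNotReadyAddresses is set: publish the address anyway.
			node = nil
		}
		return node, nil
	}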

EndpointSlices with Pods referencing Nodes that don't exist couldn't be created or updated.
The behavior of the EndpointSlice controller has been modified to update the EndpointSlice without the Pods that reference non-existent Nodes, and to keep retrying until all Pods reference existing Nodes.
However, if service.Spec.PublishNotReadyAddresses is set, all the Pods are published without retrying.
Fixed EndpointSlice metrics to correctly reflect the number of desired EndpointSlices when no endpoints are present.

Fixes: #107927

@k8s-ci-robot added the release-note, kind/bug, size/L, kind/cleanup, kind/documentation, cncf-cla: yes, do-not-merge/needs-sig, and needs-triage labels Jun 17, 2022
@k8s-ci-robot (Contributor)

@aojea: This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot added the needs-priority label Jun 17, 2022
@aojea changed the title from "Slice no node" to "EndpointSlice with Pods without an existing Node" Jun 17, 2022
@k8s-ci-robot (Contributor)

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: aojea

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot added the approved, sig/apps, and sig/network labels and removed the do-not-merge/needs-sig label Jun 17, 2022
@aojea (Member, Author) commented Jun 17, 2022

/assign @thockin @robscott

@aojea (Member, Author) commented Jun 17, 2022

/priority important-soon

@k8s-ci-robot added the priority/important-soon label and removed the needs-priority label Jun 17, 2022
@@ -89,7 +89,13 @@ func (spc *ServicePortCache) totals(maxEndpointsPerSlice int) (int, int, int) {
 	for _, eInfo := range spc.items {
 		endpoints += eInfo.Endpoints
 		actualSlices += eInfo.Slices
-		desiredSlices += numDesiredSlices(eInfo.Endpoints, maxEndpointsPerSlice)
+		if eInfo.Endpoints > 0 {
+			desiredSlices += numDesiredSlices(eInfo.Endpoints, maxEndpointsPerSlice)
+		}
Member

@aojea Not suggesting a change necessarily, but would it be worth updating numDesiredSlices() to check for numEndpoints==0 and return 0, rather than what it does right now and return 1? Then you can just special-case desiredSlices=0 at the bottom but not have to special-case 0 in the loop.
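
A sketch of that suggestion (hypothetical; the change merged in this PR keeps the special case in the caller's loop instead, and the ceiling-division body is an assumption about the existing helper's math):

	package metrics

	import "math"

	// As suggested: zero endpoints need zero desired slices, so callers
	// special-case the "always one placeholder slice" total exactly once.
	func numDesiredSlices(numEndpoints, maxEndpointsPerSlice int) int {
		if numEndpoints == 0 {
			return 0
		}
		return int(math.Ceil(float64(numEndpoints) / float64(maxEndpointsPerSlice)))
	}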

Member

Also, do we need to change endpointslicemirroring at all?

@aojea (Member, Author) commented Jun 22, 2022

yeah, I think that is a matter of where we want to special-case "there is always one slice, even if there are no endpoints at all" ... I took this approach because numDesiredSlices is also used in

https://github.com/kubernetes/kubernetes/blob/ee08c977fd2e44ceb10506290fbaaf6b242d6a99/pkg/controller/endpointslicemirroring/metrics/cache.go#L139

@aojea (Member, Author)

hmm, seems you are right

@aojea (Member, Author)

the endpointslicemirroring controller seems to not use a placeholder, and already returns 0 slices for 0 endpoints, but better to have @robscott confirm

There is always a placeholder slice.

The ServicePortCache logic always counted one EndpointSlice per
Endpoint, but if there are multiple empty Endpoints we use a single
placeholder slice, not multiple placeholder slices.
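
Put together with the diff above, the corrected accounting sketches out roughly like this (a reconstruction from the visible hunk and commit message, not the verbatim merged code):

	func (spc *ServicePortCache) totals(maxEndpointsPerSlice int) (int, int, int) {
		var actualSlices, desiredSlices, endpoints int
		for _, eInfo := range spc.items {
			endpoints += eInfo.Endpoints
			actualSlices += eInfo.Slices
			// Only non-empty Endpoints contribute to the desired count.
			if eInfo.Endpoints > 0 {
				desiredSlices += numDesiredSlices(eInfo.Endpoints, maxEndpointsPerSlice)
			}
		}
		// There is always one placeholder slice, even with zero endpoints.
		if desiredSlices == 0 {
			desiredSlices = 1
		}
		return actualSlices, desiredSlices, endpoints
	}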
@dcbw (Member) commented Jun 22, 2022

/lgtm

@k8s-ci-robot added the lgtm label Jun 22, 2022
@dcbw (Member) commented Jun 22, 2022

@aojea the conformance test failure looks odd, because it's about EPS. However, it doesn't look related to EndpointSlice at all; the failure is that the test pods don't get their IPs within a 3m timeout. TLDR the image pulls are taking 2+ minutes for the pod:

Jun 22 08:09:18 kind-worker kubelet[280]: I0622 08:09:18.943215     280 status_manager.go:685] "Patch status for pod" pod="endpointslice-805/pod1" patch="{\"metadata\":{\"uid\":\"61a94b34-adbb-4fc7-8429-8bec0903d5ed\"},\"status\":{\"$setElementOrder/conditions\":[{\"type\":\"Initialized\"},{\"type\":\"Ready\"},{\"type\":\"ContainersReady\"},{\"type\":\"PodScheduled\"}],\"conditions\":[{\"lastProbeTime\":null,\"lastTransitionTime\":\"2022-06-22T08:09:11Z\",\"status\":\"True\",\"type\":\"Initialized\"},{\"lastProbeTime\":null,\"lastTransitionTime\":\"2022-06-22T08:09:11Z\",\"message\":\"containers with unready status: [container1]\",\"reason\":\"ContainersNotReady\",\"status\":\"False\",\"type\":\"Ready\"},{\"lastProbeTime\":null,\"lastTransitionTime\":\"2022-06-22T08:09:11Z\",\"message\":\"containers with unready status: [container1]\",\"reason\":\"ContainersNotReady\",\"status\":\"False\",\"type\":\"ContainersReady\"}],\"containerStatuses\":[{\"image\":\"registry.k8s.io/e2e-test-images/nginx:1.14-2\",\"imageID\":\"\",\"lastState\":{},\"name\":\"container1\",\"ready\":false,\"restartCount\":0,\"started\":false,\"state\":{\"waiting\":{\"reason\":\"ContainerCreating\"}}}],\"hostIP\":\"172.18.0.4\",\"startTime\":\"2022-06-22T08:09:11Z\"}}"
Jun 22 08:12:05 kind-worker kubelet[280]: I0622 08:12:05.273805     280 event.go:294] "Event occurred" object="endpointslice-805/pod1" fieldPath="spec.containers{container1}" kind="Pod" apiVersion="v1" type="Normal" reason="Pulled" message="Successfully pulled image \"registry.k8s.io/e2e-test-images/nginx:1.14-2\" in 2m51.156327184s"

and a few seconds after that the e2e test times out waiting for the pods to be up. Well yeah, if the image pull takes 2m51s then I'm not surprised the e2e test's 3m timeout is exceeded.

@dcbw (Member) commented Jun 22, 2022

/retest

@k8s-ci-robot merged commit ae35371 into kubernetes:master Jun 22, 2022
@k8s-ci-robot added this to the v1.25 milestone Jun 22, 2022
k8s-ci-robot added a commit that referenced this pull request Jul 7, 2022
…0639-upstream-release-1.24

Automated cherry pick of #110639: fix a bug on endpointslices tests comparing the wrong
dgrisonnet pushed a commit to dgrisonnet/kubernetes that referenced this pull request Sep 2, 2022
…ce_no_node

EndpointSlice with Pods without an existing Node
Labels
approved - Indicates a PR has been approved by an approver from all required OWNERS files.
cncf-cla: yes - Indicates the PR's author has signed the CNCF CLA.
kind/bug - Categorizes issue or PR as related to a bug.
kind/cleanup - Categorizes issue or PR as related to cleaning up code, process, or technical debt.
kind/documentation - Categorizes issue or PR as related to documentation.
lgtm - "Looks good to me", indicates that a PR is ready to be merged.
needs-triage - Indicates an issue or PR lacks a `triage/foo` label and requires one.
priority/important-soon - Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
release-note - Denotes a PR that will be considered when it comes time to generate release notes.
sig/apps - Categorizes an issue or PR as relevant to SIG Apps.
sig/network - Categorizes an issue or PR as relevant to SIG Network.
size/L - Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

EndpointSlice controller sync fails when Node is not found
5 participants