
KEP-4753: Expose ownerReferences via downward API #4754

Open

ArangoGutierrez wants to merge 1 commit into master from the KEP-4753 branch

Conversation

@k8s-ci-robot added the label cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA) on Jul 9, 2024
@k8s-ci-robot added the labels kind/kep (Categorizes KEP tracking issues and PRs modifying the KEP directory), sig/node (Categorizes an issue or PR as relevant to SIG Node), and size/XL (Denotes a PR that changes 500-999 lines, ignoring generated files) on Jul 9, 2024
@ArangoGutierrez
Author

/cc @thockin

@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: ArangoGutierrez
Once this PR has been reviewed and has the lgtm label, please assign mrunalp, wojtek-t for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ArangoGutierrez ArangoGutierrez force-pushed the KEP-4753 branch 3 times, most recently from 6ae4faf to b57c505 Compare July 9, 2024 17:11

## Summary

Today when a pod wants to pass its 'onwerReferences' to a new object it manage (a ConfigMap, a Secret, etc), it needs to do it call the API server to get it's own ownerReferences and then pass it to the new object.
Contributor

Suggested change
Today when a pod wants to pass its 'onwerReferences' to a new object it manage (a ConfigMap, a Secret, etc), it needs to do it call the API server to get it's own ownerReferences and then pass it to the new object.
Today when a pod wants to pass its 'ownerReferences' to a new object it manage (a ConfigMap, a Secret, etc), it needs to do it call the API server to get it's own ownerReferences and then pass it to the new object.


- <test>: <link to test coverage>

### Graduation Criteria
Contributor

Given that this is not an API change, you may be able to skip the alpha stage. WDYT @thockin?

Member

It is an API change - a value that used to be rejected is now allowed, so in the event of a rollback, we need to know it is still handled. Ergo "alpha" is required.


## Motivation

Deployments and ReplicaSets can orphan and adopt pods dynamically. Any additional information we provide can change over time. On day 1 a pod named "foobar-deadbeef93-xyz76" is owned by replicaset "foobar-deadbeef93", which is owned by deployment "foobar", but after a node restart the pod name could change to "foobar-deadbeef93-abc12", leaving the object created by "foobar-deadbeef93-xyz76" orphaned and triggering unwanted behavior. Enabling the pod to pass its ownerReferences to the new object it manages will prevent objects from being orphaned by pods that are owned by a higher-level object.
Member

I'm afraid this motivation doesn't make much sense to me. I think you've got some context in your head that is not in the text.

Why does a pod need to know its own ownerRefs? You seem to be implying that the pod will create some other object (e.g. a ConfigMap) and, rather than parent that object itself, it wants to create the ConfigMap and set its ownerRef to the pod's ownerRef? Why?

It seems plausible to me that a pod would be able to know its own ownerRefs, but that use-case eludes me.

Help me understand what a pod will actually DO with this information?

Author

how about now?


## Motivation

DaemonSets and ReplicaSets (via Deployments) can dynamically orphan and adopt pods. Consider a scenario where, on day 1, a pod named foobar-deadbeef93-xyz76 is owned by the ReplicaSet foobar-deadbeef93, which in turn is managed by the Deployment foobar. After a node restart, the pod might be replaced by foobar-deadbeef93-abc12, orphaning the original pod foobar-deadbeef93-xyz76 and potentially causing unwanted behavior. This issue arises when pods create resources like ConfigMaps or CustomResources and these resources are not correctly reassigned to the new pod, leading to orphaned objects. Ensuring that pods can pass their ownerReferences to new objects they manage would prevent this issue, as the ownership hierarchy would be preserved even when pods are recreated.
Member

> After a node restart, the pod might be replaced by foobar-deadbeef93-abc12, orphaning the original pod foobar-deadbeef93-xyz76 and potentially causing unwanted behavior

Can this really happen? Is this not a bug in the ReplicaSet?

Member

This example is confusing because DaemonSet and Deployment are very different - a pod's parent is never a Deployment. I would retool this to JUST talk about DaemonSet

If I am a daemonset pod "foobar-x" on node "x", and I create a ConfigMap that is associated with node "x", then it would be incorrect to set the ownerRef to "foobar-x" - I should set it to the daemonset "foobar". If you set it to the pod, then you can race with garbage-collection.

For a ReplicaSet it's bad because Deployments can replace ReplicaSets, so the proper parent should be the Deployment, which is 2 levels up and is NOT covered by this KEP, right?

```yaml
uid: 6be1683f-da9c-4f68-9440-82376231cfa6
```

If `ownerReferences` is empty or not set, the file will be empty.
Member

What happens if ownerReferences is updated? Is it reflected in the Pod, or does the Pod keep using the original value?
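For orientation, since the KEP's full example is truncated in this excerpt: a minimal sketch of how a pod might consume the proposed field through a downwardAPI volume, assuming the fieldPath is spelled `metadata.ownerReferences` (that spelling is an assumption, not confirmed by the text here):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: kubernetes-downwardapi-volume-example
spec:
  containers:
    - name: main
      image: registry.k8s.io/busybox
      command: ["sh", "-c", "cat /etc/podinfo/ownerReferences; sleep 3600"]
      volumeMounts:
        - name: podinfo
          mountPath: /etc/podinfo
  volumes:
    - name: podinfo
      downwardAPI:
        items:
          - path: "ownerReferences"
            fieldRef:
              # Assumed fieldPath; the exact spelling is defined by the KEP, not here.
              fieldPath: metadata.ownerReferences
```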


e2e testing will consist of the following:
- Create a Deployment and/or DaemonSet
- Verify that the pod created by the Deployment/DaemonSet has the ownerReferences of the Deployment/DaemonSet via ENV VAR
Member

Is this correct? You said before that env variables are out of scope:

> It is out of the scope of this KEP to add the ownerReferences to the pod's environment variables.

Comment on lines +375 to +376
- Feature gate name: `DownwardAPIOwnerReferences`
- Components depending on the feature gate: `kubelet`

Why can't this be a new field in the kubelet configuration?
If the feature (implementation) turns out to be not appropriate, users can simply update the kubelet configuration and restart the kubelet. However, if it is a feature gate, users don't know which components are affected. We don't have documentation elsewhere stating that this gate only affects the kubelet. The best bet for them would be to restart all kube-xxx components, because this gate (as with others) will show up in kube-apiserver --help anyway once it lands.

We have witnessed a lot of "abuse" of feature gates. Can we think twice about this?

Member

A feature gate is a mechanism for introducing new changes while mitigating the risk of breaking existing clusters; gates are not meant to be used as flags. Once they graduate they disappear and the functionality can no longer be disabled ... Are you implying this KEP should add the functionality as opt-in or opt-out via configuration?


I think I know what feature gates were designed for ... My question is: do we want all new changes to be introduced with a feature gate? All changes could break existing clusters. I raised this concern because quite a few new feature gates introduced lately cover trivial changes, e.g. new optimizations to the watch cache, adding new fields to the status of a resource, adding a new entry to a resource annotation ...

Maybe this one is not so trivial; maybe it actually warrants a new feature gate that takes several releases to become GA. My point is about the user-facing changes. For most cluster admins, feature gates are currently applied system-wide, across all components. That is a HEAVY burden. I hope our team will consider this factor.

Member

> do we want all new changes to be introduced with a feature gate?

If you think your change could break a cluster, then YES.

If your change has any aspect of API, then YES.

> new optimizations to the watch cache

Could absolutely break a cluster!

> adding new fields to the status of a resource

Any API field we add goes through a gate so that we can safely do a rollback without breaking or dropping information.

> adding new entry to resource annotation

If this produces a value which version x-1 of Kubernetes would reject, then it needs a gate. Always think about this case:

1. User upgrades cluster from x->y
2. User uses your new API field.
3. Uh oh! Something bad happened.
4. User rolls cluster back to x.

Your API field is still stored in etcd, and may have some lingering external effect - what will happen? What if the user tries to update the object - will it fail validation? Or will the new field just disappear?
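For concreteness, a sketch of what alpha enablement might look like if the gate only affects the kubelet, as the diff above states (the gate name is taken from the diff; whether kubelet is really the only affected component is what this thread is debating):

```yaml
# KubeletConfiguration sketch: enabling the proposed gate on a node.
# Assumes the gate lands as described in the KEP diff above (kubelet only).
apiVersion: kubelet.config.k8s.io/v1beta1
kind: KubeletConfiguration
featureGates:
  DownwardAPIOwnerReferences: true
```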

Signed-off-by: Carlos Eduardo Arango Gutierrez <[email protected]>
## Summary

Today, when a pod wants to pass its `ownerReferences` to a new object it manages (a ConfigMap, a Secret, etc.), it needs to call the API server to get its own ownerReferences and then pass them to the new object.
The pod then needs GET access on all the intermediate resources, which it may not have, and granting that access is a security risk, as the pod can then read other resources it should not have access to.
Member

The pod only needs GET access on pods, not anything else?
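For reference, a sketch of the RBAC the status quo pushes onto the workload: to read its own ownerReferences today, the pod's ServiceAccount needs at least GET on pods (names below are illustrative):

```yaml
# Minimal grant the workload needs today just to read its own metadata.ownerReferences.
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: read-own-pod        # illustrative name
  namespace: default
rules:
  - apiGroups: [""]
    resources: ["pods"]
    verbs: ["get"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: read-own-pod        # illustrative name
  namespace: default
subjects:
  - kind: ServiceAccount
    name: default           # illustrative; whichever ServiceAccount the pod runs as
    namespace: default
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: read-own-pod
```

With the downward API field proposed by this KEP, none of this RBAC would be needed for that purpose.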





For example, a CustomResource created by a pod managed by a DaemonSet may be unexpectedly garbage collected if the pod is deleted. This can disrupt system behavior, as the necessary resource is lost. By allowing the pod to inherit and pass down ownerReferences, the CustomResource would remain correctly managed by the DaemonSet, avoiding such disruptions.
Member

There's a cause-effect implied here that I don't understand. I think you mean to say that now you have enough information to set the owner of the CR to the DS itself?

### Risks and Mitigations

- Security Risk: Some environments may not want to expose the ownerReferences of a pod to the pod itself.
- Mitigation: This feature could be added as a feature gate, disabled by default.
Member

This is not a mitigation to this KEP. If there is some cluster wherein the admin DOES NOT want pods to be able to know their owners, the mitigation would be to install an admission check (webhook or Validating Admission Policy) to reject pods that try to use this.
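A sketch of that admission-based mitigation using a ValidatingAdmissionPolicy with a CEL check; the fieldPath string `metadata.ownerReferences` is an assumption about how the KEP spells the new selector, and env vars are ignored since the KEP keeps them out of scope:

```yaml
apiVersion: admissionregistration.k8s.io/v1
kind: ValidatingAdmissionPolicy
metadata:
  name: deny-ownerref-downward-api       # illustrative name
spec:
  failurePolicy: Fail
  matchConstraints:
    resourceRules:
      - apiGroups: [""]
        apiVersions: ["v1"]
        operations: ["CREATE", "UPDATE"]
        resources: ["pods"]
  validations:
    # Reject any pod that mounts the (assumed) metadata.ownerReferences fieldPath.
    - expression: >-
        !has(object.spec.volumes) ||
        object.spec.volumes.all(v,
          !has(v.downwardAPI) || !has(v.downwardAPI.items) ||
          v.downwardAPI.items.all(i,
            !has(i.fieldRef) || i.fieldRef.fieldPath != 'metadata.ownerReferences'))
      message: "exposing ownerReferences via the downward API is not allowed in this cluster"
---
apiVersion: admissionregistration.k8s.io/v1
kind: ValidatingAdmissionPolicyBinding
metadata:
  name: deny-ownerref-downward-api
spec:
  policyName: deny-ownerref-downward-api
  validationActions: ["Deny"]
```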

Logs from the pod will show the ownerReferences of the pod:

```bash
$ kubectl logs kubernetes-downwardapi-volume-example
```
Member

This is under-specified.

I don't know if we should dump the WHOLE object, but it's probably fine - can you think of any reason to hide any of these?

YAML is a terrible file format for parsing - I think it would be better to spec this as something like:

"""
The file representation of this fieldref is a JSON list of objects, each of which is a serialized k8s.io/apimachinery/pkg/apis/meta/v1.OwnerReference.
"""

Open question: Should we instead produce a metav1.List? e.g.

```json
{
    "kind": "OwnerReference",
    "apiVersion": "meta/v1",
    "items": [
        { .... }, { ... }
    ]
}
```

IF there was ever a meta/v2, we would need to allow people to request that - does the output need to be self-documenting? I think probably not.
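Under the "JSON list of serialized metav1.OwnerReference objects" option described above, the file content might look like the following; the values are illustrative, reusing the DaemonSet name from the motivation and the uid from the truncated example:

```json
[
  {
    "apiVersion": "apps/v1",
    "kind": "DaemonSet",
    "name": "foobar",
    "uid": "6be1683f-da9c-4f68-9440-82376231cfa6",
    "controller": true,
    "blockOwnerDeletion": true
  }
]
```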


If `ownerReferences` is empty or not set, the file will be empty.

It is out of the scope of this KEP to add the `ownerReferences` to the pod's environment variables.
Member

Why? It's always a smell when we do one but not the other. This value could be specced the same as the file - a JSON list.





###### Can the feature be disabled once it has been enabled (i.e. can we roll back the enablement)?

No, the missing Downward API field will be perceived as a missing Volume, preventing the pod from starting.
Member

Actually, wouldn't it be seen as an unknown field ref?
