
Remove need for the apiserver to contact kubelet for current container state #156

Closed
jjhuff opened this issue Jun 18, 2014 · 34 comments
Labels: area/kubelet, area/nodecontroller, priority/backlog, sig/node, sig/scalability

Comments

@jjhuff (Contributor) commented Jun 18, 2014

While the kubelet is certainly the source of truth for what is running on a particular host, it would be nice to have it push that information to the apiserver on a regular basis (and on state changes) rather than force the apiserver to ask.

Reasons:

  • In some deployment scenarios, the apiserver might not have direct network access to individual kubelets.
  • It'd give the apiserver (and its clients) access to a reasonably current view of the world without needing to poll each kubelet. This would be handy for improved replication/placement/auto-scale algorithms.
  • The update could include other info, like statistics for both the host and containers.
  • It opens the door to auto-registering hosts.

The kubelet could publish this to etcd directly, but I think it'd be good to aim for fewer dependencies on etcd rather than more. Thoughts on that?
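As a sketch of what that push might look like on the kubelet side -- assuming a hypothetical /nodes/&lt;host&gt;/status endpoint and payload shape, since no such API exists yet:

```go
package main

import (
	"bytes"
	"encoding/json"
	"log"
	"net/http"
	"time"
)

type ContainerState struct {
	Name   string `json:"name"`
	Status string `json:"status"` // e.g. "running", "exited"
}

type NodeStatus struct {
	Host       string           `json:"host"`
	Containers []ContainerState `json:"containers"`
}

// pushStatus reports this node's container state to the apiserver.
func pushStatus(apiserver string, status NodeStatus) error {
	body, err := json.Marshal(status)
	if err != nil {
		return err
	}
	// Hypothetical endpoint: the apiserver would record this as the
	// node's last-known state instead of polling the kubelet for it.
	resp, err := http.Post(apiserver+"/nodes/"+status.Host+"/status",
		"application/json", bytes.NewReader(body))
	if err != nil {
		return err
	}
	resp.Body.Close()
	return nil
}

func main() {
	// Push on an interval; a real kubelet would also push on state changes.
	for range time.Tick(10 * time.Second) {
		status := NodeStatus{Host: "node-1"} // container list gathered from the runtime
		if err := pushStatus("http://apiserver:8080", status); err != nil {
			log.Printf("status push failed: %v", err)
		}
	}
}
```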

@brendandburns (Contributor)

I think we'd rather not have the Kubelet calling back into the master, since it can result in massive fan-in storms of messages.

However, I can definitely see the value in caching the information inside the apiserver. So what do you think about having the apiserver periodically poll all Kubelets for information and cache it locally? I think that would satisfy all of the needs you enumerated, while still letting the master control the flow of information.

What do you think?
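To make the shape of that concrete, here is a minimal poll-and-cache sketch, assuming the apiserver knows the node list; 10250 is the kubelet's port, but the /podInfo path and cache layout are illustrative assumptions:

```go
package main

import (
	"io"
	"net/http"
	"sync"
	"time"
)

// statusCache holds the last payload scraped from each kubelet.
type statusCache struct {
	mu   sync.RWMutex
	data map[string][]byte // host -> last raw status payload
}

// poll fans out to every known kubelet and refreshes the cache.
func (c *statusCache) poll(hosts []string) {
	for _, h := range hosts {
		resp, err := http.Get("http://" + h + ":10250/podInfo")
		if err != nil {
			continue // node unreachable; keep serving the stale entry
		}
		body, err := io.ReadAll(resp.Body)
		resp.Body.Close()
		if err != nil {
			continue
		}
		c.mu.Lock()
		c.data[h] = body
		c.mu.Unlock()
	}
}

func main() {
	cache := &statusCache{data: map[string][]byte{}}
	hosts := []string{"node-1", "node-2"}
	for range time.Tick(30 * time.Second) {
		cache.poll(hosts)
	}
}
```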

@jbeda (Contributor) commented Jun 18, 2014

In the past I've suggested a strategy where the master (or some fellow-traveller server process) scrapes the nodes regularly and writes the results back to the master/etcd. We'd then return how stale results are in our API.

Further, there'd be an API option to ask for "up to the second" results, which would trigger a synchronous call out to the node.
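A sketch of how that staleness could surface in the API, assuming each cache entry records when it was scraped, with a hypothetical ?fresh=true query flag for the synchronous path:

```go
package main

import (
	"encoding/json"
	"net/http"
	"time"
)

// cacheEntry records when a node's status was last scraped, so the API
// can report how stale a cached answer is.
type cacheEntry struct {
	Payload   json.RawMessage
	ScrapedAt time.Time
}

type statusReply struct {
	Payload    json.RawMessage `json:"payload"`
	AgeSeconds float64         `json:"ageSeconds"` // 0 means fetched synchronously
}

func statusHandler(cache map[string]cacheEntry, fetchLive func(host string) (json.RawMessage, error)) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		host := r.URL.Query().Get("host")
		// ?fresh=true asks for "up to the second" results: a synchronous
		// call out to the node instead of the cached copy.
		if r.URL.Query().Get("fresh") == "true" {
			payload, err := fetchLive(host)
			if err != nil {
				http.Error(w, err.Error(), http.StatusBadGateway)
				return
			}
			json.NewEncoder(w).Encode(statusReply{Payload: payload})
			return
		}
		entry := cache[host]
		json.NewEncoder(w).Encode(statusReply{
			Payload:    entry.Payload,
			AgeSeconds: time.Since(entry.ScrapedAt).Seconds(),
		})
	}
}

func main() {
	cache := map[string]cacheEntry{} // filled by the background scraper
	live := func(host string) (json.RawMessage, error) { return json.RawMessage(`{}`), nil }
	http.HandleFunc("/nodeStatus", statusHandler(cache, live))
	http.ListenAndServe(":8080", nil)
}
```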

@jjhuff (Contributor, Author) commented Jun 18, 2014

I think that any apiserver->kubelet polling will have problems in some deployments. For example:

  • Using a mix of internal machines and GCE. Without a fair amount of work, neither has full access to the other's network -- each has to talk via NAT.
  • Running the master on, say, App Engine. That'd be super handy for reducing common-mode failures.

I hear ya on the fan-in problem. That problem already exists to some extent with the HTTP polling option. Since the apiserver is stateless (yay!), it should scale out reasonably well as long as the backing store can handle it...but we already have that problem.

Perhaps only push updates on state changes?


@brendandburns (Contributor)

I hear you about the problems with mixed topologies, but pushing updates on state changes actually has the same fan-in problem: suppose all of your tasks fail at the same time from a packet of death -- you'll still see a storm of messages.

Hacking in the polling from the apiserver is going to be easier in the short term, so I think we'll at least do that first.
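For reference, the usual way to blunt (though not eliminate) that kind of storm on the sender side is jitter plus coalescing; a hedged sketch, with all names illustrative:

```go
package main

import (
	"log"
	"math/rand"
	"time"
)

// reportWithJitter delays each state-change report by a random amount
// and coalesces changes that arrive while it waits, so simultaneous
// failures across a fleet reach the master as a trickle, not a spike.
func reportWithJitter(changes <-chan string, maxJitter time.Duration, send func(string)) {
	for change := range changes {
		time.Sleep(time.Duration(rand.Int63n(int64(maxJitter))))
		// Only the latest state matters; drop intermediate changes.
		drained := false
		for !drained {
			select {
			case change = <-changes:
			default:
				drained = true
			}
		}
		send(change)
	}
}

func main() {
	changes := make(chan string, 16)
	go reportWithJitter(changes, 5*time.Second, func(s string) {
		log.Printf("reporting: %s", s)
	})
	changes <- "container-a exited" // e.g. the packet-of-death scenario
	time.Sleep(6 * time.Second)
}
```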

@bgrant0607 (Member)

The main reasons for the apiserver to contact kubelets, rather than the other way around, are:

  • connection storms, which admittedly can be mitigated by fuzzing
  • other communication storms, which can be mitigated by the apiserver returning callback times to the kubelets (see the sketch after this comment)
  • facilitating use of kubelets without an apiserver, which admittedly could be controlled via configuration
  • supporting multiple independent state scrapers
  • sharding state scraping amongst multiple servers, which admittedly could be handled with redirects
  • being natural for heartbeating, starting new containers, and remote management of the kubelet itself

Regardless of which component initiates the connection, we may want to implement a number of optimizations, especially once we start to collect resource stats:

  • state caching
  • change notification rather than polling
  • change significance filtering
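To illustrate the callback-times idea from the second bullet above: the response to each status report could tell the kubelet when it may next report, letting the master pace the flow. A sketch, with the endpoint and field names as assumptions:

```go
package main

import (
	"encoding/json"
	"net/http"
	"time"
)

// reportAck is the (hypothetical) response to a status report; the
// server uses it to pace how often this kubelet calls back.
type reportAck struct {
	NextReportAfterSeconds int `json:"nextReportAfterSeconds"`
}

func reportLoop(apiserver string) {
	wait := 10 * time.Second // default interval
	for {
		time.Sleep(wait)
		resp, err := http.Post(apiserver+"/nodeStatus", "application/json", nil)
		if err != nil {
			continue // retry at the current interval
		}
		var ack reportAck
		if json.NewDecoder(resp.Body).Decode(&ack) == nil && ack.NextReportAfterSeconds > 0 {
			// An overloaded master stretches the interval; a quiet one
			// can shrink it again.
			wait = time.Duration(ack.NextReportAfterSeconds) * time.Second
		}
		resp.Body.Close()
	}
}

func main() {
	reportLoop("http://apiserver:8080")
}
```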

@jjhuff (Contributor, Author) commented Jun 18, 2014

My concern has mainly been figuring out the blockers to actually deploying Kubernetes outside of a strictly GCE environment, given security and network topology constraints. That makes me want to take scissors to these all-to-all communication patterns. :)

Perhaps a hybrid approach would be best:

  • Cache state on the master -- either in memory or backed by etcd/whatever. In memory should be reasonable for a single apiserver instance, but we'd need persistence for anything more.
  • Optional state push (interval- or change-based) from the kubelets, much like all of its existing options. The state push would just populate the cache.

This adds caching to the baseline config, gives the option to reverse the apiserver-kubelet communication, and preserves the ability for other tools to scrape the kubelets.
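A sketch of that hybrid: a single cache on the master, populated either by the apiserver's poll loop or by an optional push handler that kubelets use when they can reach the master. Paths, port, and types are illustrative assumptions:

```go
package main

import (
	"io"
	"net/http"
	"sync"
	"time"
)

type nodeCache struct {
	mu      sync.RWMutex
	entries map[string][]byte // host -> last known status payload
}

func (c *nodeCache) set(host string, payload []byte) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.entries[host] = payload
}

// pushHandler lets kubelets that can reach the master populate the
// cache directly instead of waiting to be polled.
func (c *nodeCache) pushHandler(w http.ResponseWriter, r *http.Request) {
	payload, err := io.ReadAll(r.Body)
	if err != nil {
		http.Error(w, err.Error(), http.StatusBadRequest)
		return
	}
	c.set(r.URL.Query().Get("host"), payload)
}

// pollLoop covers nodes that cannot push, feeding the same cache.
func (c *nodeCache) pollLoop(hosts []string) {
	for range time.Tick(30 * time.Second) {
		for _, h := range hosts {
			resp, err := http.Get("http://" + h + ":10250/podInfo")
			if err != nil {
				continue
			}
			body, _ := io.ReadAll(resp.Body)
			resp.Body.Close()
			c.set(h, body)
		}
	}
}

func main() {
	cache := &nodeCache{entries: map[string][]byte{}}
	go cache.pollLoop([]string{"node-1"})
	http.HandleFunc("/nodeStatus/push", cache.pushHandler)
	http.ListenAndServe(":8080", nil)
}
```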


@brendandburns (Contributor)

SGTM.

I will send a PR for the poll and cache support today, and then work on the optional Kubelet -> apiserver code path.

Thanks for bearing with us as we sort through this stuff ;)

--brendan


@jjhuff (Contributor, Author) commented Jun 18, 2014

No problem! Thanks for taking on the work -- I'd have been happy to do it myself. I was thinking of tackling #134 as well, but I can wait if you want to avoid conflicts.

@brendandburns (Contributor)

We're happy to take the work! Feel free to take on #134, and I might delegate the push stuff to you too. We'll see how the pull cache goes....

Best
--brendan


@brendandburns (Contributor)

Polling from the master was added in #171.

I'll work on optional push next.

@smarterclayton (Contributor)

#846 isolates the kubelet from etcd, except for writes of logs (covered by #285), with the same semantics preserved via the apiserver.

bgrant0607 added the sig/scalability and priority/awaiting-more-evidence labels on Dec 3, 2014
@bgrant0607 (Member)

Related to #2726 and #2483. The Kubelet is effectively a "pod controller", and should be providing pod status.

We're also pursuing auto-registration, #2303.

dchen1107 added the sig/node label on Feb 4, 2015
@smarterclayton (Contributor)

David is going to open an issue on the security of the kubelet in general, with a proposal to restrict the kubelet via TLS client certs. We may also need to let the kubelet ask the master whether certain things are allowed, which would be covered under his SubjectAccessReview proposal.

@bgrant0607 (Member)

cc @erictune

@alex-mohr (Contributor)

@roberthbailey @cjcullen Robby and CJ have also been looking into securing the kubelet <-> master communication with TLS certs. We definitely need to harden that path, but I think that's a separate issue from this one?

@smarterclayton (Contributor)

@deads2k


@bgrant0607 (Member)

This proposal was to change to unidirectional communication from kubelet to apiserver. If we replicated the apiserver (#473), I think that could work reasonably well, and it would be in line with the approach used by other components, such as the controller-manager and scheduler.

@bgrant0607 (Member)

@yujuhong

@bgrant0607 (Member)

Related: #3168, #2435, #2303. All issues predicted by @jjhuff.

zmerlynn added a commit to zmerlynn/kubernetes that referenced this issue Mar 10, 2015

Change provisioning to pass all variables to both master and node. Run Salt in a masterless setup on all nodes a la http://docs.saltstack.com/en/latest/topics/tutorials/quickstart.html, which involves ensuring the Salt daemon is NOT running after install. Kill the Salt master install. And fix push to actually work in this new flow.

As part of this, the GCE Salt config no longer has access to the Salt mine, which is primarily obnoxious for two reasons:

  • The minions can't use Salt to see the master: this is easily fixed by static config.
  • The master can't see the list of all the minions: this is fixed temporarily by static config in util.sh, but later, by other means (see kubernetes#156, which should eventually remove this direction).

As part of it, flatten all of cluster/gce/templates/* into configure-vm.sh, using a single, separate piece of YAML to drive the environment variables, rather than constantly rewriting the startup script.
@dchen1107 (Member)

Should we close this one?

@bgrant0607 (Member)

Yes

akram pushed a commit to akram/kubernetes that referenced this issue Apr 7, 2015
resouer pushed a commit to resouer/kubernetes that referenced this issue Dec 5, 2016
xingzhou pushed a commit to xingzhou/kubernetes that referenced this issue Dec 15, 2016
devel/local-up: doc cfssl requirement
iaguis pushed a commit to kinvolk/kubernetes that referenced this issue Feb 6, 2018
seans3 pushed a commit to seans3/kubernetes that referenced this issue Apr 10, 2019
marun added a commit to marun/kubernetes that referenced this issue Jun 24, 2020
pjh pushed a commit to pjh/kubernetes that referenced this issue Jan 31, 2022
linxiulei pushed a commit to linxiulei/kubernetes that referenced this issue Jan 18, 2024