feat: As a user, I want kubernetes service discovery to support more configuration items #8311

tangzhenhuang · 2022-11-11T03:17:42Z

Description

Recently, we deployed apisix on different clouds and used the feature of kubernetes service discovery. The problem is that on different clouds, the proxy layer (LB) in front of apiserver has different idle timeouts. However, in apisix's kubernetes service discovery, The time of a watch is fixed, which will cause a problem: when there is no endpoints event in the cluster for a long time, the server will time out instead of the client, and then the service discovery will restart the list-watch after a fixed 40 seconds , so if you can add some configuration items, such as the duration of a watch, retry time or strategy, etc., thank you!

tokers · 2022-11-11T09:36:42Z

The current watch timeout is hard coded with a built-in sample algorithm. I think we can add a new field for users to configure the watch timeout.

zhixiongdu027 · 2022-11-11T10:31:49Z

I think the goal is to avoid "re list-watch".
and that's not what "40 seconds" brings

tokers · 2022-11-13T10:19:06Z

I think the goal is to avoid "re list-watch". and that's not what "40 seconds" brings

Any suggestions?

zhixiongdu027 · 2022-11-14T06:42:31Z

In order to solve the problem,
Maybe we can make events via mock endpoints change in a specific namespace to keep tcp active
@crazyMonkey1995 @tokers

tangzhenhuang · 2022-11-14T07:44:21Z

In order to solve the problem, Maybe we can make events via mock endpoints change in a specific namespace to keep tcp active @crazyMonkey1995 @tokers

How about making timeout a configurable parameter? Because the user himself knows what the timeout of the target apiserver (or its proxy) is.

zhixiongdu027 · 2022-11-16T01:44:13Z

Too short watchSeconds value will produce many "re list-watch"
Too long watchSeconds value will cause the proxy to terminate the connection early

do we have to use an proxy before apiserver ?

tangzhenhuang · 2022-11-16T02:10:41Z

Too short watchSeconds value will produce many "re list-watch" Too long watchSeconds value will cause the proxy to terminate the connection early

do we have to use an proxy before apiserver ?

In actual usage scenarios, such as Alibaba Cloud, AWS, Azure, etc., the apiserver will have a proxy

tzssangglass · 2022-11-16T02:14:12Z

Too short watchSeconds value will produce many "re list-watch" Too long watchSeconds value will cause the proxy to terminate the connection early

do we have to use an proxy before apiserver ?

In fact, if you use resty.http or ngx.tcp.socket, even if you don't set the timeout, there will be a default timeout, which is 60 s as I remember.

zhixiongdu027 · 2022-11-16T02:23:51Z

In fact, if you use resty.http or ngx.tcp.socket, even if you don't set the timeout, there will be a default timeout, which is 60 s as I remember.

The problem is not here, and in the code it is already set
httpc:set_timeouts

apisix/apisix/discovery/kubernetes/informer_factory.lua

Lines 199 to 206 in 288708c

    
           local function watch(httpc, apiserver, informer) 
        
               local watch_times = 8 
        
               for _ = 1, watch_times do 
        
                   local watch_seconds = 1800 + math.random(9, 999) 
        
                   informer.overtime = watch_seconds 
        
                   local http_seconds = watch_seconds + 120 
        
                   httpc:set_timeouts(2000, 3000, http_seconds * 1000)

The problem is that in a network topology like the following
discovery --(1)--> proxy --(2)--> apiserver

Position(1) does not match timeout policy for Position(2)

@tzssangglass

zhixiongdu027 · 2022-11-16T09:39:10Z

@crazyMonkey1995 @tokers @tzssangglass

I would like to make a PR for "support configuration watchSeconds and retryInterval" latter

tzssangglass · 2022-11-16T10:43:54Z

The problem is that in a network topology like the following
discovery --(1)--> proxy --(2)--> apiserver

we can make 2000, 3000, http_seconds * 1000 in the code httpc:set_timeouts(2000, 3000, http_seconds * 1000) be configurabled by the user.

How about making timeout a configurable parameter? Because the user himself knows what the timeout of the target apiserver (or its proxy) is.

As described here, the user needs to configure the timeout to be smaller than the proxy.

zhixiongdu027 · 2022-11-22T02:55:53Z

I would like to make a PR for "support configuration watchSeconds and retryInterval" latter

I tend to use a config in the following format, or any other suggestions ?

kubernetes:
    service:  ...
    client:    ...
    retry_interval: 30
    min_watch:    1800
    max_watch:   2000

@crazyMonkey1995 @tokers @tzssangglass @spacewander

tzssangglass · 2022-11-22T06:07:20Z

kubernetes:
    service:  ...
    client:    ...
    retry_interval: 30
    min_watch:    1800
    max_watch:   2000

what about

kubernetes:
    service:  ...
    client:    ...
    retry_interval: 30
    watch: 
      connect: 
      send:
      read:

ro4i7 · 2023-03-12T15:17:46Z

Hello @spacewander @tokers @tzssangglass @crazyMonkey1995

if this issue is still open, please assign it to me:
please give the feedback on following solution:

To solve this issue, we can add some configuration items to the Kubernetes service discovery such as the duration of a watch, retry time, or strategy, as shown below:

service:
  client:
    retry_interval: 30
  watch:
    duration: 60
    retry_strategy: exponential_backoff

In this configuration, the duration of a watch is set to 60 seconds, and the retry strategy is set to exponential backoff. The retry interval is set to 30 seconds, which means that the client will retry connecting to the service after 30 seconds if the initial connection attempt fails.

spacewander added the good first issue Good for newcomers label Nov 11, 2022

tangzhenhuang mentioned this issue Nov 15, 2022

help request: As a user, I use kubernetes service discovery ,same apisix instance ，It took a long time to get the changed ip #8313

Closed

zhixiongdu027 mentioned this issue Dec 26, 2022

help request: Service discovery uses K8S to start an error report ，endpoints is forbidden 403 #8552

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: As a user, I want kubernetes service discovery to support more configuration items #8311

feat: As a user, I want kubernetes service discovery to support more configuration items #8311

tangzhenhuang commented Nov 11, 2022

tokers commented Nov 11, 2022

zhixiongdu027 commented Nov 11, 2022

tokers commented Nov 13, 2022

zhixiongdu027 commented Nov 14, 2022 •

edited

Loading

tangzhenhuang commented Nov 14, 2022

zhixiongdu027 commented Nov 16, 2022

tangzhenhuang commented Nov 16, 2022

tzssangglass commented Nov 16, 2022

zhixiongdu027 commented Nov 16, 2022 •

edited

Loading

zhixiongdu027 commented Nov 16, 2022

tzssangglass commented Nov 16, 2022

zhixiongdu027 commented Nov 22, 2022

tzssangglass commented Nov 22, 2022

ro4i7 commented Mar 12, 2023

feat: As a user, I want kubernetes service discovery to support more configuration items #8311

feat: As a user, I want kubernetes service discovery to support more configuration items #8311

Comments

tangzhenhuang commented Nov 11, 2022

Description

tokers commented Nov 11, 2022

zhixiongdu027 commented Nov 11, 2022

tokers commented Nov 13, 2022

zhixiongdu027 commented Nov 14, 2022 • edited Loading

tangzhenhuang commented Nov 14, 2022

zhixiongdu027 commented Nov 16, 2022

tangzhenhuang commented Nov 16, 2022

tzssangglass commented Nov 16, 2022

zhixiongdu027 commented Nov 16, 2022 • edited Loading

zhixiongdu027 commented Nov 16, 2022

tzssangglass commented Nov 16, 2022

zhixiongdu027 commented Nov 22, 2022

tzssangglass commented Nov 22, 2022

ro4i7 commented Mar 12, 2023

zhixiongdu027 commented Nov 14, 2022 •

edited

Loading

zhixiongdu027 commented Nov 16, 2022 •

edited

Loading