Skip to content

Pods with security groups cannot resolve/too slow to resolve domain names #3126

@uyilmaz

Description

@uyilmaz

What happened:

I have this following setup, using "security groups for pods" and "prefix delegation":

  • POD_SECURITY_GROUP_ENFORCING_MODE is set to "standard"
  • ENABLE_PREFIX_DELEGATION is set to "true"
  • AWS_VPC_K8S_CNI_EXTERNALSNAT is set to "false"
  • node type is "r6g.medium"
  • pods that have a security group assigned also have a network policy assigned
  • 2 nodes

Pods without a security group work normally, however, pods with a security group resolve DNS addresses so slowly that I first thought they couldn't resolve at all, but after numerous retries I get a few successful resolves. For example curl example.com times out with "could not resolve address" most of the time.

Environment:

  • Kubernetes version (use kubectl version): v1.31.2-eks-7f9249a
  • CNI Version: v1.18.6-eksbuild.1 (aws-network-policy-agent:v1.1.4-eksbuild.1)
  • OS (e.g: cat /etc/os-release): Amazon Linux 2
  • Kernel (e.g. uname -a): Linux ip-x-x-xxx-xx.ap-northeast-1.compute.internal x.xx.xxx-xxx.xxx.amzn2.aarch64 #1 SMP Tue Oct 22 16:38:25 UTC 2024 aarch64 aarch64 aarch64 GNU/Linux

I have another cluster with the following setup that does not have the same problem (differences in bold):

  • POD_SECURITY_GROUP_ENFORCING_MODE is set to "standard"
  • ENABLE_PREFIX_DELEGATION is set to "false"
  • AWS_VPC_K8S_CNI_EXTERNALSNAT is set to "false"
  • node type is "m6g.xlarge"
  • pods that have a security group assigned also have a network policy assigned
  • 1 node

Environment:

  • Kubernetes version (use kubectl version): v1.28.15-eks-7f9249a
  • CNI Version: v1.15.4-eksbuild.1
  • OS (e.g: cat /etc/os-release): Amazon Linux 2

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions