What happened:

We have hit several deadlocks with JFS in our environment, which runs on AWS RDS and AWS S3. Specifically, 500 nodes write logs to JFS simultaneously, and the logs are updated in real time.
Quota is also enabled in JFS.
What you expected to happen:
I don't know why the deadlocks happen, and I would like to fix them.
How to reproduce it (as minimally and precisely as possible):
Launch 500 AWS EC2 instances with JFS mounted, start a Python application on each of them, and keep writing logs to JFS.
EC2 instances may be destroyed and new ones launched from time to time.
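A minimal sketch of such a per-node writer, in case it helps with reproduction. It is not the actual application: the function name `append_log_lines`, the per-host file naming, and the `/jfs/logs` path in the usage note are all hypothetical.

```python
import os
import socket
import time


def append_log_lines(log_dir, count, interval=0.0):
    """Append `count` lines to a per-host log file, flushing after each line.

    Hypothetical helper for illustration only; the real application's
    logging setup is not shown in this report.
    """
    os.makedirs(log_dir, exist_ok=True)
    # One file per host and process, so concurrent writers do not share a file.
    path = os.path.join(log_dir, f"{socket.gethostname()}-{os.getpid()}.log")
    with open(path, "a") as f:
        for i in range(count):
            f.write(f"{time.time():.3f} line {i}\n")
            f.flush()  # hand each line to the OS right away so the file grows continuously
            time.sleep(interval)
    return path
```

On each node this could be run as `append_log_lines("/jfs/logs", count=1_000_000, interval=0.01)`, assuming JFS is mounted at `/jfs`.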
Anything else we need to know?
Here is some monitoring data:
Process 25448 waits for ExclusiveLock on tuple (2,32) of relation 24708 of database 5; blocked by process 31328.
Process 31328 waits for ShareLock on transaction 685022274; blocked by process 31374.
Process 31374 waits for ExclusiveLock on tuple (5,57) of relation 24708 of database 5; blocked by process 31902.
Process 31902 waits for ShareLock on transaction 685023299; blocked by process 25448.
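To make the cycle in these four lines easier to see, here is a small sketch that parses them into a wait-for graph and follows the "blocked by" links. The regular expression and the helper names `wait_for_graph` and `find_cycle` are my own, written only against the four lines above, not against any general log format.

```python
import re

LOG = """\
Process 25448 waits for ExclusiveLock on tuple (2,32) of relation 24708 of database 5; blocked by process 31328.
Process 31328 waits for ShareLock on transaction 685022274; blocked by process 31374.
Process 31374 waits for ExclusiveLock on tuple (5,57) of relation 24708 of database 5; blocked by process 31902.
Process 31902 waits for ShareLock on transaction 685023299; blocked by process 25448.
"""

# waiter pid, lock mode, locked object, blocker pid
PATTERN = re.compile(r"Process (\d+) waits for (\w+) on (.+?); blocked by process (\d+)\.")


def wait_for_graph(log):
    """Map each waiting pid to (blocking pid, lock mode, locked object)."""
    graph = {}
    for match in PATTERN.finditer(log):
        waiter, mode, target, blocker = match.groups()
        graph[int(waiter)] = (int(blocker), mode, target)
    return graph


def find_cycle(graph, start):
    """Follow 'blocked by' links from `start`; return the cycle or None."""
    path = [start]
    node = graph[start][0]
    while node != start:
        if node in path or node not in graph:
            return None  # chain ends or loops without returning to start
        path.append(node)
        node = graph[node][0]
    return path + [start]


graph = wait_for_graph(LOG)
cycle = find_cycle(graph, 25448)
print(" -> ".join(str(pid) for pid in cycle))
# 25448 -> 31328 -> 31374 -> 31902 -> 25448
```

The graph shows two processes waiting for tuple locks in relation 24708 and two waiting for transactions to finish, which together close the loop.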
Environment:
JuiceFS version (use juicefs --version) or Hadoop Java SDK version: juicefs version 1.1.1+2023-11-28.437f4e6
OS (cat /etc/os-release): VERSION="22.04.4 LTS (Jammy Jellyfish)"
Kernel (uname -a): 6.5.0-1024-aws