Scalability Test and Cluster Management Doc #511

hanahmily · 2024-08-14T09:22:43Z

Update the CHANGES log.

Signed-off-by: Gao Hongtao <[email protected]>

docs/operation/cluster.md

wu-sheng · 2024-08-14T09:32:43Z

docs/operation/cluster.md

+
+The cluster's availability is also improved by increasing the number of data nodes, as active data nodes need to handle a lower additional workload when some data nodes become unavailable. For example, if one node out of 2 nodes is unavailable, then 50% of the load is re-distributed across the remaining node, resulting in a 100% per-node workload increase. If one node out of 10 nodes is unavailable, then 10% of the load is re-distributed across the 9 remaining nodes, resulting in only an 11% per-node workload increase.
+
+Increasing the number of etcd nodes can increase the cluster's metadata capacity and improve the cluster's metadata query performance. It can also improve the cluster's metadata availability, as the metadata is replicated across all the etcd nodes. However, the cluster size should be odd to avoid split-brain situations.


etcd could be a potential risk when we run larger scale deployment, I believe.

Absolutely.

During the test, I used 10 data nodes and only 1 etcd node in a medium-sized cluster. Moving forward, we need to include more extensive and complex scenarios in the scale testing.

Yes, and when we meet Lenovo team next month, we need to verify the scale with them.

wu-sheng · 2024-08-14T12:46:46Z

Others are good, please fix menu structure for operation docs.

Signed-off-by: Gao Hongtao <[email protected]>

wu-sheng · 2024-08-14T12:49:31Z

Why your yaml is so different?

Signed-off-by: Gao Hongtao <[email protected]>

hanahmily added 4 commits August 13, 2024 13:48

Introduce scale tes

bd7943d

Signed-off-by: Gao Hongtao <[email protected]>

Add cluster management guide

977b927

Signed-off-by: Gao Hongtao <[email protected]>

Add cluster management guide

301060f

Signed-off-by: Gao Hongtao <[email protected]>

Merge remote-tracking branch 'origin/cluster' into cluster

ce90012

hanahmily added documentation Improvements or additions to documentation testing labels Aug 14, 2024

hanahmily added this to the 0.7.0 milestone Aug 14, 2024

hanahmily requested review from wu-sheng and wankai123 August 14, 2024 09:22

wankai123 reviewed Aug 14, 2024

View reviewed changes

docs/operation/cluster.md Show resolved Hide resolved

Merge branch 'main' into cluster

9bbc050

wu-sheng reviewed Aug 14, 2024

View reviewed changes

Update menu

c564b65

Signed-off-by: Gao Hongtao <[email protected]>

remove compact sequence indent

5d2a3ea

Signed-off-by: Gao Hongtao <[email protected]>

wu-sheng approved these changes Aug 14, 2024

View reviewed changes

wu-sheng merged commit c27d562 into main Aug 14, 2024
15 checks passed

wu-sheng deleted the cluster branch August 14, 2024 13:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scalability Test and Cluster Management Doc #511

Scalability Test and Cluster Management Doc #511

hanahmily commented Aug 14, 2024

wu-sheng Aug 14, 2024

hanahmily Aug 14, 2024

wu-sheng Aug 14, 2024

wu-sheng commented Aug 14, 2024

wu-sheng commented Aug 14, 2024


		The cluster's availability is also improved by increasing the number of data nodes, as active data nodes need to handle a lower additional workload when some data nodes become unavailable. For example, if one node out of 2 nodes is unavailable, then 50% of the load is re-distributed across the remaining node, resulting in a 100% per-node workload increase. If one node out of 10 nodes is unavailable, then 10% of the load is re-distributed across the 9 remaining nodes, resulting in only an 11% per-node workload increase.

		Increasing the number of etcd nodes can increase the cluster's metadata capacity and improve the cluster's metadata query performance. It can also improve the cluster's metadata availability, as the metadata is replicated across all the etcd nodes. However, the cluster size should be odd to avoid split-brain situations.

Scalability Test and Cluster Management Doc #511

Scalability Test and Cluster Management Doc #511

Conversation

hanahmily commented Aug 14, 2024

wu-sheng Aug 14, 2024

Choose a reason for hiding this comment

hanahmily Aug 14, 2024

Choose a reason for hiding this comment

wu-sheng Aug 14, 2024

Choose a reason for hiding this comment

wu-sheng commented Aug 14, 2024

wu-sheng commented Aug 14, 2024