How to auto-rebalance and duplicate data?

KennethAdamMiller · March 2, 2022, 12:04am

Suppose I have a small cluster, and it isn’t at all uniform, so there are drives of different sizes. I have a data processing task, and it uses more than any one single drive can hold on it’s own. Across the many drives, there is plenty of space however. I would like IPFS to automatically perform some form of balancing and distribution of pinned data. Also, I would like to configure that it implicitly duplicate anything that is pinned by a parameter I give, and ensure that duplication spreads across several drives.

How can I configure IPFS to auto-rebalance and duplicate my data when I pin it?

hector · March 2, 2022, 8:46am

See https://cluster.ipfs.io.

KennethAdamMiller · March 2, 2022, 6:59pm

Actually, after I wrote this I did go and read about cluster ipfs. That solves part of my problem, the duplication problem. I still need to find a way to make sure that data is siphoned to the larger drives.

hector · March 3, 2022, 12:24pm

Create an LVM volume or RAID array for your disks.

KennethAdamMiller · March 10, 2022, 5:38pm

I can see that I can manage the replication with this, but not the rebalancing.

KennethAdamMiller · March 10, 2022, 5:38pm

That only works for the extent that I have a single machine with many disks. My data center may be very unbalanced in terms of hardware.

Topic		Replies	Views
Trigger redistribution of files across nodes	0	171	August 5, 2023
Is there any doc on IPFS file replication, how it avoid single point failture?	10	4709	November 8, 2017
Pinning data without duplication? Help	7	97	August 21, 2024
Auto healing in IPFS-CLUSTER IPFS Cluster ipfs-cluster	1	486	August 24, 2021
How does IPFS Cluster implement replication Help ipfs-cluster	5	1273	April 29, 2020

How to auto-rebalance and duplicate data?

Related topics