Sync data from an existing node to the entire cluster

hbarcelos · March 25, 2021, 2:50pm

I have a node which was not previously part of a cluster.
It already has lots of production data (65377 pins to be exact).

To add the cluster, I’m using ipfs-cluster-service@0.13.1 in this node and another one.
I followed the guide on the docs and I can see the nodes are connected through ipfs-cluster-ctl peers ls.

However the data is not being copied from one node to the other.

Edit:
I should also clarify that stopping the current node to copy all the data to the other node is not an option. Ideally this should be done without a downtime.

Does that mean that I must manually pin all local pins to the entire cluster?
Is that a better way to handle this?

hector · March 25, 2021, 6:55pm

Do you mean that you are adding an node in which IPFS has 65377 into a cluster?

Those pins are in the ipfs daemon. ipfs-cluster does not know about them (ipfs-cluster-ctl pin ls shows what cluster knows about). For ipfs cluster to track those pins, you need to pin them using ipfs cluster.

ipfs pin ls --type=recursive | xargs ipfs-cluster-ctl pin add <your options>

would do it I think?

If you are using CRDT-consensus mode, after pin-adding those 60k pins, you may want to compact the state by manually doing state-export and then state-import on your nodes (Data, backups and recovery - Pinset orchestration for IPFS).

hbarcelos · March 25, 2021, 7:10pm

I see, thank you for that.

Regarding the export/import, I was reading through the docs and found this section:

In crdt , importing will clean the state completely and create a single batch Merkle-DAG node. This effectively compacts the state by replacing the Merkle-DAG, but to prevent this peer from re-downloading the old DAG, all other peers in the Cluster should have replaced or removed it to.

I’m not sure I understood the implications. Could you please clarify on that?
Should I make sure I run the export/import in all nodes in the cluster?

hector · March 25, 2021, 7:28pm

It means that if you export/import, you need to do it in all your cluster nodes (with the same file you exported in one of them). (or import it in one, but do a state cleanup in the rest).

hbarcelos · March 26, 2021, 12:44pm

Gotcha… Thanks again.

Now I’m trying to run ipfs-service state export, however I keep getting the following error:

error creating state manager: could not determine the consensus component

The cluster node was started with the default CRDT consensus. The .consensus section of service.json contains only:

  "consensus": {
    "crdt": {
      "cluster_name": "ipfs-cluster",
      "trusted_peers": [
        "*"
      ]
    }
  },

Any ideas?

hector · March 26, 2021, 1:33pm

Are you sure your configuration is where ipfs-cluster-service expects it to be?

Can you run with --debug: ipfs-cluster-service --debug state export.

I don’t think this error can happen with state export and not happen when running with daemon

hbarcelos · March 30, 2021, 4:49pm

Oh, nevermind… it was something wrong with the configuration of the VM.
It worked perfectly in another one I had.
Thank you.

Topic		Replies	Views
'ipfs cluster' sync but 'ipfs' node is not getting values Help go-ipfs , ipfs-cluster , libp2p	4	597	March 29, 2022
IPFS nodes in Cluster are not duplicating pinned data/sharing pins IPFS Cluster	11	1012	November 17, 2020
Error in executing ipfs-cluster-service state export IPFS Cluster	3	866	March 30, 2021
Help Understand core concepts of IPFS Node and IPFS Cluster Help	13	1355	June 18, 2021
Is a node A to node B connection necessary to pin on cluster A-B? Help ipfs-cluster	2	169	March 11, 2024

Sync data from an existing node to the entire cluster

Related topics