Regarding efficiency and completeness of findProviders API

rohitiagon · July 24, 2018, 9:53pm

How does the findProviders API in libp2p is resolving the list of nodes (providers) that provide a given content. Does the underlying search involve querying all the nodes in the network, or is there a node that maintains a dictionary of all the providers for that content. Or is the data returned based on some local cached copy, so not ALL the providers in the network for that content are guaranteed to be returned.

Thanks!

stebalien · July 25, 2018, 1:59am

We use a DHT (like bittorrent). When a peer provides a block, it sticks a provider record on the DHT server responsible for that block.

Note: we are working on sending out fewer provider records (e.g., only announcing roots of files, roots of directory trees, etc) as announcing provider records for every chunk is a massive bottleneck for us.

rohitiagon · July 26, 2018, 2:07pm

Thank you @stebalien. Few other questions -

Is there an API available that can be called by a node to dump a list of all the provider blocks that it is storing.
Is there an issue if the node (DHT server) that is storing provider blocks is behind NAT
Is there a way to associate TTL with the provider block being stored on the DHT server.
Does a node storing the provider block on the DHT server has to keep checking of the DHT server is alive, and republish if needed.

Thanks
Rohit

stebalien · July 27, 2018, 8:15pm

Is there an API available that can be called by a node to dump a list of all the provider blocks that it is storing.

Not that I know of. Note: DHT nodes don’t store the blocks, just records of where they’re stored.

Is there an issue if the node (DHT server) that is storing provider blocks is behind NAT

Yes. However, we generally write provider blocks to multiple nodes to reduce these issues. I’d like to have nodes only promote themselves to full DHT nodes after some period of uptime/reachability but we don’t currently do that.

Is there a way to associate TTL with the provider block being stored on the DHT server.

Provider records last 24 hours by default.

Does a node storing the provider block on the DHT server has to keep checking of the DHT server is alive, and republish if needed.

No, we just reprovide every few hours anyways.

rohitiagon · August 6, 2018, 5:26am

Thanks. One other question - given a Kademlia id which API I can call to determine the corresponding node that is responsible for the id. Basically I want to know the exact node which stores the key-value mapping when the put(key, value) API is called. Thanks

rohitiagon · August 6, 2018, 6:30am

Will the findPeer() API of libp2p return the successor node for any provided Kademlia id.

stebalien · August 7, 2018, 4:16am

That’ll find the exact peer (used when connecting to the peer in question). The method you’re looking for is probably getClosestPeers. That’ll return an array of peer IDs close to the key in question.

Topic		Replies	Views
[LibP2P] Support for finding a node that should handle a key? Help libp2p , dht	0	643	December 21, 2018
`ipfs dht findprovs` doesn't find local node for hash I just got Kubo go-ipfs	2	981	September 6, 2019
Can a node alturistically store files? Help files	3	261	September 3, 2023
IPFS seems to (also) store Provider Records in far distant nodes. Why? Help	1	168	December 8, 2022
Finding a known provider Help	0	495	September 23, 2017

Regarding efficiency and completeness of findProviders API

Related topics