A question about CID hash

genghis · January 6, 2025, 11:22am

Note: If CID bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy is CIDv2, then please ignore this question.

Greetings!

I am reading an article about IPFS and I have downloaded this file bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy.

Because this CID refers to a single file, I expected it to contain the hash of the file itself. Yet, assuming this is a CIDv1, the hash in the CID and the checksum of the file appear to differ.

$ ipfs get bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy
Saving file(s) to bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy
 155.88 KiB / 155.88 KiB [=============================================================] 100.00% 0s
$ sha1sum bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy
4effb299ca044c1efa1279038b33454dd91a8024  bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy
$ sha256sum bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy
1abced61e25c1ddcd5e52e1f7171e7f352293eaf52344713f07dd0f707ca717e  bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy

If this CID is a CIDv1, then why do the hashes differ?

References:

https://norman.life/posts/ipfs-bittorrent
An IPFS guide for newbies? - #3 by danieln
ipfs://bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy

danieln · January 6, 2025, 11:53am

Hey there,

bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy is a CIDv1. There’s no such thing as CIDv2. When in doubt, use the CID Inspector.
The CID uses sha2-256 as its hashing algorithm, so sha256sum is the right command for computing the matching hash.
The CID is encoded as a string using base32, whereas the output of sha256sum is in hex (base16).
The hashes do in fact match. If you look in the CID Inspector, you will see that the output from sha256sum matches the digest (hex) shown there.
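The points above can also be checked from a bash console. The following is a minimal sketch rather than an official tool: it assumes GNU coreutils (base32, od, tr) and that every header field of the CID fits in a single byte, which holds for this CID but not for every possible CID.

```shell
#!/usr/bin/env bash
# Sketch: split a base32 CIDv1 string into its header fields and digest.
# Assumptions: GNU coreutils is installed and each header field is a
# single-byte varint (true for this CID, not guaranteed in general).

cid=bafkreia2xtwwdys4dxonlzjod5yxdz7tkiut5l2sgrdrh4d52d3qpstrpy

# Drop the leading "b" (multibase prefix for lowercase base32) and
# uppercase the rest, because base32(1) expects the RFC 4648 alphabet.
body=$(printf '%s' "${cid#b}" | tr 'a-z' 'A-Z')

# CIDs omit base32 padding; restore it so the decoder accepts the input.
while [ $(( ${#body} % 8 )) -ne 0 ]; do body="${body}="; done

# Decode to bytes and print them as one continuous hex string.
hex=$(printf '%s' "$body" | base32 -d | od -An -v -tx1 | tr -d ' \n')

version=${hex:0:2}    # 01 = CIDv1
codec=${hex:2:2}      # 55 = raw, 70 = dag-pb
hash_fn=${hex:4:2}    # 12 = sha2-256
hash_len=${hex:6:2}   # 20 = 32 bytes (hex notation)
digest=${hex:8}       # everything after the four header bytes

echo "version:  $version"
echo "codec:    $codec"
echo "hash fn:  $hash_fn"
echo "hash len: $hash_len"
echo "digest:   $digest"
```

The digest line can then be compared by eye with the sha256sum output from the first post.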

genghis · January 8, 2025, 3:17pm

Daniel, could you please write commands that can be executed in a bash console?

I want to analyse CIDs and understand them better, so that I can suggest improvements and ideas for IPFS.

zacharywhitley · January 9, 2025, 12:30pm

I don’t think the assumption that the CID is the hash of the file is correct in general. If the file is larger than the maximum block size, the CID will be the root of a tree with multiple blocks. I believe the only case where it would be as you assumed is if the file is smaller than the maximum block size and was added with “raw leaves”, and even then I’m not sure it would be the case. I think you’d still have a root node referring to the single raw leaf. In that case you’d easily be able to retrieve the CID of the single node and get what you’re looking for.
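One way to see which of these cases applies to a given file is to try rebuilding the CID from the file's SHA-256 digest alone. The sketch below assumes a CIDv1 with the raw codec (0x55) and sha2-256 (0x12, length 0x20), uses the digest posted at the top of the thread, and relies on GNU base32; the variable names are illustrative only.

```shell
#!/usr/bin/env bash
# Sketch: rebuild a CIDv1 (raw codec, sha2-256) from a SHA-256 hex digest.
# Assumption: the file is stored as one raw block, so the digest in the
# CID is simply the SHA-256 of the file's bytes.

# sha256sum output taken from the first post in this thread.
digest=1abced61e25c1ddcd5e52e1f7171e7f352293eaf52344713f07dd0f707ca717e

# Header bytes: 01 = CIDv1, 55 = raw, 12 = sha2-256, 20 = 32-byte digest.
hex="01551220${digest}"

# Turn the hex string into bytes, base32-encode them, then lowercase the
# result and strip the padding, as CID strings do.
encoded=$(
  for (( i = 0; i < ${#hex}; i += 2 )); do
    printf '%b' "\\x${hex:i:2}"
  done | base32 -w0 | tr 'A-Z' 'a-z' | tr -d '='
)

# Prepend the multibase prefix "b" (lowercase base32).
rebuilt="b${encoded}"
echo "$rebuilt"
```

If the printed string equals the CID you downloaded, the CID addresses the file's bytes directly. If it differs, the file was most likely chunked or wrapped in a root node, and the CID's codec field would then not be raw.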

danieln · January 20, 2025, 8:25am

I suggest taking a look at https://dag.ipfs.tech/ as another tool to see how the Merkle DAG is constructed.
