From @markg85 on Tue Aug 02 2016 09:46:06 GMT+0000 (UTC)
Hi,
I just know about IPFS for about a day, everything seems really neat. Specifically the ability to mount an IPFS site just like any local mount. Thatās in my opinion a great feature!
Having seen some videos and read some stuff about it, iām still left with a couple open questions. Iām sure most of them are asked already, but i apparently lack the search skills to find them
So here we go.
- If you visit an IPFS site, will you essentially become a peer for the information you requested?
If IPFS follows P2P (which i think it does) then visiting an IPFS site makes you a peer. Next visitors could reach your computer and get the files required to view the site. That principle is fine by me. But iām slightly puzzled about for example large files. Imagine i downloaded a docker image via IPFS, will another user download it from me then? Since if thatās the case then i could be getting bandwidth issues since it would suck my bandwidth up completely.
- How would you make a site truly decentralized?
If i look at ipfs.pics itās only partially decentralized. The storage of the pictures is fully P2P, but the MySQL database it needs is still very much centralized. How would one make a service like that, but fully decentralized? Iām guessing one would need a decentralized database for that as well, like https://github.com/rqlite/rqlite, but how would one install that in IPFS? But even if it somehow magically gets connected to IPFS, how would you make it secure? By secure i mean a database used by Site X should only be accessible by Site X. Or would this involve public/private key signing of the database which only Site X can unlock? If that is the case, then where would the private key be stored (which would be needed to unlock the database)? Lots of sub questions here
- The concept of IPNS, a bit fuzzy.
Imagine this. A populair website or service, accessible by name (thus using IPNS), but as far as i understood it IPNS points to one hash. Basically a readable alias for a hash (or is that oversimplifying?). But what would happen if the hash it points to is for whatever reason not accessible? Would that make the site unavailable? Or does the name magically know (how?) which peers it has available to try and just send data back from another peer? For a user, the site would seem online.
- Is IPNS searchable?
With āthe regular webā we have Google (among dozens of other search engines) to find a site. Sites in IPFS are probably indexed just fine as well if there is some public link somewhere, but what about the other stuff that isnāt linked? In āthe regular webā there is the concept of the ādeep webā which consists of sites that are online, but no links to exist thus search engines canāt find them. With IPNS everything is essentially in the DHT so doesnāt that make it possible to index every IPFS site? And does something like that exist? Or is there a (technical) reason why this isnāt possible?
- Site āmirrorsā?
From what i understand, there currently is no concept of mirroring a site in IPFS, but i wonder if a āmirrorā is the right term anyway. Iāve seen something about pointers to hashes where a new version of something would just change the pointer. So i guess the question is something like this. How would one mirror the site content and let the pointers know (or the DHT structure) that all/some data is available at some other locations (hash)? This would for instance be vital to big sites that mainly rely on having their data in multiple data centers as mirrors, as primary means of site hosting and load balancing. A secondary place would be the P2P structure where individual users host individual pieces of the site. But i donāt think what i describe here is possible (yet?)? Or iām wrong
- Getting notified about pointer updates.
This is basically an extension of the question above. If you host something and it becomes outdated. For instance because a news article is posted or a new video or whatever your site is about. How would one - that wants to āmirrorā your site - gets notified about a change? If IPFS has the concept of pointers, does it also have the concept of push notifications or publish/subscribe? Iād imagine it would just be a list of hashes (the ones subscribed) that need to be ānotifiedā or āpingedā when the thing they point to isnāt the latest version anymore. Itās then op to the receiving side of that āpingā to act upon it.
- Real time collaborative editing in IPFS possible?
The presentations about IPFS claim low latency and tons of benefits, iām sure thatās all true to some extend, but what about collaborative editing of documents? Is that possible? If so, how would it work? Is there a direct connection between all connected peers that want to edit something? Even then, how is the diff being made and how are conflicts resolved? On the other hand, this might be an application specific issue, not for IPFS to solve.
I keep it at those 7 questions for now
Cheers,
Mark
Copied from original issue: https://github.com/ipfs/faq/issues/154
From @whyrusleeping on Tue Aug 02 2016 11:40:13 GMT+0000 (UTC)
Hey Mark, Thanks for the great questions, iāll try to answer them with as much detail as you asked them with
- Fetching content via ipfs temporarily caches it on your node. Currently, temporarily means āuntil a garbage collection is runā and garbage collection is at this time a manual process (you need to run
ipfs repo gc
). That means that yes, your node may be asked for content it has by other peers on the network. Bandwidth limitations are on our TODO list.
- Its actually quite easy to make a good number of websites ātruly decentralizedā. Anything that doesnt require a backend server is trivially decentralizable. The difficult part (as you mention) comes when you need to have a database or some sort of backend logic. These scenarios are still possible with ipfs, but most of that work is still somewhat āin progressā. For some examples, check out orbit and orbit-db. Orbit currently depends on a ācentralizedā pubsub server, but a replacement system using only ipfs is in the works.
- IPNS is a system that uses PKI to create and verify name records. It maps a consistent public key hash to a changeable IPFS hash. The owner (or owners) of the keypair are able to change the value pointed to by this entry. If the hash pointed to is unavailable, the page would be unavailable.
- IPFS should be pretty searchable. Iāve thought about the problem a few times and think that the most interesting way to start indexing all of ipfs would be to put a number of nodes out on the network such that they cover a significant portion of the kademlia keyspace. In doing this, the nodes you control should be able to see provider records for the majority of content on the network as it is created and requested. Once you know the hashes of all the content, you can go through and request and index objects are you like. This is all pretty easy to do in a centralized way, but think about doing it in a completely distributed manner
- All you have to do to āmirrorā a site in ipfs is to visit it. If you really want to mirror it permanently then do an
ipfs pin add
on it and your node will not stop mirroring it even after a garbage collection is run.
- This is one of those harder problems weāre working on. Take a look at some of the discussion here: https://github.com/ipfs/notes/issues/148
- For collaborative editing type applications iām going to refer you back to orbit. A āgoogle docsā clone should be doable with orbit-db pretty easily
Thanks!
From @markg85 on Tue Aug 02 2016 12:08:45 GMT+0000 (UTC)
Hi @whyrusleeping,
All sounds clear to me, but i keep having questions about point 3.
The way you describe it sounds like there still is a single point of failure. Even if the site is backed by thousands of mirrors. The main entry point seems to be from name/certificate -> single hash. If that hash is offline, the site is offline. Sure, the owner can make it point to another hash, but that seems like a manual task that can (right?) be easily automated.
I might also be misunderstanding somethingā¦
Could you elaborate on this please? It sounds very interesting!
From @whyrusleeping on Tue Aug 02 2016 12:56:34 GMT+0000 (UTC)
For a hash (site) to be āofflineā all the peers that have mirrored it will also have to go offline.
From @Ghoughpteighbteau on Tue Aug 02 2016 17:49:18 GMT+0000 (UTC)
@markg85
Generally speaking, you would think someone who is capable of resolving an IPNS name would also be hosting the IPFS content. I guess thatās not technically necessary.
Also! Generally speaking itās very easy to rehost things on IPFS if you have the file. If I add a kitten gif, and you add the same kitten gif, it resolves to the same address.
and IPNS is pretty easy to automate, all you have to do is run this command:
~ $ ipfs name publish /ipfs/QmNwoE1vkQeEwY3dyDdK4uyaYpm2GYTUn68mqkf4kdvXcn
Published to QmRzYFGy9M5CyjEwNdh62udgBZV6BGbNZec8gMB9mXFhX6: /ipfs/QmNwoE1vkQeEwY3dyDdK4uyaYpm2GYTUn68mqkf4kdvXcn
~ $ ipfs name resolve QmRzYFGy9M5CyjEwNdh62udgBZV6BGbNZec8gMB9mXFhX6
/ipfs/QmNwoE1vkQeEwY3dyDdK4uyaYpm2GYTUn68mqkf4kdvXcn
https://ipfs.io/ipns/QmRzYFGy9M5CyjEwNdh62udgBZV6BGbNZec8gMB9mXFhX6
Also, I can now kill my daemon (so that I no longer answer IPNS requests) and that link will still be resolved. My understanding is that ipfs.io will be able to resolve the IPNS address for me, even though Iām not online, for another 24 hours.
Just for science sake I did this:
~ $ ipfs name publish -t 336h /ipfs/QmNwoE1vkQeEwY3dyDdK4uyaYpm2GYTUn68mqkf4kdvXcn
Published to QmRzYFGy9M5CyjEwNdh62udgBZV6BGbNZec8gMB9mXFhX6: /ipfs/QmNwoE1vkQeEwY3dyDdK4uyaYpm2GYTUn68mqkf4kdvXcn
That should mean other people can resolve the IPNS address for another 2 weeks? Wonder how long the gateway hangs on to contentā¦
From @whyrusleeping on Tue Aug 02 2016 22:31:50 GMT+0000 (UTC)
@Ghoughpteighbteau Thats all mostly correct.
That should mean other people can resolve the IPNS address for another 2 weeks?
Close! That actually means that the record itself is valid for two weeks. The network will only hold onto records for at most 36 hours before they are discarded, but other nodes can put
the same exact signed record out to the dht again for up to two weeks to keep the record resolveable.
From @Ghoughpteighbteau on Wed Aug 03 2016 00:39:15 GMT+0000 (UTC)
hmmmm. In what circumstances will put
s occur?
From @Ghoughpteighbteau on Thu Aug 04 2016 19:01:43 GMT+0000 (UTC)
Still resolving two days later
though, it took ~ 1 minute to resolve
From @Kubuxu on Thu Aug 04 2016 19:10:20 GMT+0000 (UTC)
It is because only very few nodes will have this record after two days.
Meaning you have to connect to many nodes in the key space bucket to find someone that has the record. DHT record republishing is hard to get right and as we plan replacing IPNS with IPRS (Record System) long lived record wasnāt focus for us.
What I can say is that we plan improving both get and put performance of IPNS.
From @jbenet on Thu Aug 04 2016 20:58:22 GMT+0000 (UTC)
IPNS resolution will get way better with pubsub improvements coming.
On Thu, Aug 4, 2016 at 15:10 Jakub Sztandera notifications@github.com
wrote:
It is because only very few nodes will have this record after two days.
Meaning you have to connect to many nodes in the key space bucket to find
someone that has the record. DHT record republishing is hard to get right
and as we plan replacing IPNS with IPRS (Record System) long lived record
wasnāt focus for us.
What I can say is that we plan improving both get and put performance of
IPNS.
ā
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
Questions after first learning about IPFS. Ā· Issue #154 Ā· ipfs-inactive/faq Ā· GitHub, or mute
the thread
Sign in to GitHub Ā· GitHub
.
From @markg85 on Thu Aug 04 2016 21:23:26 GMT+0000 (UTC)
@jbenet is there any ETA for that?
From @markg85 on Tue Aug 09 2016 22:04:16 GMT+0000 (UTC)
A few more question for site principles that i donāt see possible, but i hope it is and that itās merely my lack of knowledge about ipfs that makes me think āitās impossibleā.
(i just continue with the numbering from the first post)
- Imagine a site that aggregates data (a feed parser would be a perfect example). Youād want to download and fetch all the feeds on āthe serverā, combine all fetched items to sort them by date and present the data sorted by date to the user. In a āserver - clientā setup like the normal weg, this is easy. I donāt see how that is possible in an ipfs setup (where there is no concept of a server, right?).
- This is a continuation of the question above. Imagine the case where you want to periodically run a task. For instance to fetch those feeds in the earlier example and pre-parse the results. How would such a thing (cronjobs basically) be done with ipfs?
From @Ghoughpteighbteau on Wed Aug 10 2016 18:54:41 GMT+0000 (UTC)
Well, IPFS as a protocol doesnāt let you think in a centralized way. Bitcoin does, thatās why bitcoin was such a splash, because it managed to make decentralization act like a centralized system. It pulled off distributed consensus.
If youāre planning to describe a service that downloads a bunch of feeds from different websites and distributes those feeds to itās clientele, then the question is: Who downloads those feeds?
- In a centralized system you have just one server that every client connects to, the server obviously does all the work
- In a distributed system you have multiple servers that clients connect to, the servers work together and distribute their results between eachother.
- In a decentralized system (like IPFS) then the only people left to do the work are the clients. So the clients must make the requests to the feeds, and share their work with the network.
The problem then becomes: how do you trust the clients to do the job? They could lie, theyāre no longer your servers. They could lie and say: āOh yah, 99% invisible totally released a whole bunch of Herbal Supplement pill ads. Totally. You should buy some.ā
The only way I can see to do this is to establish a network of trust. I download some feeds, you download some feeds, we share our data between eachother because we know weāre not cheating. You add someone you trust, I add someone I trust, our network grows. That kinda thing. A system like this is pretty easy to describe with IPFS, though, the details can be a little complicated.
Regarding point 2: You could literally have cronjobs do it. Itās pretty easy to interact with IPFS whether it be through an actual desktop application, or the browser. That said, itās always going to be on the users whim, no getting around that. You canāt force people to work for the network.