I’m building a dApp that needs to scrape NFT metadata across potentially large collections (let’s set a baseline at ~10k items) as fast as possible (aiming for sub-30 s). The assumption is that this metadata is stored on IPFS. While the number of files is large, the total size is only a handful of MBs.
As a first step, I set up a single IPFS node on AWS, but found that `ipfs get` calls are prohibitively slow (>3-5 min for a 10k collection).
As a fallback, I plan to scale horizontally (say 5-10 nodes) and place the IPFS nodes behind a load balancer. However, even with this approach, retrieval times degrade worse than linearly once I get into higher file counts (5k+).
Questions:
1. Is there anything glaringly obvious that I’m missing? I’m new to the dev side of IPFS, so I wouldn’t be surprised if I’m just doing something entirely wrong.
2. Is there something I can do to make `ipfs get` faster, e.g. some configuration?
3. Which hardware specs influence an IPFS gateway’s performance the most: CPU, memory, network I/O? I’ve played around with a few different AWS instance types and found similar results.
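To make question 2 concrete, these are the kinds of settings I mean, e.g. the connection manager and routing sections of the Kubo config file. The values below are illustrative guesses, not tuned recommendations:

```json
{
  "Swarm": {
    "ConnMgr": {
      "Type": "basic",
      "LowWater": 600,
      "HighWater": 900,
      "GracePeriod": "20s"
    }
  },
  "Routing": {
    "Type": "dhtclient"
  }
}
```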
Any help would be greatly appreciated
Thanks in advance
That’s 18 milliseconds per retrieval. That’s pretty fast, isn’t it, considering the network latency and the fact that each call is at minimum one round trip?
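Put another way, 18 ms per file only hurts because the retrievals are serialized; the same per-request latency amortizes once requests overlap. Illustrative arithmetic only (the concurrency level is an assumption):

```python
# Back-of-envelope: serial vs. concurrent wall-clock time for 10,000
# small files at an assumed 18 ms per-request latency.
n_files = 10_000
latency_s = 0.018                 # ~18 ms per retrieval

serial_s = n_files * latency_s    # one request at a time: 180 s
concurrent_s = serial_s / 64      # 64 requests in flight: ~2.8 s
print(round(serial_s), round(concurrent_s, 2))  # → 180 2.81
```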
For context: with the same infrastructure, I can retrieve 10k metadata files hosted on a normal HTTP server in <7 s, and those are also individual file retrievals, not a glob.