DeStor Blog | Decentralized Storage News and Updates

IPFS in one word

Written by Mara McMahon | Nov 22, 2024
Learn about IPFS, a groundbreaking decentralized file system that enhances data accessibility and sharing through unique identifiers and peer-to-peer connections.

Imagine being dropped into a room full of tech experts, all speaking a language that feels like it’s from another planet. That’s been my world for a while now. I’m not a techie, but to keep up (and survive!), I’ve had to learn how to translate all that tech-speak into something people like me can understand. And more importantly, into something that shows why it (the tech) matters.

In this mini blog series, I’m zooming in on Filecoin—breaking it down into simple, digestible parts:

  1. IPFS (InterPlanetary File System) – It sounds otherworldly, but trust me, it's not that complicated.
  2. Filecoin – What it does, how it works, and why it’s built on IPFS.
  3. Proof Beats Promises – This is where we get into why Filecoin isn’t just a buzzword, but something that delivers.

By the end of this series, I hope you’ll see that understanding the tech doesn’t require being a tech wizard—it just needs the right translation. Ready to dive in? Let’s do it!

Part 1 - IPFS in one word

What do you mean I cannot make IPFS one word - just because the acronym is three words - Interplanetary File System.  Be that as it may, there is only one word you need to  know - IMMUTABILITY.  Once your data is on IPFS it is immutable - no changes, none, zilch, zero, nada, aught, nil.  Ok you get the idea. Data immutability is a key ingredient (like cocoa in Molten Chocolate Cake) in many recipes created to deliver specific business outcomes.  Let’s take a look at a few:

  • Data compliance: Immutable backups can help ensure that critical data is retained in accordance with legal and regulatory requirements. They can also provide a reliable repository for audit purposes.
  • Data archiving: Immutable data is ideal for long-term data retention because it prevents intentional or accidental modifications.
  • Data analytics: Immutable data can simplify the analytics process by ensuring the stability of the data used for analysis.
  • Ransomware protection: Data immutability can help prevent ransomware events from deleting or altering data.
  • Risk mitigation: Immutable storage can protect an organization's most important information from malicious tampering or accidental deletions.

A little bit about how:

IPFS creates a hash based on the content of the data; and by content, I mean every single space, comma, letter, number, notation, etc.  If even one thing changes - for example, adding a comma, results in a new hash being created.  Immutability means that something can’t be changed, and with IPFS, change equals new hash, new data set (for example a new LLM training data set). The original data set is still there so now there is version control.  Think of the hash like a fingerprint—unique to that file and impossible to replicate. Just as a person’s fingerprint identifies them uniquely and doesn’t change over time, an IPFS hash uniquely identifies a file. If someone tries to alter the file, it would create a new "fingerprint" (hash), so you’ll always know if the file you’re accessing is the original or has been modified.

One of the most important aspects of IPFS is that it’s not controlled by any one entity. Just like Linux, openstack, kubernetes, etc, anyone in the world can contribute to it, either by running a node (which stores and shares files) or by improving the code that powers it. It goes a step further by then enabling data to spread across nodes so no single company controls access to the data.  Depending on the specific needs of a data set, the ability to find and access the data is customizable. With IPFS, no single company or organization has control over the network, making it a more open and resilient system.

This takes data resiliency and durability to a whole new level.

What is IPFS?

Ok - if you are techie - sorry for being simple.  If you are a non-techie (like me) hang in there, it is not complicated - just some of the words sound a bit intimidating.  IPFS stands for InterPlanetary File System. It’s a network of nodes (servers, computers, storage) with peer-to-peer relationships enabling communication and transfer directly between nodes without going through a centralized location (e.g. DNS server which is what http requires).  From a simplified view, three elements make it all work:

  1. Chunking: When a file is added to IPFS, it's divided into smaller chunks, and each chunk is assigned its own fingerprint (hash). This chunking mechanism allows for efficient distribution and retrieval of files across the network.  
  2. Content Identifier (CID): Each block of data is assigned a unique identifier called a Content Identifier (CID), which is unique to the data from which they were computed, which allows IPFS to fetch data based on its content, rather than its location.  
  3. Distributed hash table (DHT): Any user in the network can serve a file by its content address, and other peers in the network can find and request that content from any node who has it using a DHT.  It is like doing a what’s near me on google - gives a list of all the places that have what you are looking for.

Think of all this as the foundation, on top of this software, features are developed and services are created.  Providers building on IPFS within the decentralized storage industry, create specialized services to help data owners (you) achieve specific business outcomes.  Here are few examples:

  • Pinata: Pins or holds data on the IPFS network preventing data deletion through garbage collection.This ensures that data is retained indefinitely and is always accessible. Pinning is useful for a variety of workflows, such as accessing data files from around the world without managing sharing permissions.
  • Temporal: Instead of permanently pinning data, Temporal concentrates on temporarily pinning data, which lowers expenses while maintaining accessibility. Projects like content sharing platforms and event-driven systems that require transient but dependable data storage can benefit greatly from this service.
  • Fleek: The platform prioritizes user-friendliness and collaboration, making it a great option for tasks that call for seamless content delivery and administration

Last but not least, IPFS is not small or particularly new and it is used by media and entertainment companies for content distribution, by healthcare companies for secure and immutable patient records, by large research organizations for immutability and retrievability.  

As more and more humans, companies, and organizations begin to decipher what they actually want to keep (without change and/or corporate/governmental access control), IPFS is a technology that will be employed.  

Read the next blog in the series “Filecoin in one word” to understand the next layer in Filecoin based services.