Oddbean new post about | logout
 Running servers and hosting a small collection of personal images isn't expensive, people do it all the time with wordpress and other blogging platforms.

The purpose of blossom is to use the sha256 hash of those files in the url. that way you can verify the file hasn't been tampered with. and if its no longer available from the original server, a client can ask other servers since the hash is universal

Also since sha256 is a universal ID for a file. it allows you to re-upload the exact file if it was take down or your moving servers. most apps that allow image uploads (like discord etc) don't even have a concept of re-uploading an image

also why do you tag fiatjaf on every other post? 
 instead of using a hash to find images you should use a lower resolution version of same image to find the full resolution version.  even Bing image search has a feature where it will automatically show you different "image sizes" for the same image, for example the following are different sizes grouped by Bing

https://i.pinimg.com/originals/d0/0d/59/d00d5967e65b05124e5d277a7c6fb83e.jpg

https://i.pinimg.com/736x/0d/b9/44/0db944ce8f8e469988d35cda0eee9501.jpg

a truly smart system wouldn't even save the image but simply find the highest quality version of it on the web and save just the address.

i tag @fiatjaf because he appears to capable of understanding my brilliant ideas. 
 the smart thing to do would be to save a 100 x 100 pixel "avatar" version of an image to act as a checksum plus the addresses of five highest resolution matches of that image found online.  then display the highest res version that loads first and matches the avatar visually.  almost every image posted on Nostr was originally downloaded from somewhere online where it is already hosted for free.  those images which are original can be marked as such by user, but frankly the user should just host those himself.

there is no need to save 10 different copies of an image every time somebody posts a picture of Putin riding a bear. 
 Its not about all about images and media though. There are tons of other files that need to be hosted and re-uploaded in their original form

IMO a big shortcoming of most hosting services is they only focus on the media, as if that where the only things the internet was made of

Also, brilliant ideas? If I ever claimed to have brilliant ideas I would end up eating my words within a month 
 if a file is treated as a black box there is only so much we can do.  if we can open the file and understand what it contains we can do more.  for example we can see that a word document and pdf document may actually be the same document.  this is obviously not going to be practical for some exotic file types as a computer can't be expected to recognize them all.  but my guess is over 90% of all files uploaded are JPEG and nobody says we can't adopt different approaches to different file types.

i bring all this up because recently GAB was in the news for restricting file uploads only to paying members.  GAB was banned by credit card processors making it harder for them to make ends meet financially.  any anti-establishment social media should expect the same fate of getting cut off from mainstream banking channels and looking for ways to cut costs.

these costs may be low now, but assuming NOSTR grows they could grow exponentially - might as well start thinking about reducing them now.

text and images are fundamentally different.  text takes a lot of energy to create but little to store or transfer, thus decentralized redundant approach is right for text.  images are the opposite.  one size does not fit all.

@fiatjaf  
 sorry by "avatar" i meant "thumbnail" ... 

so for any image you save the exact pixel dimensions for example "1200x800" then your hash, then a THUMBNAIL and links to five instances of that image found online.

if any image matches the hash you use that, if not then you use one that matches dimensions, if not then just the thumbnail etc ...

or something like that ... 
 In a world of AI image generation I don't want a central service that decides that an image is "close enough" to the original. I want exactly the original image that the author intended. Cryptographically Identified by the hash. 
 you remind me of people in early days of Mp3 who said the quality wasn't good enough because the compression was lossy.

the same folks also said that you need $10,000 cables to get good sound quality.

and then blind tests revealed none of them could hear a difference between a $10,000 cable and a steel wire clothes hanger used as a cable.

does anybody still buy compact discs or use FLAC files ? 
 This is the main reason for the sha256 hash being in the URL. that way if you include it in a note and sign the note you are signing the hash and there for proving which version you wanted to post