Is anyone building an open nostr data set formatted for fine tuning llms? For inspiration: https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1
Not fine tuning, but a spec (granted, untested afaik) for having embeddings as nostr events. Wikifreedia just switched to AsciiDoc so there might be formatting errors - made some edits to the specs I wrote. https://wikifreedia.xyz/nkbip-02/npub1m3xdppkd0njmrqe2ma8a6ys39zvgp5k8u22mev8xsnqp4nh80srqhqa5sf
@cloud fodder check this out