Is anyone building an open Nostr dataset formatted for fine-tuning LLMs? For inspiration: https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1