I have a fair amount of experience with data visualization, analysis etc and thought it would be a fun project to try to visualize the Lemmy network, specifically which instances have strong links to one another via subscriptions from users in one to communities in the other.

How/where can I get that data?

EDIT: It sounds like many people would find this a violation of their data privacy and I simply shouldn’t do it. I had thought this kind of data was intended to be entirely accessible by design, but I learned something new!

  • gabe [he/him]A
    link
    20
    edit-2
    11 months ago

    Likely within the API, but I would say be cautious as people on the fediverse really do not like having their data scraped, especially for projects like this.

    • alex [they, il]
      link
      fedilink
      1211 months ago

      I think this is fine - people I’ve seen who objected to this kind of project were more about their account being indexed. Projects like respective size of instances were always fine

      • gabe [he/him]A
        link
        1011 months ago

        Yes, still doesn’t make people fine with it. People defederate over stuff like that on mastodon. It’s often seen as violating peoples privacy.

            • @mookulator@mander.xyzOP
              link
              fedilink
              2
              edit-2
              11 months ago

              What do you mean? I thought it was an explicit feature of this place that literally all of it was public and nobody owns any of the data. Isn’t it just sitting there in the public domain?

              If people know that’s how it works, they can’t get mad if someone does access the data. Especially for innocuous curiosities like this.

              • @Dave@lemmy.nz
                link
                fedilink
                1011 months ago

                The content of lemmy is in public, but is not in the public domain. They are different concepts.

              • @unexpectedteapot@lemmy.ml
                link
                fedilink
                611 months ago

                Your public domain assumption doesn’t have to apply to others, legally or ideologically.

                Data ownership does exist in the Fediverse, in fact it is one of its selling points that you can set up your server and own the data instead of using a surveillance capitalist SaaS that stores, manipulates and imposes legal rights over your data. Applications like Mastodon do send a federation request to other instances to delete data if submitters want to. Additionally, some users put licenses on their profile that might have restrictions (i.e: CC non-commerical, etc.) on what you are legally allowed to do with the data.

                So no, accessing the data is not the same as using or processing it for many people, legally too in several parts of the world. Also, “innocuous curiosities” label is entirely subjective.

              • @PseudoSpock@lemmy.dbzer0.com
                link
                fedilink
                411 months ago

                I think the more commonly held belief is that the data, while unfortunately available right now, was going to become more secure in the near future; but the exodus from reddit happened too soon. So now there is a lot of that data, and the better management and protection of it hasn’t had time enough to happen.

                In comes you, seeing the opportunity, and you seek to exploit it.

                  • @PseudoSpock@lemmy.dbzer0.com
                    link
                    fedilink
                    111 months ago

                    Well, I am not any official representative of Lemmy. My belief in what the plans are or were are based on things I’ve read with also no citation. Best if you were to find out from the Lemmy development team and from Lemmy instance hosters. They’re the ones with the authority, history, and investment that can accurately tell you what’s what.