Skip Navigation

Is there a way to search through posts across all instances/communities?

Hi, I'm relatively new here. Like probably a lot of people, I appended the word "reddit" to most of my searches to get discussions around my interests/news or to find "curated" things. I noticed that it does not work as great with "Lemmy" for example, as the user base is more spread out and smaller in general.

  • I've looked for a search engine for the Lemmy or fediverse with no luck. There exists some to find communities but not to return posts it seems.
  • If it does not exists, is this even something that can be built? Would one need to build a crawler or use an API? I would be very interested to contribute to such a project.
14

You're viewing a single thread.

14 comments
  • Searching the whole Fediverse, literally all of it, 100%, is technically impossible or at least very hard to implement, and if implemented, it'd eat up lots of CPU power and network bandwidth.

    It's simply next to impossible for any instance of any Fediverse project, also for any centralised or decentralised dedicated search engine, to know all instances and all content on it without all instances actively pushing their existence, their status and all their content to the search engine in real-time.

    A search engine that literally covers all of the Fediverse with no exception has to even know about brand-new instances that have just been started a split-second ago. An instance that's so new doesn't even have any connections into the Fediverse yet, probably no content and only one account, the admin account. (Replace "account" with "channel" on Hubzilla and (streams).)

    So if someone spins up a new instance of whatever project, that search feature has to know about that instance immediately before the instance even connects with anything. That is, I'm not sure when that search feature is expected to know about a new Hubzilla hub since ActivityPub is optional per hub and per channel and AFAIK off by default for both: Shall the search feature already know when ActivityPub is still off, and nothing in the Fediverse that isn't Hubzilla or (streams) can connect to it anyway, or shall it only learn about the instance the second that the hub admin turns ActivityPub on?

    And when the admin of a new instance puts out a test post to see if it runs as desired, and the instance still isn't connected to any other instance, the search feature would immediately know that test post so you can find it if it's that what you're looking for.

    Mind you, Google doesn't know everything on the Internet either.

You've viewed 14 comments.