So when running a Linux desktop, what does one do to avoid these search engines? I use DuckDuckGo for the moment, but if love some alternatve that isnopen source too, self hosted if need be, and federated would be awesome
Searx is pretty great, although I wouldn't recommend self-hosting it. Just use one of the public instances like Searx Belgium because it is harder to fingerprint you off of your searches since a lot of people also do their searches on the same instance.
Searxng throws together results from different engines as far as I know.
Not sure how a federated search engine would work though.
Edit: hash0772 (is there a correct way address someone's username on Lemmy?) already mentioned this but it's generally best to use an existing instance, there's also some on tor. (they obviously still only search the clearnet)