A look at search engines with their own indexes
-
Great article, appreciate that I'm not the only one concerned around some of the ethical choices Kagi has been making.
I got a sub to Kagi a few months ago. It seems pretty good but I'm behind on the news. I've read a few things here and there, but can you explain a bit about the ethics?
-
Looks like for the hardware requirements for self hosting some of the open source options, I'll be saving up quite a bit for SSDs.
FWIW, I gave YaCy a try a while back, and I agree with the article on that one. Shit tier results that make ancient AltaVista look good. Might be fine for intranet search. I like the idea of its distributed hosting, but pass on this one.
Other poster mentioned SearXNG, and while I haven't delved into that too much, it's probably worth a check. Pass on YaCy.
-
FWIW, I gave YaCy a try a while back, and I agree with the article on that one. Shit tier results that make ancient AltaVista look good. Might be fine for intranet search. I like the idea of its distributed hosting, but pass on this one.
Other poster mentioned SearXNG, and while I haven't delved into that too much, it's probably worth a check. Pass on YaCy.
SearXNG is a meta search entirely reliant on other services.
-
I got a sub to Kagi a few months ago. It seems pretty good but I'm behind on the news. I've read a few things here and there, but can you explain a bit about the ethics?
Last I saw they still paid Yandex for access to that index (weigh how important that is yourself), they also pushed back on suicide warnings if you ask Kagi how to kill yourself, and I learned from this article that they may be using additional data sources that contain higher levels of homophobic sentiment.
Basically, the company's tagline is "Humanize the Web", but I don't think their actions thus far show we agree on what Humanize means.
-
This post did not contain any content.
The plural of index is indices.
-
The plural of index is indices.
indexes
is valid: https://www.merriam-webster.com/dictionary/index -
Last I saw they still paid Yandex for access to that index (weigh how important that is yourself), they also pushed back on suicide warnings if you ask Kagi how to kill yourself, and I learned from this article that they may be using additional data sources that contain higher levels of homophobic sentiment.
Basically, the company's tagline is "Humanize the Web", but I don't think their actions thus far show we agree on what Humanize means.
Yeah, definitely something to keep an eye on. Might not renew and just start using Qwant.
-
Consider self-hosting SearXNG, which can aggregate results and filter.
Already am, but it still pulls results from the companies I want to separate myself from. I'd rather see what it takes/how well it performs to have my own indexer.
-
This post did not contain any content.
That was a good read, well done! I'd be interested if you ever reconsider Startpage. I've had good success with that on my work computer.
-
Already am, but it still pulls results from the companies I want to separate myself from. I'd rather see what it takes/how well it performs to have my own indexer.
Good luck with that one. It takes a lot of resources from my limited and aged experience, and I'm sure it's more now. Might be worth focusing the indexer on a topic area to start, just to get a feel for sizing (if your chosen solution supports that).