A great use for reddit is the ability to search posts and opinions about any niche topic. Will that be possible with Lemmy as it grows? Will I be able to Google “instant rice Lemmy” and get a comprehensive tier list of each brand?

I imagine search engines will have trouble with all the different instances(?). EDIT: Especially with instances that don’t have Lemmy in their name, I don’t think search engines would return them for Lemmy searches?

  • marsara9@lemmy.world
    link
    fedilink
    arrow-up
    59
    ·
    1 year ago

    So I’ve been working on a solution for this.

    As I see it Google and others are going to have a hard if not impossible time to incorporate the fediverse, and the fact that the same content can exist on multiple servers.

    So I’m working on a search engine specifically build, for Lemmy at least. Where it’ll take you to whatever your preferred instance is when tapping on a search result.

    I hope to have a MVP up and running in a few more days.

    • mookulator@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      1 year ago

      Can’t emphasize enough how important this is for the growth of Lemmy. Many people I know only access Reddit through google searches.

      • marsara9@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        ·
        1 year ago

        Yep and I’m one of them. Go look me up on Reddit and I think I have maybe 20 posts over the 14+ years I was on the site. …joined Lemmy and immediately got frustrated that I couldn’t find anything. So I figured I take a crack at it. Especially since I couldn’t see how Google would ever be able to link me to my instance. Let alone make it easy to search the entire fediverse without having to write out every possible site, with new ones popping up every day.

    • QuinicV@lemmy.worldOP
      link
      fedilink
      arrow-up
      3
      ·
      1 year ago

      Interesting. I hadn’t even thought about how the fact that instance1.[post] and instance2.[post@instance1] is essentially the same thing and how search engines would handle it. Interested in what you come up with!

      • marsara9@lemmy.world
        link
        fedilink
        arrow-up
        6
        ·
        1 year ago

        Thanks. If you do some digging you can find the project on GitHub but note that it’s a work in progress still. The UI is lacking and it’s rough around the edges but it’s “working”. And I still need to do some optimizations on the crawler itself, etc…

        It’s also going to be completely self-hostable just like Lemmy, etc…

    • Doll_Tow_Jet-ski@kbin.social
      link
      fedilink
      arrow-up
      2
      ·
      1 year ago

      That is great. Thanks for the initiative. Have you considered contacting the people at DuckDuckGo so that that search engine can access Lemmy/Kbin content?

    • silly goose meekah@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      1 year ago

      IDK, isn’t it the same for reddit? It also encourages crossposting, so the same content is on there several times. Maybe I don’t understand the fediverse well enough yet, so please correct me if I’m wrong.

      • 1st@kbin.social
        link
        fedilink
        arrow-up
        1
        ·
        edit-2
        1 year ago

        On reddit you may have the same post twice but the comments will be different. On Lemmy, you have the same post on every federated lemmy with the same comments on all of them. With the way google handles websites right now, if they started including Lemmy instances in their web, it end up having hundreds of the exact same result each hosted on a different lemmy instance.

        Edit for clarity: All lemmy sites share their data with each other unless they explicitly stop doing so (defederating). This is why I can respond to your comment even though I’m on kbin.social and you’re on lemmy.world

    • PierreKanazawa@fedia.io
      link
      fedilink
      arrow-up
      1
      ·
      1 year ago

      Tried it, pretty cool, though seems still depend on search engines’ indexing. The instance that I’m on seems not indexed.

      Also it’s interesting it uses intext to identify whether the results are from fediverse.

  • OsakaWilson@lemmy.world
    link
    fedilink
    arrow-up
    10
    ·
    1 year ago

    Digg.com was the big thing with Reddit trailing. Digg began tweaking the experience toward a more profitable model. I had already come to Reddit when they went too far and there was a sudden enormous migration from Digg to Reddit. Digg went from being THE social media aggregator to being nothing in a matter of weeks.

    Reddit is more deeply rooted, so I think it will stick around, I’m cool if Reddit keeps those who are happy with corporate model busy so we can do our thing here.

  • krigo666@lemmy.world
    link
    fedilink
    arrow-up
    8
    ·
    1 year ago

    I think it is preferable to ask other search engines like DuckDuckGo to index Lemmy info. Google is full of garbage.

  • Kururin@lemmy.ml
    link
    fedilink
    English
    arrow-up
    8
    ·
    1 year ago

    It’s up to the individual instance owner and Lemmy the software itself enabling SEO. It’s just getting started now so it will be long time before that.

  • Draconic NEO@lemmy.world
    link
    fedilink
    arrow-up
    5
    ·
    1 year ago

    In the future they eventually might be, for some instances. Though definitely not for all of them, since some of the instances might disable indexing.

    I’ve actually already seen a few Lemmy results (lemmy.ml) in Google searches, the trouble is it doesn’t link to individual posts, just the community so it’s not particularly useful. So it definitely is possible, just needs to be improved to be able to index posts.

  • ghariksforge@lemmy.world
    link
    fedilink
    arrow-up
    5
    ·
    1 year ago

    Depends on Google. These tech companies don’t like new platforms, especially those competing with established ones like Reddit. You’ll see that Google often discriminates against Lemmy or Mastodon.

  • RIotingPacifist@lemmy.world
    link
    fedilink
    arrow-up
    3
    ·
    1 year ago

    Respectfully: Fuck that.

    If you want to find the best instant rice recommendations on Lemmy, Lemmy should have a functional post search function, rather than me relying on a malevolent corporate entity like google to index all the content.

    Search has gone to shit as the Internet has embraced social media sites, an upside of this is that wikipedia+Lemmy+key word search, mayas accurate as asking Google Bard or bing, and they can be built on entirety open tech.

    • Dr. Moose@lemmy.world
      link
      fedilink
      arrow-up
      4
      ·
      edit-2
      1 year ago

      Cool rage but you dismissing search indexing is kinda hilarious. It’s not going away and it’s what makes the web. Would you rather have 3 big websites instead of indexed web?

      • RIotingPacifist@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        1 year ago

        Would you rather have 3 big websites instead of indexed web?

        That’s what we already have, I’d you need to find stuff by doing site specific googles, both google & that site have failed.

        The web is dead, it’s been dead for a while, now is the time to build something new in it’s wake that rather than depending on closed source algorithms, indexing 3 big websites, we could just search the 3 big websites directly.

        • PlexSheep@feddit.de
          link
          fedilink
          arrow-up
          1
          ·
          1 year ago

          I disagree. I’m not sure why you say that. I Google stuff as a job and it’s certainly not just the big 3 websites. I personally rely on selfhosted searxng.

  • static@kbin.social
    link
    fedilink
    arrow-up
    3
    ·
    1 year ago

    Reddit did not start out as the thing to google, it’s 15+ years old, only in the last 5y I started prefixing my google searches with reddit.

    • BrerChicken @lemmy.world
      link
      fedilink
      arrow-up
      4
      ·
      1 year ago

      I actually found Reddit by googling things. I had seen it 5 or 6 times over a few years, and eventually I just went to the main site. I might have even used Reddit in the search before I joined. Regardless, I had recognized that all the best answers for tricky problems that I had were coming from Reddit before I even joined 11 years ago.

    • Fondots@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      1 year ago

      Everyone’s experience on this will be different, but I personally started using reddit about 12 years or so ago largely because at that point a lot of my Google searches were already pointing me towards reddit. I wasn’t necessarily going to google specifically to find reddit results, but since that’s where I kept ending up i figured I might as well go straight to reddit. And since reddit’s search function is and has always been trash, i pretty much immediately started using Google to search reddit.

    • QuinicV@lemmy.worldOP
      link
      fedilink
      arrow-up
      1
      ·
      1 year ago

      I realize Lemmy needs to get much bigger for that to happen. My question was more directed at how search engines would handle the fediverse. Though I see now that that wasn’t very clear.

  • WhipTheLlama@lemmy.world
    link
    fedilink
    arrow-up
    3
    ·
    1 year ago

    If Lemmy becomes a source of enough information like Reddit is, search engines will index it. SEO is a marketing thing, and a place like Lemmy doesn’t really need that. Google, DDG, etc. all put engineering effort into making sure sites with lots of information are indexed and available in their search results.

    • Ben@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      1 year ago

      I searched “3 days no poop meme” and I saw 2 from Reddit on the first page (via qwant),

      No Lemmy.

        • Ben@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          1 year ago

          How many of the millions of people doing searches will add Lemmy specifically? This only works if you wish to specifically search Lemmy, for which I’d suggest adding (intext:"modlog" & "instances" & "docs" & "code" & "join lemmy" | intext:"powered by kbin") to a Whoogle search would do better.

        • balls_expert@lemmy.blahaj.zone
          link
          fedilink
          arrow-up
          1
          ·
          1 year ago

          Like xmpp

          Make a giant instance, get all the content there by pumping users and making cool shit, slowly customize your instance and extend the protocol with features so that ours become incompatible in annoying ways

          Add wikis, overhaul user profiles, achievements, posting to your own profile, games, whatever, then get tired of supporting the fediverse interface and shut off the API

          You instantly can’t read 95% of your subscribed /f/s

    • 2pt_perversion@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      1 year ago

      I wish there was a way to get an entire Reddit archive over here. Realistically I’m still going to have to search Reddit because it has 10+ years of answers to obscure questions.

      • CascadeDismayed@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        1 year ago

        Minds more intelligent than mine are probably already at work on these problems. I’ve seen multiple discussion of people saying they are designing and working on solutions. It may take some time to see results, though.

  • speaker_hat
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    You just need to backlink to Lemmy from an already known websites and the crawlers will find their way.

    See: https://backlinko.com/link-building-strategies

    About the search engine to dispaly the correct instance, that is something I don’t think that works now and would require optimizations.