• @twelve@sh.itjust.works
    link
    fedilink
    English
    181 year ago

    I still find astonishing that tech crunch buys the argument of ML model training.

    No one in their sane mind would use the API (that have always been rate limited) for fetch data for text generation. People would use HTTP or, even better, archives of reddit.

    Why? Because there is better or no rate limit, there is no need to write anything (only reading) and it will stay free 🙂 Also super fresh data is not dramatically useful (except in very specific corner cases when something in the news change the way we talk)

    • @Hotzilla@sopuli.xyz
      link
      fedilink
      English
      10
      edit-2
      1 year ago

      Web crawling has always worked through raw HTTP/HTML parsing, why create site specific API calls that require authentication and are throttled.

      This excuse is pure bullshit.

    • @AstralJaeger@lemmy.ml
      link
      fedilink
      English
      51 year ago

      Considering the Reddit API has a hilariously low limit, I fully understand why the AI bro’s will use a scraping approach instead. I’ve built small discord bots that had a difficult time following the API because you had so little Requests available! I was in the process of building an event-driven system which used multiple API tokens in order to be able to keep up with multiple feeds. Its just terrible.

  • @RedSky2200@lemmy.fmhy.ml
    link
    fedilink
    English
    61 year ago

    Wait an article I read earlier is claiming that subreddits are business as usual. Now, this article claims the opposite?

    • ruffslOP
      link
      fedilink
      English
      21 year ago

      Could you share the link to that one? Thanks. Looks like this TechCrunch article is sourcing info from emails with advertisers partnered with Reddit, not just from public statements about visitor traffic published by Reddit themselves.

      I wonder what the measured metrics are internally. Funny that those earning metrics would’ve been more readily available had they already IPO’ed on the public market.

  • @EeeDawg101@lemmy.ml
    link
    fedilink
    English
    61 year ago

    I think it’ll take a while for us to know the real overall impact spez’s decision has made on Reddit’s user base. Until then, it’s really just speculation unless something concrete comes out (like financial reports etc).

    • Boz (he/him)
      link
      English
      11 year ago

      This. There are short-term indicators, but a lot of Reddit’s cash flow now is probably from deals negotiated weeks or months ago, and a drop in sales this month might not show up on the balance sheet until weeks or months from now.

    • @Potato@sh.itjust.works
      link
      fedilink
      English
      1
      edit-2
      1 year ago

      They’re lining up an IPO. Anything suggesting that they can’t maintain 5-10% real growth year after year (like other companies that investors could put their money) is truly damming. A sustained decrease in revenue, even a small one, is going to gut the IPO valuation.

    • @dhork@lemmy.world
      link
      fedilink
      English
      11 year ago

      Except we can be sure that the entire drop is due to humans deciding Reddit is dead. How much of the remaining traffic are bots?

      • @ironsoap
        link
        English
        11 year ago

        There have been multiple reports on this not being upheld by Reddir admins. Wait on account deletion until after the GPDR and others have fought the right to be forgotten battle.

  • @Hazzard@lemm.ee
    link
    fedilink
    English
    21 year ago

    Don’t love the framing of this paragraph from TechCrunch. It’s not that they’re charging for the API. That’s understandable and obvious, and we all wanted the platform to survive. I’ll be happy to volunteer to contribute to lemmy development/server costs/app development one day. It’s that they’re grossly overcharging for the API to such an extreme degree that paid subscriptions to third party apps actually lose money.

    In April, Reddit announced its plans to start charging developers to access data through its API. The move was obvious — to restrict third parties from accessing Reddit data that can help build text-generating machine learning models such as OpenAI’s GPT 4. Developers building apps and bots to assist people using Reddit and researchers who wish to study the platform for noncommercial purchases were among the few exceptions. However, as a result, third-party apps, including popular Reddit client Apollo, found it difficult to pay for those charges and decided to go offline. Various popular subreddit moderators came in support of those apps and developers and started protesting against the API pricing move.