• FLeX@lemmy.world
    link
    fedilink
    English
    arrow-up
    65
    arrow-down
    13
    ·
    2 days ago

    So, not a single developer thought about filtering useless words locally before triggering the request ?

    How can they be so dumb ?

    • Nighed@feddit.uk
      link
      fedilink
      English
      arrow-up
      1
      ·
      16 hours ago

      The company I worked for tried that as an experiment on how much money it saves.

      Absolutely awful, even removing connectives causes problems.

    • Endmaker@ani.social
      link
      fedilink
      English
      arrow-up
      69
      arrow-down
      2
      ·
      edit-2
      2 days ago

      useless words

      The writer of this article doesn’t consider these words useless though. They are suggesting that these words may improve response quality.

        • Dran@lemmy.world
          link
          fedilink
          English
          arrow-up
          39
          arrow-down
          3
          ·
          2 days ago

          Anecdotally, I use it a lot and I feel like my responses are better when I’m polite. I have a couple of theories as to why.

          1. More tokens in the context window of your question, and a clear separator between ideas in a conversation make it easier for the inference tokenizer to recognize disparate ideas.

          2. Higher quality datasets contain american boomer/millennial notions of “politeness” and when responses are structured in kind, they’re more likely to contain tokens from those higher quality datasets.

          I haven’t mathematically proven any of this within the llama.cpp tokenizer, but I strongly suspect that I could at least prove a correlation between polite token input and dataset representation output tokens

          • snooggums@lemmy.world
            link
            fedilink
            English
            arrow-up
            9
            arrow-down
            31
            ·
            2 days ago

            It FEEEEEEEEEEEELS better is what the authors said too. Both articles were completely worthless dreck about how they felt about the responses.

            • Dran@lemmy.world
              link
              fedilink
              English
              arrow-up
              21
              arrow-down
              2
              ·
              2 days ago

              Yes they were, so I’m offering you an actual theory as to why this may actually be true, yet difficult to “prove”.

              Smoking was bad for your health long before anyone sat down and took the time to prove it. Autoregressive LLM tokenizer are a very new field of computer science and it’s going to take a while for the community to collectively understand everything we’re currently doing by trial and error.

              • snooggums@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                arrow-down
                18
                ·
                2 days ago

                Smoking was known to be bad for your health long before anyone did studies because it was easily correlated with coughing and other breathing issues and early death. The evidence was obvious and apparent.

          • DreamButt@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            ·
            2 days ago

            Honestly they were better until recently. GPT (at least) has gotten really good at de-escalation and providing (mostly) factual responses when you get irate

        • halcyoncmdr@lemmy.world
          link
          fedilink
          English
          arrow-up
          15
          arrow-down
          2
          ·
          2 days ago

          Please may be useless. Thank you isn’t useless. That tells you that the prior response gave them the answer they were looking for. No response at all could mean that, or that they gave up, or any number of other things.

          • FLeX@lemmy.world
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            2
            ·
            2 days ago

            What if it’s a sarcastic thanks ?

            Also, the public models are fixed right ? Not perpetually training AFAIK ? So it should really change nothing unless it’s linked to those “thumb up/down” buttons

          • snooggums@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            10
            ·
            edit-2
            2 days ago

            Both authors state that the phrasing from the AI is what is improved based on how they felt about the answers, not the accuracy.

        • Viri4thus@feddit.org
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          12
          ·
          2 days ago

          You’re being downvoted, this is a perfect example of:

          *they hated Jesus because he spoke the truth 😂🤣

          • snooggums@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            13
            ·
            edit-2
            2 days ago

            I’m smart enough to know that an article peppered with assumption and zero facts is dogshit.

            Presumably

            might

            could

            Doesn’t matter how educated someone is when they write a bunch of words about possibilities with no actual evidence. They are morons because they are spouting a bunch of useless speculation about a shitty and unreliable technology and naval gazing about whether ‘being polite’ to a bullshit generator is beneficial. I feel dumber for having read both the article and the linked article.

              • snooggums@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                arrow-down
                9
                ·
                2 days ago

                Maybe don’t write an article speculating about something possible being true based on another article that is also speculating about something being possible when it being able to confirm it is possible. Like speculating about dinosaurs makes sense as we don’t have a way to verify their soft tissues. But when it comes to AI, there are ways to actually confirm the reliability of responses.

          • killeronthecorner@lemmy.world
            link
            fedilink
            English
            arrow-up
            13
            arrow-down
            1
            ·
            2 days ago

            Hi, I have a degree in computer science and work with AI every day.

            Feelings aren’t a good way to measure things scientifically, they are right about that.

            But saying that words can just be filtered is easier said than done. You’re back at needing to do a lot of processing to identify and purge these words. This is still going to cost a lot of money and potentially lead to less meaningful inputs. Now you also have to maintain the software that does the word identification, keep it well tested, maintain monitoring and analytics for it, and so on.

            So, in short, everyone here is wrong and I’m considering packing it all in and buying a small potato farm with no internet connection.

            • snooggums@lemmy.world
              link
              fedilink
              English
              arrow-up
              3
              arrow-down
              4
              ·
              edit-2
              2 days ago

              The big thing here is that ‘polite’ words are being singled out as extraneous when there are tons of extraneous words being used. The focus is on words that make it seem like AI has feelings or intent.

              There is no reason to filter any words, because the entire point of LLMs is to take inefficient human communication and do stuff with it. ‘Please’ isn’t any more of a waste that ‘the’ or including a period at the end of a sentence.

              Not to mention the fact that the whole thing is so horribly inefficient that ‘extra’ words cost millions of dollars to process. Holy shit that is terrible design.

      • FLeX@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        2 days ago

        They talk about separate messages though, if you just send “thanks” it changes nothing to the answer

        • ikt@aussie.zone
          link
          fedilink
          English
          arrow-up
          7
          arrow-down
          8
          ·
          2 days ago

          thanks, you’re clearly a genius, these LLM providers should pay you a lot of money to implement this, you’d save them millions 🙄

            • ikt@aussie.zone
              link
              fedilink
              English
              arrow-up
              10
              arrow-down
              1
              ·
              2 days ago

              noo the joke was he was supposed to reply

              "you're welcome my dude"

              • FLeX@lemmy.world
                link
                fedilink
                English
                arrow-up
                5
                ·
                edit-2
                2 days ago

                I wrote “== thanks”, not “contains thanks”. All this conversation is about messages containing ONLY a SINGLE useless word.

                Obviously if it’s just at the beginning or the end of a legit message, it’s not the same thing…