I love to show that kind of shit to AI boosters. (In case you’re wondering, the numbers were chosen randomly and the answer is incorrect.)

They go “waaa waaa, it’s not a calculator”, and then I can point out that it got the leading 6 digits and the last digit correct, which is a lot better than it did on the “softer” parts of the test.

  • diz@awful.systems (OP) · 1 day ago · 13 upvotes

    Try asking my question to Google Gemini a bunch of times; sometimes it gets it right, sometimes it doesn’t. Seems to be about 50/50, but I quickly ran out of free access.

    And Google is planning to replace their search (which includes a working calculator) with this stuff. So it is absolutely the case that there’s a plan to replace one of the world’s most popular calculators, if not the most popular, with it.

    • HedyL@awful.systems · 1 day ago · 12 upvotes

      Also, a lawnmower is unlikely to say: “Sure, I am happy to take you to work” and “I am satisfied with my performance” afterwards. That’s why I sometimes find these bots’ pretentious demeanor worse than their functional shortcomings.

      • lIlIlIlIlIlIl@lemmy.world · 1 day ago · 1 upvote, 14 downvotes

        “Pretentious” is a trait expressed by something that’s thinking. These are the most likely words that best fit the context. Your emotional engagement with this technology is weird.

        • ebu@awful.systems · 10 hours ago · 7 upvotes

          “emotional”

          let me just slip the shades on real quick

          “womanly”

          checks out

        • diz@awful.systems (OP) · 1 day ago · 13 upvotes

          Pretentious is a fine description of the writing style. Which actual humans fine-tune.

          • swlabr@awful.systems · 1 day ago · 12 upvotes

            Given that the LLMs typically have a system prompt that specifies a particular tone for the output, I think pretentious is an absolutely valid and accurate word to use.

            • HedyL@awful.systems · 22 hours ago · 9 upvotes

              Also, these bots have been deliberately fine-tuned in a way that is supposed to sound human. Sometimes, as a consequence, I find it difficult to describe their answering style without employing vocabulary used to describe human behavior. Also, I strongly suspect that this deliberate “human-like” style is a key reason for the current AI hype. It is why many people appear to excuse the bots’ huge shortcomings. It is funny to be accused of being “emotional” when pointing out these patterns as problematic.