NinjaZ@infosec.pub to AI@lemmy.ml · 8 months ago

The GPT-4 barrier has finally been broken

simonwillison.net

Four weeks ago, GPT-4 remained the undisputed champion: consistently at the top of every key benchmark, but more importantly the clear winner in terms of “vibes”. Almost everyone investing serious time exploring LLMs agreed that it was the most capable default model for the majority of tasks—and had been for more than a year.

Today that barrier has finally been smashed. We have four new models, all released to the public in the last four weeks, that are benchmarking near or even above GPT-4. And the all-important vibes are good, too!

Those models come from four different vendors.

TheAnonymouseJoker@lemmy.ml
4 months ago
Removed by mod
- mozz@mbin.grits.dev
  8 months ago
  Mistral has a lot of open source models that are quite good. Their largest ones are closed; for what reason I don’t know.
  - TheAnonymouseJoker@lemmy.ml
    4 months ago
    Removed by mod

AI@lemmy.ml

Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.
