Where do you get that from? At least ChatGPT isn’t limited to data from 2021. I haven’t looked into other models.
GPT-3.5 is limited to 2021. GPT-4, 4.5, and the imaginary upcoming GPT-5 models are not, but that doesn’t mean they aren’t limited in their own ways.
Are you sure those weren’t trained on data up to 2021, frozen, and then fine-tuned on later data?
I really don’t know, I’m speculating, but OpenAI doesn’t know either, that’s for sure. So we have the most popular ML system, used by millions, based on…what exactly?
Yeah, GPT-3.5 and some other FOSS models also say 2021.
OpenAI stated in a tweet a few months ago that the limitation is no longer in place.
To be fair, this tweet doesn’t say anything about training data, only that it can theoretically use present-day data if it looks it up online.
For GPT-4, I think it was initially trained on data up to 2021, but it has since gotten updates where data up to December 2023 was used in training. It “knows” this data and does not need to look it up.
Whether they managed to further train the initial GPT-4 model to do so, or added something they trained separately, is probably a trade secret.
Thanks!