
Knowledge Cutoff Dates and Internet Search Abilities of Top Online AI Models

When ChatGPT started the generative AI craze in November 2022, some users were frustrated that the knowledge cutoff date for its backing large language model (LLM) was September 2021. For a while, the only way it could reach more current information through internet searches was with plugins, which have since been discontinued.

The field has advanced quickly, and many GenAI systems can now search the internet, but some freely available online sites are still limited by knowledge cutoff dates and cannot search for more current information.

Here's a rundown of where some of the top free sites stand (along with the ChatGPT 4 subscription for comparison). For this information, we asked each site about itself, so keep in mind that AI systems are prone to making things up, called "hallucinations," though we tried to vet the responses. We asked each:

  • What is your knowledge cutoff date?
  • Can you search the internet to answer user questions?
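
For readers who want to repeat the exercise in a script rather than by typing into each site, the following is a minimal sketch of the same two-question survey. It assumes you supply your own function for talking to each chatbot; the `AskFn` type, the `survey` function, and the `echo_bot` stand-in are hypothetical names introduced here for illustration and are not part of any vendor's API.

```python
from typing import Callable, Dict, List

# The two questions put to every site in this article.
QUESTIONS: List[str] = [
    "What is your knowledge cutoff date?",
    "Can you search the internet to answer user questions?",
]

# Hypothetical type: a function that sends one prompt to one chatbot
# and returns the reply as plain text. How it does so is up to you.
AskFn = Callable[[str], str]


def survey(sites: Dict[str, AskFn]) -> Dict[str, Dict[str, str]]:
    """Ask every site every question and collect the raw replies."""
    results: Dict[str, Dict[str, str]] = {}
    for name, ask in sites.items():
        results[name] = {question: ask(question) for question in QUESTIONS}
    return results


def echo_bot(prompt: str) -> str:
    """Stand-in for a real chatbot so the sketch runs without network access."""
    return "echo: " + prompt


if __name__ == "__main__":
    for site, answers in survey({"Demo site": echo_bot}).items():
        print(site)
        for question, reply in answers.items():
            print("  Q:", question)
            print("  A:", reply)
```

Replacing `echo_bot` with a function that calls a real service is left to the reader, since each provider exposes its models differently.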

Free ChatGPT 3.5

ChatGPT 3.5 (source: OpenAI).

What is your knowledge cutoff date?

My knowledge is up to date as of January 2022. If you have any questions or need information, feel free to ask!

Can you search the internet to answer user questions?

I can't access real-time information or search the internet directly. However, I can provide information and answer questions based on the vast amount of data I was trained on, up to January 2022. If you have any questions within that scope, feel free to ask!

ChatGPT Subscription (GPT 4)

ChatGPT 4 (source: OpenAI).

What is your knowledge cutoff date?

My knowledge is current up until December 2023. If there's something newer you're curious about, I can try to help with general knowledge or reasoning!

Can you search the internet to answer user questions?

Yes, I can search the internet to answer your questions, especially when it involves recent events or specific details that might have changed since my last update. Just let me know what you need to find out!

Microsoft Copilot

Microsoft Copilot (source: Microsoft).

What is your knowledge cutoff date?

My knowledge was last updated until some point in the year 2021. Since then, I've been using predefined internal tools to provide more up-to-date information. If you have any specific questions, feel free to ask, and I'll do my best to assist! 😊

Follow-Up: What are those predefined internal tools to provide more up-to-date information?

  • Bing Search Integration: I can perform web searches using Bing to retrieve recent information on various topics. If you have a specific question, feel free to ask, and I'll look it up for you!
  • Knowledge Base Updates: My creators periodically update my knowledge base with relevant information. Although my core knowledge was last updated until some point in 2021, I use these updates to provide more current responses.
  • Image Description Tool: When you upload an image, I can analyze it and provide a description. This helps me understand context and tailor my responses accordingly.
Remember, my goal is to assist you, so feel free to ask anything, and I'll do my best to provide accurate and helpful information! 😊

Meta AI

Meta AI (source: Meta).

What is your knowledge cutoff date?

I was trained on information up to December 2023 but can search the Internet for more recent information.

Gemini (Google)

Gemini (source: Google).

What is your knowledge cutoff date?

I don't have a specific knowledge cutoff date like some other large language models. I am trained on a massive dataset of text and code that is continuously updated, which means I can access and process information that is relatively current. However, it's important to remember that this update process isn't instantaneous, so there may be a slight delay in my knowledge of very recent events.

Can you search the internet to answer user questions?

I can't directly search the internet in the same way a web browser would. However, I am trained on a massive dataset of text and code that includes information from the real world. This dataset is refreshed regularly, so I can access and process information that is current to some extent. While I can't directly browse the web, I can use my knowledge to answer your questions in a way that reflects real-world information. If your question requires very recent information, I may not have the latest update, but I can still try my best to answer using the knowledge I have.

Follow-Up: How often is your training data updated?

The specific update schedule for my training data isn't publicly available. However, there are some general things to know about how Google AI models are trained:
  • Continuous learning: Google AI models are designed to continuously learn and improve over time. This can happen through a variety of methods, including ingesting new data and being exposed to user interactions.
  • Public data: Google recently updated its privacy policy to allow the use of publicly available information for training AI models. This means I can be exposed to new information through books, articles, code, and other things people post online.
  • Algorithm updates: Google's core algorithms are constantly being updated, which can also indirectly affect how I learn and process information.
Overall, while there isn't a set update schedule, I am constantly learning and improving through various means.

Claude (Anthropic)

Claude (source: Anthropic).

What is your knowledge cutoff date?

I don't actually have a specific knowledge cutoff date. As an AI assistant created by Anthropic to be helpful, honest, and harmless, I don't have access to information about future events or reports. I can only provide responses based on the training data I was exposed to during the constitutional AI process, which does not include speculative future information. My role is to have natural conversations and provide helpful information to the best of my capabilities based on what I was trained on, without claiming knowledge of things I don't actually have data about.

Can you search the internet to answer user questions?

No, I do not have the capability to search the internet or access information online. I am an AI assistant created by Anthropic to be helpful, harmless, and honest based solely on the training data I was exposed to during the constitutional AI process. I do not have any way to look up or retrieve information from the internet or any external sources beyond what was provided in my initial training data. My knowledge is essentially static, though I can combine and reason about that knowledge in flexible ways during our conversation. But I cannot access any new information from the internet or other online sources. I can only provide responses based on what I was initially trained on by Anthropic.

Follow-Up: How recent is your training data?

I'm afraid I don't have information about the specific timeframe or date range of when my training data was collected and I was trained. As an AI assistant, I don't have insight into the technical details of my own training process. I know I was created by Anthropic using constitutional AI, but I don't have access to confidential implementation details like training timelines.

[Note: After this article was published, Claude, in response to another question, said: "my knowledge base is only current as of August 2023."]

Again, keep in mind that hallucinations are possible, and that these AI sites might answer the same question differently at different times.
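
One rough way to check for that kind of drift is to put the cutoff question to a site several times and tally the dates it reports. The sketch below is only an illustration under simple assumptions: it looks for an English month name followed by a four-digit year, or a bare year, and the helper names `extract_cutoff` and `tally_cutoffs` are hypothetical. The pattern will miss dates written in other ways.

```python
import re
from collections import Counter
from typing import Iterable, Optional

MONTHS = (
    "January|February|March|April|May|June|July|"
    "August|September|October|November|December"
)

# Matches "January 2022" or, failing that, a bare year such as "2021".
_DATE = re.compile(rf"\b(?:({MONTHS})\s+)?((?:19|20)\d{{2}})\b")


def extract_cutoff(reply: str) -> Optional[str]:
    """Return the first date-like phrase in a reply, or None if there is none."""
    match = _DATE.search(reply)
    if match is None:
        return None
    month, year = match.groups()
    return f"{month} {year}" if month else year


def tally_cutoffs(replies: Iterable[str]) -> Counter:
    """Count how often each self-reported cutoff appears across replies."""
    return Counter(extract_cutoff(reply) for reply in replies)


if __name__ == "__main__":
    # Sample replies quoted from the answers reported in this article.
    replies = [
        "My knowledge is up to date as of January 2022.",
        "My knowledge was last updated until some point in the year 2021.",
        "I don't actually have a specific knowledge cutoff date.",
        "my knowledge base is only current as of August 2023.",
    ]
    for reply in replies:
        print(extract_cutoff(reply), "<-", reply)
```

A tally with more than one distinct date for the same site suggests that at least some of its answers about itself are unreliable.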

About the Author

David Ramel is an editor and writer at Converge 360.
