«No matter how much LLMs sound like a human, the truth is that they really don’t really understand language, despite being quite good at stringing words together. This proficiency in language can create the illusion of broader intelligence, leading to more elaborate responses. So basically, research shows what we suspected: LLMs are great at bullshitting you into thinking they know the answer. Many people buy this illusion because they either simply want to believe or because they just don’t use critical thinking—something that Microsoft’s researchers discovered in a new study looking at AI’s impact on cognitive functioning.»
– Jesus Diaz, 2025 (1)
«There are few areas where AI has seen more robust deployment than the field of software development. From «vibe» coding to GitHub Copilot to startups building quick-and-dirty applications with support from LLMs, AI is already deeply integrated. However, those claiming we’re mere months away from AI agents replacing most programmers should adjust their expectations because models aren’t good enough at the debugging part, and debugging occupies most of a developer’s time. That’s the suggestion of Microsoft Research, which built a new tool called debug-gym to test and improve how AI models can debug software.»
– Samuel Axon, 2025 (2)

I denne samlingen vil det komme tester av ulike samtaleroboter som ikke vil bli inkludert i sammenligningsgrunnlaget i første omgang. Testene beskrevet her er korte og ment som Work in progress gjennom 2025.
- Test av den kinesiske Deepseek R1
- Test av Grok
- Test av Kompas AI
- Test av NorskGPT
- Test av OpenAI sin ChatGPT Study mode
- Test av OpenAI ChatGPT-5

Leseliste
- Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
- OpenAI is launching a version of ChatGPT for college students
- Kompas AI: Deep Research & Report Generation for Comprehensive Insights
- Comparing Leading AI Deep Research Tools: ChatGPT, Google, Perplexity, Kompas AI, and Elicit
- Comparing Elicit, ChatGPT Deep Research, and Kompas AI: UX, Capabilities & Use Cases
- Grok, Gemini, ChatGPT and DeepSeek: Comparison and Applications in Conversational Artificial Intelligence
- A Systematic Review and Comprehensive Analysis of Pioneering AI Chatbot Models from Education to Healthcare: ChatGPT, Bard, Llama, Ernie and Grok
- Why Chatbots Are Not the Future
- Takk for nå, ChatGPT, dette gidder jeg ikke mer
