With pure LLM-based chatbots this is beyond question, as the responses provided range between plausible to completely delusional. Grounding LLMs with RAG reduces the amount of made-up nonsense ...
I challenged all those vendors with a grueling question on RAG and LLM evaluation, but only one of them had a good answer (Galileo, via their "Evaluation Intelligence" platform). After that, I kept ...