I tried Gemini 2.5 Deep Think, was not very impressed ... too much hallucination... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		Davidzheng 27 days ago \| parent \| context \| favorite \| on: Frontier AI agents violate ethical constraints 30–... I tried Gemini 2.5 Deep Think, was not very impressed ... too much hallucinations. In comparison GPT 5.2 extended time hallucinates at like <25% of the time and if you ask another copy to proofread it goes even lower.

mapontosevenths 27 days ago [–]

I never tried 2.5. Three is pretty solid though, at least for my use case.

If there's a specific query you want me to run through it for comparison I'm happy to give it a go.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact