I've always found the Gemma models to vastly under-perform on vision tasks compa... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		thot_experiment 5 days ago \| parent \| context \| favorite \| on: Gemma 4 12B: A unified, encoder-free multimodal mo... I've always found the Gemma models to vastly under-perform on vision tasks compared to Qwen so that's nothing new.
		help

mountainriver 5 days ago [–]

The Qwen series adopted vision wayyy earlier than anyone else. No idea why the other labs were sleeping on it but they had about 2 years of experimentation without any competition.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact