Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
thot_experiment
5 days ago
|
parent
|
context
|
favorite
| on:
Gemma 4 12B: A unified, encoder-free multimodal mo...
I've always found the Gemma models to vastly under-perform on vision tasks compared to Qwen so that's nothing new.
help
mountainriver
5 days ago
[–]
The Qwen series adopted vision wayyy earlier than anyone else. No idea why the other labs were sleeping on it but they had about 2 years of experimentation without any competition.
reply
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: