Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
esyir
55 days ago
|
parent
|
context
|
favorite
| on:
Accelerating Gemma 4: faster inference with multi-...
I'll add an expansion here. It's more useful to you locally, as you have excess compute that's generally wasted. If you're serving multiple user and trying to max output, you might cost some in this case
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: