Imanari · 2026-06-11T19:19:00 1781205540

Only tangentially related: MiMo-2.5Pro is fast, cheap, and very capable, although not quite at GPT5.5-level intelligence (I don't use the Claudes). It works flawlessly in Pi and is an excellent workhorse. I expect big things from their next model.

Imanari · 2026-06-08T06:31:32 1780900292

I always feel GPT5.5 is better than Chinese models at ‘getting the bigger picture’ when I am describing something vaguely. What’s your experience with that?

freakynit · 2026-06-08T07:20:07 1780903207

That's true. Open models still don't match these very high-end models at the highest levels of understanding.

But that's also not needed most of the time. There will always be a "better" model... but that doesn't make other models "bad".

For my use cases, open models are now almost on par with these top models... and it's only extremely rarely that I genuinely "need" the help of top-of-the-line closed models.

Imanari · 2026-06-03T08:46:35 1780476395

What do the models have as inputs? What’s their harness like? Just price data, or are they free to pull reports etc. from the web?

Imanari · 2026-05-20T04:35:46 1779251746

Very cool work! Regarding your finding about "the tool ran successfully and returned data" versus "the tool ran successfully but found nothing": couldn’t this be solved by designing better tool responses instead of adding another layer in between? Just curious and probing my understanding.

zambelli · 2026-05-20T05:21:03 1779254463

100%, a better tool would work or even remove the problem overall.

The issue/use case is more around, say, a database table or legacy systems where your tool is just hitting a legacy API that may or may not be good: a surface you don't control.

Honestly, it didn't come up as a use case in this eval; it's more about the concept of a standard, like 4xx vs 5xx. I just felt it was missing from the ecosystem overall.
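
To make the idea concrete, here is a minimal sketch of what such a convention could look like when wrapping a surface you don't control. The `ToolResult` envelope, the `ToolStatus` names, and `wrap_legacy` are all hypothetical and made up for illustration; they are not from the eval or from any existing standard. The sketch also assumes the legacy call returns a list of rows or `None`.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Any, Callable, List


class ToolStatus(str, Enum):
    # Hypothetical status names, loosely in the spirit of HTTP status classes.
    OK = "ok"            # the tool ran and returned data
    EMPTY = "empty"      # the tool ran fine but nothing matched
    ERROR = "error"      # the tool itself failed


@dataclass
class ToolResult:
    status: ToolStatus
    data: List[Any] = field(default_factory=list)
    message: str = ""


def wrap_legacy(call: Callable[[], Any]) -> ToolResult:
    """Normalize a legacy call's return value into an explicit status.

    Assumes `call` returns a list of rows or None (an assumption, not a rule).
    """
    try:
        raw = call()
    except Exception as exc:
        # The tool did not run successfully at all.
        return ToolResult(ToolStatus.ERROR, message=str(exc))
    rows = list(raw or [])
    if not rows:
        # Success with no results is stated explicitly, not left as a bare [].
        return ToolResult(ToolStatus.EMPTY, message="Query succeeded; no rows matched.")
    return ToolResult(ToolStatus.OK, data=rows)
```

The point of the wrapper is that the model never has to guess whether an empty payload means "nothing found" or "something broke"; the status field says so directly.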

Imanari · 2026-05-18T07:25:13 1779089113

Good old aider ahead of its time

Imanari · 2026-05-08T10:57:44 1778237864

That’s exactly the approach of smolagents. The only “tool” available is writing Python code.

Imanari · 2026-05-08T10:20:33 1778235633

As with so many things, aider.chat was ahead of its time with its ability to create deterministic scripts.

Imanari · 2026-04-24T06:02:43 1777010563

Just tested it via OpenRouter in the Pi coding agent and it regularly fails to use the read and write tools correctly, very disappointing. Anyone know a fix besides prompting "always use the provided tools instead of writing your own call"?

rane · 2026-04-24T07:14:22 1777014862

FWIW, works great in Claude Code.

https://api-docs.deepseek.com/guides/coding_agents#integrate...

tariky · 2026-04-24T18:19:34 1777054774

If you have access to any other model, it can create a Pi extension that fixes the problem. At least that worked for me.

Imanari · 2026-04-24T19:19:09 1777058349

Like a special parser? Would you mind elaborating?

tariky · 2026-04-25T06:10:31 1777097431

It intercepts JSON commands and turns them into tool calls.
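
Roughly along these lines, as a sketch only. This is not Pi's extension API; the `TOOLS` registry, the `{"tool": ..., "args": ...}` shape, and the function names are all assumptions for illustration. It scans the model's plain-text output for a JSON object that looks like a tool call and dispatches it to the real tool.

```python
import json
from typing import Any, Callable, Dict, Optional

# Hypothetical tool registry; a real agent would register its own read/write tools.
TOOLS: Dict[str, Callable[..., Any]] = {
    "read": lambda path: f"<contents of {path}>",
}


def extract_json_call(text: str) -> Optional[dict]:
    """Return the first JSON object in `text` shaped like a tool call, if any."""
    decoder = json.JSONDecoder()
    idx = text.find("{")
    while idx != -1:
        try:
            # raw_decode parses one JSON value starting at idx and ignores the rest.
            obj, _ = decoder.raw_decode(text, idx)
        except json.JSONDecodeError:
            obj = None
        if isinstance(obj, dict) and "tool" in obj and isinstance(obj.get("args"), dict):
            return obj
        idx = text.find("{", idx + 1)
    return None


def intercept(model_output: str) -> Optional[Any]:
    """Run a JSON command embedded in plain text as an actual tool call."""
    call = extract_json_call(model_output)
    if call is None or call["tool"] not in TOOLS:
        return None  # nothing to intercept; the caller passes the text through
    return TOOLS[call["tool"]](**call["args"])
```

For example, `intercept('Reading now: {"tool": "read", "args": {"path": "a.txt"}}')` would call the registered `read` tool with `path="a.txt"`, while ordinary prose returns `None` and is left alone.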

abstracthinking · 2026-04-24T06:05:35 1777010735

They have just released it. Give it some time; they probably haven't tested it with Pi yet.

Imanari · 2026-04-24T06:19:58 1777011598

How can they fix it after the release? They would have to retrain/finetune it further, no?

zargon · 2026-04-24T06:26:36 1777011996

It's only in preview right now. And anyway, yes, models regularly get updated training.

But in this case, it's more likely just to be a tooling issue.

mark33vh · 2026-04-24T09:58:50 1777024730

Yeah, hope they fix this for Pi.

Imanari · 2026-04-23T06:42:43 1776926563

Why do the Xiaomi releases never get any attention? Mimo-V2-Pro was pretty good, excited to try V2.5.

passive · 2026-04-23T22:58:15 1776985095

I am curious about this myself, as it's a major company that I would think is worth taking seriously. But this and the previous release got suspiciously few comments.

Imanari · 2026-04-20T10:39:05 1776681545

> Listening often means not jumping to a solution; but absorbing and processing someone’s pain

> When in actuality, they should [...] finding a way to solve the pain points

Honest question: how do I 'absorb someone's pain'? And how do I transition from that into eventually formulating the feature/ticket?