More

kingcauchy · 2026-06-15T01:42:03 1781487723

I wonder if we'll see a new sort of "role" in the training (user, system, assistant) for unstrusted sources, I'm a little surprised we haven't already. In fact it would probably make sense to have an arbitrary number of entity roles and to be able to configure the chat calls with truth values. Interesting article though.

That being said AI is not code, it's a statistical algorithm with non-determinism baked in. You can write code to run them but it's nothing without the evolution of the model weights from the training process. And you can absolutely make the model weights better aligned with intent.

kingcauchy · 2026-06-11T17:17:44 1781198264

How much of the apology was written by Claude? How much of the release note process was written by Claude? Will they have better prompts going forward to make sure Claude doesn't write upsetting things into the release notes for devs like silent nerfing? Spooky times.

kingcauchy · 2026-06-11T02:18:49 1781144329

« Trust us, we’re doing this for the good of humanity » (fills pockets with stock value and externalities from data center polloution) « No seriously trust us , at least we’re not Sam Altman »

Update: « Oh and we’re the only ones who will stop AI from turning into SkyNet and eating your babies, you just have to pay us to make sure we invent SkyNet first »

kingcauchy · 2026-06-10T00:36:30 1781051790

It's also hard to imagine them not doing this with any of the products they're building. "You can't use Claude to build an agent because that competes with Claude Code, you can't use Claude to build a design tool because that competes with Claude Design, you can't use Claude to build an email tool because that competes with Cowork."

kingcauchy · 2026-06-10T00:04:59 1781049899

"We won't use this product to spy or build weapons but you'll have to trust us, but we're also going to intentionally lie to you when you break our terms of service but trust us."

kingcauchy · 2026-06-09T23:37:12 1781048232

The silently never telling you is so insidious on top of it being ridiculous given how they trained the model in the first place. We do distributed model training for embedder/reranker models and I'd deeply resonate that this article's message exactly for our company. We couldn't trust the model in the first place, but now the model is intentionally burning our money if we asked it the wrong question, on top of being deeply expensive in the first place. If we did find evidence of being incorrectly nerfed, we'd never be able to reach a human to let them know. Too many reverse incentives with Anthropic, maybe they're about AI security but that doesn't make them ethical to consumers (i.e. humans).

kingcauchy · 2026-05-21T22:45:33 1779403533

I work with you so I'm bias but I think this is neat.

kingcauchy · 2026-03-18T21:17:23 1773868643

Thanks for the feedback!

In regards to contention, the answer is definitely dependent on how you host. We've had a lot of experience running different ML workloads and from an SRE perspective we knew you'd need a variety of different styles of hosting the models depending on read/write patterns of your usage. Termite and the proxy service/operator allow for all styles of model loading, either preloading and compiling to prevent cold starts or lazy loading to protect memory, with different pooling strategies and caching strategies for bundling multiple models running in the same Termite container.

If a heavy indexing job is running on a CPU only single-node deployment, it won't be using Raft (no replication). If it's running with GPU it doesn't share resources with the DB anyways really significantly there. If it's running distributed, also no issue with contention really.

Let us know if you have any other questions!

kingcauchy · 2026-03-18T15:01:12 1773846072

I'd be super interested to here more about what you all do in this space, currently Antfly (and Termite) doesn't handle custom content types explicitly because we've mostly focused on supporting the "classic" ones (application/pdf, image/png, image/jp2, e.g.) but we've had to build out a lot of the support for these things as custom support into the system. For instance I chose jsonschema for the schema so users could do exactly what you're suggesting, custom content types indexed differently. The ML side of things also has to know how to support them (i.e. does a pdf get rendered ocr then embedded or text extraction on some fallback). Would love to here about what you all do and the types of media you make searchable today!

kingcauchy · 2026-03-18T04:16:23 1773807383

https://github.com/antflydb/antfly/pull/8

Upon another look it looks like we were actually missing the pause lock for the backfill operation too during a shard split though, I also went ahead and added it to batch for good measure although that case should be caught by the manager! Thank you for the report!