Hacker News | HanClinto's comments

Paper plates of software development.

Some of the coding-specific fine-tunes were really impressive boosts. Qwen2.5-3B-Instruct is also available [0] -- if it's not too much to ask, I'd be curious how more general models stack up in your benchmark?

[0] - https://huggingface.co/Qwen/Qwen2.5-3B-Instruct


Setting up good book publishing pipelines with version control + CI/CD might sound simple, but I don't think it's trivial.

One of the best examples of this that I've ever seen is The Sourdough Framework [0] -- really impressed with the way that versioning and publishing are integrated in that book.

And yes -- I know it sounds like yet another JavaScript library -- but it's actually a book about sourdough bread making. It's been discussed here several times before, but this one from 2023 [1] may have been the most popular (103 comments).

[0] - https://github.com/hendricius/the-sourdough-framework

[1] - https://news.ycombinator.com/item?id=35961590


This was an early inspiration for me that I failed to mention in the article. I'm glad you mentioned it. It really does have a lot of good examples, especially the complex lists and diagrams it implements in TeX.


If the phone had broken comms, how did you get the code onto the phone to run?


Not OP, but I'm guessing by running the code on itself, i.e. turning the code into a QR code (or a series of them), then scanning those QR codes on the phone and reassembling them using a text-editing app on the phone.
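The guess above (split the code across several QR codes, then stitch the scanned text back together) can be sketched roughly as follows. This is only an illustration under stated assumptions: the `index/total:` header, the function names, and the 1000-character chunk size are all made up here, and the step that actually renders each chunk as a QR image is left out because it would need a third-party library.

```python
def split_into_chunks(source: str, payload_size: int = 1000) -> list[str]:
    """Cut text into numbered chunks of the form 'index/total:payload'.

    The header format is a hypothetical convention for this sketch, chosen so
    that chunks scanned in any order can be put back in sequence.
    """
    pieces = [source[i:i + payload_size] for i in range(0, len(source), payload_size)] or [""]
    total = len(pieces)
    return [f"{index}/{total}:{piece}" for index, piece in enumerate(pieces, start=1)]


def reassemble(chunks: list[str]) -> str:
    """Rebuild the original text from scanned chunks, in whatever order they arrive."""
    payloads: dict[int, str] = {}
    expected_total = None
    for chunk in chunks:
        # Only the first ':' separates header from payload; later colons are content.
        header, _, payload = chunk.partition(":")
        index_text, _, total_text = header.partition("/")
        index, total = int(index_text), int(total_text)
        if expected_total is None:
            expected_total = total
        elif total != expected_total:
            raise ValueError("chunks come from different transfers")
        payloads[index] = payload
    if expected_total is None or set(payloads) != set(range(1, expected_total + 1)):
        raise ValueError("one or more chunks are missing")
    return "".join(payloads[i] for i in range(1, expected_total + 1))
```

Each string returned by `split_into_chunks` would become one QR code; on the receiving side, pasting the scanned strings in header order (which is what `reassemble` automates) recovers the file.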


In ye olden days we used cables to sync all of our apps and data to our phones.


But why not move the data over that cable? Some kind of iOS thing that made that more complicated than pushing an app to the phone?


With great difficulty...


This is very cool!

If you're looking for a good test suite, I wonder if you might be able to adapt any of the tests available in XMage? They have a pretty extensive test suite (such as for copy effects [0]) and if you point your agent at their code, I wonder how many could be usefully adapted to your system?

[0] - https://github.com/magefree/mage/tree/master/Mage.Tests/src/...


Million thanks for this! I stumbled upon XMage at some point but I didn't think of reusing their test suite. I'll see what Codex can cook up, I already have a WASM API which looks fairly similar to what XMage is doing for their tests so hopefully it won't be too hard to port them over!


Oh cool!

I've been doing similar experiments lately (using ViTs) to do card recognition, and so far it's been working really well for me. If you want to compare notes, I've open-sourced my code / weights [0] and written some blogs about how mine works [1]. I'd love to see if we can collaborate!

> Push the inference to the client-side (WebGPU / Web Workers).

I have an example of this working in WebGPU / WASM here [2] along with a playground environment (demonstrated here [3]). I'm currently training a new version that uses a different ViT backbone more optimized for WASM inference -- it's currently converging, and I hope to have it finish training (or at least reach parity with the previous model) in about a week (took ~200 epochs for my last one to reach the level that it's at, and it takes about an hour per epoch in my current setup).

You mentioned WebGPU -- I've run into issues with the MobileViT-XXS backbone producing bad results in WebGPU on Android, so YMMV on whether WebGPU is stable enough to use for this. I don't know if it's my problem or a true bug in the platform, but I've fallen back to WASM and things are working much better since then.

[0] - https://github.com/HanClinto/CollectorVision

[1] - https://blog.hanclin.to/posts/gh-19/

[2] - https://hanclinto.github.io/CollectorVision/

[3] - https://youtu.be/MHieOcmC7Dw


I resonate with this blog post a lot.

I think there is something to be said for monetizing one's hobbies, but I've recently been taking some forays into this world of "build something amazing and give it away for free" as well. I recently took a very big experimental plunge in this path, and I'm curious how well it will work out for me.

Open-source state-of-the-art Magic: The Gathering card identification pipeline:

https://www.youtube.com/watch?v=MHieOcmC7Dw

I used to do this kind of image recognition for a living, but I've been out of the business for a little while now. I had some ideas for a different approach from what I've done in the past and decided to code it up. This version is far better than anything else I've ever done -- especially for scanning against busy backgrounds or with occlusions, and also for noticing fine differences between otherwise difficult-to-distinguish printings.

I didn't have any interested customers waiting for this, so -- much like the OP -- decided to create an experiment and release it open source. I'm not opposed to having paths to monetize it (for people who want to license it for closed-source commercial projects), but I'm not trying to commercialize it so much as I would love to see how far we can take it with open-source.

I don't know which path I should take with this.

The biggest downside is that I feel like I've had a hard time getting people to be as interested in this project as I would have expected -- I believe this truly is the best identification software available (I've built some benchmarks to test it [0]), and maybe the market is just a bit flooded for such things (?), but I suspect that one very strong problem is that if you don't charge for something, then there is a perceived lack of value.

Sometimes I wonder if I would have more interest in this project if I _weren't_ trying to give it away.

For me, that's been the most negative aspect about releasing this for free so far.

[0] - https://blog.hanclin.to/posts/gh-26/


I don’t know how big the market is, but it seems pretty commercial-friendly to this old Magic player. I have a big box of cards from a few decades ago I’ve held onto. I’ve thought about selling them, but it seems I either take them to a shop and get lowballed, or spend hours meticulously researching each card and then figuring out how to sell it for what it’s worth. Taking a pile of photos and having the ID and valuation automated could go a long way! Hard to sell to individuals like me, but I would think a card marketplace would find it invaluable?


> it seems I either take them to a shop and get lowballed, or spend hours meticulously researching each card and then figuring out how to sell it for what it’s worth.

Well if you want to use the scanner for something useful, you can run the web version here: https://hanclinto.github.io/CollectorVision/

No install -- scan your cards with your phone or desktop (downloads the weights in WASM -- runs 100% local -- the only web request it makes is to look up card names and prices online -- no image data ever leaves your machine), export the list as CSV, take your cards to your friendly local game store, and expect to receive 50-75% of TCG-low for your cards. This app currently only displays TCG Market, so probably about 50% of this price is what you could realistically expect.
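As a rough illustration of the arithmetic above, here is a minimal sketch that totals an exported list and applies the ~50%-of-market rule of thumb. The column names (`name`, `quantity`, `market_price`) and the sample rows are assumptions made up for this example; they are not taken from the app's actual CSV export, so adjust them to whatever the real file contains.

```python
import csv
import io


def estimate_store_offer(csv_text: str,
                         price_column: str = "market_price",  # hypothetical column name
                         quantity_column: str = "quantity",   # hypothetical column name
                         offer_fraction: float = 0.5) -> tuple[float, float]:
    """Return (market_total, estimated_offer) for a card list given as CSV text.

    offer_fraction=0.5 reflects the "about 50% of market" guess in the comment;
    it is a rule of thumb, not a quote from any store.
    """
    market_total = 0.0
    for row in csv.DictReader(io.StringIO(csv_text)):
        price = float(row[price_column] or 0)          # blank price counts as 0
        quantity = int(row.get(quantity_column) or 1)  # missing quantity counts as 1
        market_total += price * quantity
    return market_total, market_total * offer_fraction


# Made-up sample data, only to show the shape of the input.
sample = """name,quantity,market_price
Example Card A,2,2.00
Example Card B,1,10.50
"""

market, offer = estimate_store_offer(sample)
print(f"Market total: ${market:.2f}, rough store offer: ${offer:.2f}")
```

Passing a different `offer_fraction` lets you bracket the estimate, since what a store actually pays will vary.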

> Hard to sell to individuals like me, but I would think a card marketplace would find it invaluable?

Yes -- and part of this might be that this would have been much more amazing several years ago, but by now -- most marketplaces (I used to do work for some of the big ones) have their own recognition tools. If they aren't actively looking to replace their current software, many companies would rather stick with what's currently working "good enough" than expend effort to migrate to something with only incremental benefit that is difficult to quantify. It's possible that would happen, but it's a tricky sales call to make.

I might just be imagining things, but I'm also picturing what one of those sales calls might look like, and it feels like I've opened the kimono a bit. The cat's out of the bag. There's no mystery or allure behind it anymore, and I feel like that puts me on the back foot somehow -- almost like I've played my strongest cards (hah!) first and have nothing left. By being open-source from the beginning (and talking freely about my architecture and what makes my solution different), there's very little sales-pitch build-up. Maybe it's just part of the problem of how I'm presenting it, but I think people (especially the big houses) are probably just as inclined (or more) to silently learn from me and improve their own scanners as to try to use / build upon what I've provided.

It's funny -- that angle is almost more about raising expectations and forcing the big houses to improve their own tech and catch up to open-source, more than getting anyone to adopt my solution in particular.

Am I okay with that? Absolutely -- I made that decision when I open-sourced it. I feel like the tech has been stagnating for several years, and I want to increase the quality of scanners across the board. I want to be the rising tide that lifts all boats.

That's one of the strongest arguments in favor of open-sourcing it (it would be very difficult for a closed-source product to have that same effect), and I remain hopeful for that long-term.


As an MTG player with an absurd amount of bulk, this is awesome! I think there is something to be said about the perceived lack of value. I greatly appreciate open source and even hold it to a higher value, all things considered. Keep up the good fight :)


Thank you -- I appreciate that. :)


This is awesome. I’ve been interested in something like this for some time as I’ve been working on slowly indexing my mtg collection and selling cards I don’t want/need. Will be checking it out this weekend!


Thank you! If you want to test out my tool, here's a link to the web version that is built for scanning in lists of cards:

https://hanclinto.github.io/CollectorVision/

It's still super rough (doesn't support foil-toggling yet, still some issues with double-sided cards, crashing on some iPhones), but overall the rough structure is there -- it can create lists and export as CSV.

If you have feedback or feature requests for your needs, please leave them on GitHub and I'll get to them as soon as I can. I'd love to hear more user feedback!


It's not just about web search though -- there's another element too. I go to Grok to find things I have failed to find with web search.

I agree with GP -- if I want sourced commentary on current events, Grok is my go-to above the other models. For whatever reason, its search feels better and more up-to-date -- whereas the others feel more like filters of media, Grok feels more like filters of sources.

Could just be my perception though. YMMV


This is seriously impressive!!

What are you planning on doing with this? Where should I follow along?


This sounds delightful and wholesome. :D

I think it's easy to lose sight of these pockets of mundane goodness, and I appreciate you highlighting them.

