Hacker Newsnew | past | comments | ask | show | jobs | submit | theredsix's commentslogin

Extrapolating the benchmarks, this would imply the best RYS 27B is capable of out performing the 397B MoE?

super clever and awesome!

The freeze sometimes does capture in between states. What I've seen the agent does in those cases is that it recognizes it's in between states and calls browser_wait(). Where the agent goes off the rails isn't a snapshot in the middle of a state transition, (it's smart enough to know to retry in that case), it's when the DOM changes after the agent believes the page has settled.

For async, lots of people smarter than me working on the smarter agent problem. Though there's a latency floor with inference due to prompt processing, and output generation. Without tools like ABP, the LLM is always aiming at a moving target.


*bows


Right now, it's evading all anti-botting detectors I've tested it on. I believe it's due to the fact it runs in headful mode and I've removed all detectable CDP signatures. Input events are also simulated at a system level (typing is at 200 WPM) so it's very hard for a page's javascript to detect it's not in a human operated chrome. A lot of detection on headless happens due to the webGPU capabilities being disabled since a modern computer is very unlikely to not support those. You could also wire up one of the Heretic models as a dedicated Captcha solver, I recommend Qwen 3.5 27b Heretic! https://huggingface.co/coder3101/Qwen3.5-27B-heretic


Maintaining the fork isn't so bad, the core chromium changes are only a few hundred lines and I was able to extend already existing concept like debugger pausing and virtualtime emulation while riding off mojo IPC for cross thread communications.


That's actually super smart. If you're just piggybacking on native debugger pauses and Mojo IPC instead of hardcoding custom locks into V8, the patchset is genuinely microscopic. I take back the meat grinder comment - with a diff that clean, automated rebases across major Chrome releases are actually totally manageable. Hats off!

Exactly! This race condition is exactly the category of problems ABP will solve.


I've consolidated most of the changes in chrome/browser/abp and used shim's for the other modifications so rebase is light and handleable by Claude. I'd love to get this upstreamed. An intro to the chromium maintenance team would be greatly appreciated!


Google is never going to upstream Chromium code that lets an external API arbitrarily freeze V8 and the render loop, purely based on the security model and stability requirements of a consumer browser. Your only real path forward is maintaining a custom patchset on top of stable releases, exactly like Brave or Electron do. Just be prepared that Claude won't save you when they inevitably rewrite the Blink architecture again


It's a long shot but getting ABP to be a first party citizen alongside CDP would be my dream!


/superpowers! that plugin is the GOAT


Thanks! I assume you are referring to this https://github.com/obra/superpowers

I use it as well (a customized version suited for my worflow). It is indeed the GOAT.


those are factored into the wait heuristic only if there's a navigation event since clicks on an already loaded page won't trigger those. You can point Claude/codex at https://github.com/theredsix/agent-browser-protocol/tree/dev... and have it walk you through the wait heuristic step by step.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: