Hacker Newsnew | past | comments | ask | show | jobs | submit | huss97's commentslogin

Steel.dev | Founding Engineers (Infrastructure, Applied AI), Growth, GTM | Toronto, San Francisco | In-office

Steel is building open-source browser infrastructure for AI agents & apps. We make it easy for AI engineers to ship products that interact with the web using our Sessions API - An API that controls fleets of browser sessions in the cloud while we handle everything from scaling to antibot and more.

Our tech stack:

- Frontend: Vanilla React + TailwindCSS + Shadcn, built with Vite, deployed on Vercel

- Backend: Fastify, Node.js, Zod, NATs, Drizzle with Redis, PostgreSQL, and S3

- Infrastructure: Pulumi, Go, Firecracker VMs, Docker

- Languages: Mostly Typescript, Rust, Go

With over 6,000 GitHub stars, dozens of paying customers, and millions of sessions served, we grew our platform 50x in 2025 purely through word-of-mouth and our open-source community. Backed by world-class investors, we're building a small, talent-dense team that's shaping the future of how humans & agents interact with the internet.

Current roles (all roles include competitive comp + equity):

- Infrastructure Engineer: Design, optimize, and scale our browser VM infrastructure

- Applied AI Engineer: Build on agent frameworks, integrations, and real-world AI use cases

- Founding Growth Lead: Own the growth machine to turn insane organic traction into a category-defining brand

- Founding GTM Lead: Be our first sales hire, closing deals with AI startups transforming entire industries

To learn more & apply (Mention HN): https://jobs.ashbyhq.com/steel

P.S. If you're feeling spicy, try our optional hiring challenge and submit the flag with your application once you reach the final step → https://challenge.steel.dev


Hey HN!

With all the recent developments in AI web agents space (like OpenAI Operator and Manus), I’ve been connecting with countless engineers looking to navigate this space and build similar capabilities into their own products.

That’s why I created Awesome Web Agents - A comprehensive list of tools, frameworks, and resources focused on AI agents that can browse and interact with the web.

The list includes: Autonomous Web Agents (like OpenAI Operator, Browser-Use), AI Web Automation Tools & Dev Tools, AI-powered Web Scrapers, Research Benchmarks & Datasets, Tutorials & Implementation Guides.

The list is open source and maintained on GitHub. Stars, forks, and contributions welcome! If you’re currently working on a web agent tool or product, I’d love to feature you.

Would love to hear your thoughts/feedback. Any tools I'm missing?


Totally agreed. Only good as inputs to a stack decision but not the deciding factor. WebVoyager itself feels like it's approaching saturation though as scores as getting high and it only tests on a narrow domain of use-cases. We'll definitely see more challenging and interesting evals pop up in the next little bit.

IMHO, if you're building AI products, most of the time building and running your own evals is the only right way to build something good.

BTW - Arch looks super cool! Just starred and looking forward to playing around with it :)


Hey all! Wanted to share this leaderboard we put together to centralize some of the different models available in the AI browser agent space.

Since working on Steel, we've seen a ton of people have a hard time putting the browser agent space and how it's progressing into perspective and it felt odd to us that there were no centralized leaderboards like there were for so many other agentic use cases.

So we launched this leaderboard to help! It's open-source and we're open to any contributions we may be missing. We're committed to keeping this up to date as the space progresses (which it seems to be doing quite quickly).

Let us know if you have any feedback/thoughts :)


Hey HN! We're the team behind steel.dev, and today we're excited to share Surf (https://surf.new) - an open-source playground where you can test and experiment with different web agents. Think of it as a sandbox where you can see how different AI agents/models interact with the web in real-time. With OpenAI's Operator launch last week, web agents are having a moment. But with it being closed source, not having an API, not SOTA, and costing $200 a month, we felt like there was a gap in clearly understanding where the space is at today for both developers evaluating for prod and end-users. The open-source community is actually leading the way in performance (Shoutout Browser-use, currently SOTA on WebVoyager), but testing different approaches requires a bunch of work, setup, and debugging just to get started.

We built Surf to solve this. It's a hosted playground where you can chat with different web agents/models and test automation tasks - basically a "try before you deploy" sandbox for web agents. The most interesting challenge wasn't the agents themselves - it was designing Surf to be generalizable enough for contributors to plug in new agents and models easily.

Right now, you can switch between Browser-use’s agent and our experimental Claude Computer-use-based agent. But the real goal is to make it trivial for anyone building web agents to add their own. The whole thing is open source (https://github.com/steel-dev/surf.new) and built on our Steel's Sessions API. We built it with Next.js, Vercel AI SDK, & LangChain to keep everything clean, familiar, and hackable.

Heads up - it's new, and slow, and agents can fail in all sorts of ways. But that is kind of the point - we want to make it easier for everyone to understand the current state of web agents, whether you’re evaluating them for production use, just curious about how they work or want to automate some tasks for yourself. We also don’t yet support operator features like taking control and multiple tabs yet either.

Try it out at surf.new (no signup needed!), and let us know what breaks. We're actively maintaining it and would love your feedback on what you'd like to see next.


Hey HN! We're the team behind steel.dev, and today we're excited to share Surf (https://surf.new/) - an open-source playground where you can test and experiment with different web agents. Think of it as a sandbox where you can see how different AI agents/models interact with the web in real-time.

With OpenAI's Operator launch last week, web agents are having a moment. But with it being closed source, not having an API, not SOTA, and costing $200 a month, we felt like there was a gap in clearly understanding where the space is at today for both developers evaluating for prod and end-users. The open-source community is actually leading the way in performance (Shoutout Browser-use, currently SOTA on WebVoyager), but testing different approaches requires a bunch of work, setup, and debugging just to get started.

We built Surf to solve this. It's a hosted playground where you can chat with different web agents/models and test automation tasks - basically a "try before you deploy" sandbox for web agents. The most interesting challenge wasn't the agents themselves - it was designing Surf to be generalizable enough for contributors to plug in new agents and models easily.

Right now, you can switch between Browser-use’s agent and our experimental Claude Computer-use-based agent. But the real goal is to make it trivial for anyone building web agents to add their own. The whole thing is open source (https://github.com/steel-dev/surf.new) and built on our Steel's Sessions API. We built it with Next.js, Vercel AI SDK, & LangChain to keep everything clean, familiar, and hackable.

Heads up - it's new, and slow, and agents can fail in all sorts of ways. But that is kind of the point - we want to make it easier for everyone to understand the current state of web agents, whether you’re evaluating them for production use, just curious about how they work or want to automate some tasks for yourself. We also don’t yet support operator features like taking control and multiple tabs yet either.

Try it out at surf.new (no signup needed!), and let us know what breaks. We're actively maintaining it and would love your feedback on what you'd like to see next.


Steel.dev | Founding Engineers (Full-stack, Infrastructure, Applied AI) | Remote EST ± 5 hrs

Steel is building open-source browser infrastructure for AI agents & apps. We make it easy for AI engineers to ship products that interact with the web using our Sessions API - An API that controls fleets of browser sessions in the cloud while we handle everything from scaling to antibot and more.

Our tech stack: - Frontend/Backend: Typescript, React/Next.js, Node.js, Fastify - Infrastructure: Firecracker VMs, Docker, AWS - Key tools: Zod, Drizzle, Redis, PostgreSQL

We're a small but fast-moving team looking for experienced engineers who thrive on autonomy and technical challenges. You'll work directly with our founding team to make Steel the go-to browser infrastructure powering the next billion AI agents. If you love dev tools, open-source, and/or AI, you're at the right door.

Current roles (all roles include competitive comp + equity): - Full Stack Engineer: Build dev tools, APIs, and UIs to help devs control browsers at scale - Infrastructure Engineer: Design, optimize, and scale our browser VM infrastructure - Applied AI Engineer: Work on integrations, agents, and real-world AI use cases

With 2.8k+ GitHub stars, paying customers, and millions of API calls served, we're building a remote-first team that's shaping how humans interact with the internet of the future.

Learn more: https://steel-dev.notion.site/jobs-at-steel To apply: Email careers@steel.dev with Github & resume (mention HN)


Looks awesome! DM'd :)


scalar is the absolute best; we've had a great experience building on them so far :P


Our end goal is to build the LLM OS. We think the best place to start is the hardest but most hairy problem - getting agents to use an internet design for humans on our behalf. So, as we go, we want to keep building towards relieving blockers by handling auth securely, translating webpages, and creating the right toolset for driving sessions. And we want to do this in an open-source and communal way. That's also why we're attacking it bottom-up with infra, so we can build a community of people much smarter than us around getting there. Join the discord if you want to stay on top of our journey :)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: