Hacker Newsnew | past | comments | ask | show | jobs | submit | 0atman's commentslogin

I believe the principles the article author is looking for are core to Pivotal Tracker. I've often tried to persuade teams to use Tracker, but everyone sure loves the whiteboard metaphor!


Unfortunately, Pivotal Tracker was decommissioned on April 30, 2025. It's dead and gone.

Related HackerNews discussion: https://news.ycombinator.com/item?id=41591622


I used to use PT for years. Nowadays I use Linear instead for project management and love it.

In true pivotal tracker spirit their mobile apps are terrible, but perhaps that’s for the best.


My go-to is "Alice has 3 brothers and also has 6 sisters. How many sisters does her brother have?". They all say 6!

This test is nice because, as it's numeric, you can vary it slightly and test it easily across multiple APIs.

I believe I first saw this prompt in that paper two years ago that tested many AI models and found them all wanting.


Wow, I would not have expected frontier models to be caught on something like this but I tried it and they absolutely do. I don't really have a great explanation for why they might have such a hard time with this either. I even tried varying the prompt to make it "easier" by pointing out that Alice was a girl and no change.


> I would not have expected frontier models to be caught on something like this but I tried it and they absolutely do.

This got me curious. Tried on Gemini 2.5 Pro out of curiosity. It gets it right, but it did think about it, and then even explain itself a lot in its official text response.

Thinking:

Here's my thought process for solving this classic riddle: Identify the key players: The problem mentions Alice, her brothers, and her sisters. Establish Alice's family structure: Alice is a girl. She has 3 brothers. She has 6 sisters. Determine the total number of children in the family: Girls: Alice (1) + her sisters (6) = 7 girls Boys: Alice's brothers (3) = 3 boys Total children: 7 girls + 3 boys = 10 children Focus on the question: "How many sisters does her brother have?" Shift perspective: Now, consider one of Alice's brothers (let's call him Bob, although his name doesn't matter). Analyze Bob's siblings: Bob is one of the children in the family. The family consists of 7 girls and 3 boys. Count Bob's sisters: From Bob's perspective, all the girls in the family are his sisters. How many girls are in the family? We already determined there are 7 girls (Alice + her 6 sisters). Count Bob's brothers: From Bob's perspective, the other boys in the family are his brothers. How many boys are in the family? 3. How many other boys are there besides Bob? 3 total boys - 1 (Bob himself) = 2 brothers. (This isn't asked, but it helps confirm the perspective shift). Answer the specific question: The question asks how many sisters Bob (Alice's brother) has. We found there are 7 girls in the family. All 7 girls are his sisters. Final Check: Does the answer make sense? Yes. All the children in a family share the same set of siblings, just categorized differently based on their own gender. All the boys have the same number of sisters, and all the girls have the same number of sisters. Alice has 6 sisters (the other 6 girls). Any brother has 7 sisters (Alice + the other 6 girls).

Final message: Here's how to break down the problem: 1. Identify the total number of girls in the family: Alice (1) + her sisters (6) = 7 girls. 2. Identify the total number of boys in the family: Alice has 3 brothers. 3. Consider one of Alice's brothers: Any one of her brothers is part of the same family. 4. Count his sisters: From the brother's perspective, all the girls in the family are his sisters. So, her brother has 7 sisters.


Reasoning models seem to answer this correctly (tested Gemini pro 2.5, ChatGPT + reason). Grok answered correctly even without turning on "Thinking".


Here is the paper written about this prompt: https://arxiv.org/html/2406.02061v1


o4-mini got this right 4 times out of 4.


o4 got this wrong multiple times. claude 3.7 got it right the first time


10/10, no notes XD


I love LSP as much as the next person, but bacon's flicker-free zero config clippy watching basically TAUGHT me Rust!


I don't know why OP felt moved to write this weird guide, https://leptos.dev is incredible!


This feels like a leptos conspiracy


Welcome to the cargo cult!


For anyone wanting a actual full-stack framework with sensible defaults, type checked sql+api+rsx, and overwhelming community approval, don't reinvent the wheel: It has existed for years and is called https://leptos.dev

(If you're just making a simple rest api backend, I still dig the simplicity of poem-openapi + sqlx)

I can't understand if rust on rails is a joke, badly written by gpt, or written by someone who doesn't understand both Rust and web development in general. It's full of typos and baffling design decisions. (shelling out to an external comand to generate rust code?! MACROS EXIST!)


recognized leptos, dioxus, egui, yew (this one is frontend) what do you think?


I don't understand the question, but only one of those is a full stack web framework.


This is the right answer. How can you go back to a non-typed world after Rust I don't know.

I use html-node which has a typed submodule for even greater strictness. I just build some valid html, make a string and write that to the output directory and get on with my life. Eg: https://github.com/0atman/noboilerplate/

Templates... Rust functions

Flow control... Rust match expression

Abstraction... Rust modules

Html validation... Rust compiler

I like to think that in writing the html-node crate, Vidhan taught the compiler to speak html! https://lib.rs/crates/html-node


YES! sqlx::query_as in the wild!

Making queries like this checks the sql is valid at compile time by connecting to the real database and running it with demo data inside a rolled-back transaction!

(If you'll forgive the self promo, I explained it better in this video https://youtu.be/pocWrUj68tU)

Rust's macro system allows you to program the compiler with each new program, and sqlx is my favourite example of this.


Trust, but verify.


"[If we all wear seat belts] we could end up in a less secure future, where nobody understands why, with red tape that prevents you from finding out what went wrong."

Insane.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: