Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You can certainly try this kind of data augmentation strategy. Pretty sure it'll fail because of the text analysis still being biased.

You would need equalized amounts of text referring to various groups of people and topics too. That's much harder to augment.



What's stopping someone from taking training text that mentions races by name (using any common name) and having it automatically changed to a different race for a new training sample?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: