Someone has to generate the training data, and they will not be paid by their work unless the business model for this stuff changes from "scrape the entire public internet and insist that fair use means you don't have to pay anyone a single fraction of a cent for their work".