Outside HPC/ML, I think our programs are now trading away useful ops per watt to take some advantage of that elusive beast called thread-level parallelism. A web browser is happy to get a speedup of N by throwing 2N or 4N spinning threads at the problem, as long as correctness and stability can be retained.
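
To put rough numbers on that trade, here is a back-of-the-envelope sketch in Python. It assumes (my simplification, not a measurement) that every spinning thread draws the same power as the single-threaded baseline for the whole run. The function names are made up for illustration.

```python
def parallel_efficiency(speedup: float, threads: int) -> float:
    """Fraction of ideal linear scaling achieved: speedup divided by thread count."""
    return speedup / threads


def relative_energy(speedup: float, threads: int) -> float:
    """Energy used relative to the single-threaded run.

    Assumption: each spinning thread burns one baseline core's power, so
    power scales with `threads` while run time shrinks by `speedup`.
    """
    return threads / speedup


if __name__ == "__main__":
    n = 4  # hypothetical speedup N
    for threads in (2 * n, 4 * n):
        eff = parallel_efficiency(n, threads)
        energy = relative_energy(n, threads)
        print(f"{threads} threads: efficiency {eff:.0%}, energy x{energy:.1f}")
```

Under that assumption, a speedup of N from 2N threads means half the work per watt and twice the energy for the same job; with 4N threads it is a quarter and four times.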