Due to "politics" my teams main responsibility went from developing ai agents to just testing out a chat it developed by another team.
When you spend 8 hours a day doing mind numbing tasks, tasks that won't help you land another job and is constantly under stress of being fired, its bad for your mental health
It's hard to imagine if you have not experienced it. The air would still be hot even after the sun sets in some parts of India. Usually when wind blows over you you feel cool. With hot air it's like a blow dryer in your face. Just thermal energy being dumped on you making you feel even worse
There is also the possibility that an LLM judge would be happy with some code that looks like LLM generated code. But a maintainer for a specific project might not merge it for stylistic reasons
> The main idea is we provide documents to the LLM and it asks lot of questions which clear ambiguity and possible misconceptions the LLM might have
This kind of works but the difficulty is that you have to be very explicit about everything. It was mentioned in a spec document that a particular excel file is treated as a source of truth throughout the whole company and it is treated as an append only database. The agent still decided to add a check to see if a previous row was modified. It pushed back on its decision when asked why it decided to do so. "What if someone entered it wrong and had to correct it"? Valid question but it's not my teams responsibility to check for it
This check makes sense from a traditional development view point and that's why the agent did it. I would say it's good practice too but it's beyond the scope of the project it was working on. If what you are doing is beyond the norm you have to watch out for things like this
During the time that this paper was written agents were not really a thing. I would be more concerned about centralisation of work itself as a bigger concern
I wish I could find it but Simon Clark, someone who specialized in climate science, had put out a video about how we were only recently able to model the AMOC and it's shifting patterns thanks to this measurement we were doing
Yrah probably someone at doge or the dept of war watched that and suggested dismantling it. What's next, closing down the NOAA because it's a "con job" and climate change is "a hoax"?
I've become a big fan of Simon lately. It's difficult to make videos about such a serious and all-encompassing topic without either falling into some combination of doomerism, misinformation, or apathy. He's got a real talent for bringing his expertise to a general audience, and I always come away from a video being clear about what the problem is he's talking about, why it matters but importantly how we deal with it (and how I can deal with it personally, without falling into the "only individual actions matter" trap)
reply