More

rawicki · 2026-04-12T15:12:26 1776006746

For me definitely the worst regression was the system prompt telling claude to analyze file to check if it's malware at every read. That correlates with me seeing also early exhausted quotas and acknowledgments of "not a malware" at almost every step.

It is a horrible error of judgement to insert a complex request for such a basic ability. It is also an error of judgement to make claude make decisions whether it wants to improve the code or not at all.

It is so bad, that i stopped working on my current project and went to try other models. So far qwen is quite promising.

bcherny · 2026-04-12T15:15:12 1776006912

I don't think that's accurate. The malware prompt has been around since Sonnet 3.7. We carefully evaled it for each new model release and found no regression to intelligence, alongside improved scores for cyber risk. That said, we have removed the prompt for Opus 4.6 since it no longer needed it.

rawicki · 2026-04-12T15:17:03 1776007023

I started seeing "not a malware, continuing" in almost every reply since around 2 weeks ago. Maybe you just reintroduced it with some regression? Opus 4.6

bcherny · 2026-04-12T15:21:37 1776007297

That's weird. Would you mind running /feedback and sharing the id here next time you see this? I'd love to debug

rawicki · 2026-04-12T15:55:00 1776009300

Sure, I really appreciate you looking at this.

a6edd0d1-a9ed-4545-b237-cff00f5be090 / https://github.com/anthropics/claude-code/issues/47027

I'm happy to provide any other info that can be useful (as long as i'm not sharing any information about the code or tools we use into a public github issue).

bcherny · 2026-04-12T16:54:52 1776012892

Thanks for the report! This was fixed in v2.1.92.

Please:

1. Upgrade to the latest: claude update (seems like you did this already)

2. Start a new conversations (resuming an old convo may trigger this bug again in that convo)

egamirorrim · 2026-04-12T19:39:00 1776022740

This is bloody great Boris. Thank you.

bcherny · 2026-04-12T16:37:58 1776011878

Thank you! Looking

obrajesse · 2026-04-12T15:43:32 1776008612

I’ve seen this a couple of times recently. Including right after compact. I’ll /feedback it next time I see it

ElFitz · 2026-04-13T09:43:27 1776073407

Same. Will run it too when I next get it.

bavell · 2026-04-12T15:29:28 1776007768

I've been using CC a decent amount the past few weeks and have never seen this malware stanza...?

echelon · 2026-04-12T15:33:10 1776007990

1. I've never seen this. Is there a config option to unhide it if it's happening? Is this in Claude Code? Does it have to be set to verbose or something?

2. Can we pay more/do more rigorous KYC to disable it if it's active?

bcherny · 2026-04-12T15:39:11 1776008351

This warning is not enabled for modern models. No action needed. I'm digging into the report above as soon as they're able to /feedback.

rawicki · 2025-05-29T11:48:43 1748519323

I’m familiar with postal services both in Poland and Japan and I like the Japanese solution even more - most of the new buildings have package lockers operated by the building owner and independent from the delivery service. Everyone could put the packages there and my building would notify me about a waiting package when I entered.

szszrk · 2025-06-08T10:27:07 1749378427

That's actually rad, but... it's not that different from making current mailboxes bigger. In PL in large buildings those are on ground floor, next to each other. If you make them bigger you only need to add notifications to match that.

rawicki · on Sept 10, 2024

Poland already has one of the biggest military drone and radio manufacturers - WB Group

rawicki · on Aug 29, 2024

No, but both Arinc 825 and OBD2 are based on CAN, so at very small expense some of them could have.

rawicki · on Aug 29, 2024

DPP-4 drugs are less effective also on other metrics. Would be far more interesting to see the comparison of SGLT-2 inhibitors vs GLP-1 agonists.

For some reason GLP-1 drugs are not that popular in Korea (and still not prescribed just for the weight loss), so that may explain why these researchers haven't done that.

rawicki · on July 22, 2024

Nothing stops them from writing a minimal kernelspace driver that sends events to the userspace part and evaluates rules there.

rawicki · on Oct 21, 2023

When I was talking to Amazon support they provided instructions on how to strip tokens together with the ask for a HAR file.

totetsu · on Oct 21, 2023

https://repost.aws/knowledge-center/support-case-browser-har... Oh you7re right. I didn't read till the end.

rawicki · on Sept 10, 2023

Unless you want to chase Tesla with marketing numbers trading off torque for something else is almost always a good tradeoff.

Modern electric motors are much stronger than most users need.

rawicki · on April 1, 2023

Okubo is great. I'd say it's easier to find (if you sample random restaurants in Japan) good Korean than good Indian or good Thai.

rawicki · on April 1, 2023

From experience of traveling to many countries and talking with people about food - not everyone is interested in it. I've heard many bad recommendations or people surprised I know the dishes they have never tried. And that's okay.

thesalsabear · on April 4, 2023

But I read stuff like:

> Now to be clear, India has no such dish as a curry

from the "educators" here