This is the real nugget of wisdom here. This should be confirmation to everyone that no one understands LLM internals and that these models are not aligned. When they are eventually given control to run things, they will behave in wildly unexpected ways, possibly past the point where we can still change them.
This is a worry that people have been talking about in various forms for a while now, and I think it's a gigantic one. The only reason this was caught is that the quirk was a very noticeable verbal one. When words like "goblin" and "gremlin" pop up, it is easy for us to spot. If the quirk takes another shape (say, ranking people with certain traits as less trustworthy), it might be too subtle or too strange for us to notice. Would I ever notice if ChatGPT consistently rates people born in June as untrustworthy?
The main difference I read is that those airfoils actually come into play when it's not taking off or landing. That still doesn't make it nearly as cool as the air cars in Blade Runner, but it's slightly better than just a helicopter, too.
I want to create a "harness" that does this with Claude Code and other expensive agents.
Buffer user prompts, use conversation history and repo state as context, and run a local model or a cheap, fast cloud model like Haiku to determine the best way to address the user's ask and reframe the query with better context (the user reviews and approves if needed) -- and THEN let an expensive model like Opus have a go at it.
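Roughly something like this; just a non-authoritative sketch with the Anthropic SDK, where the model names and prompt wording are placeholders and the review/approve step would sit between the two calls:

  # hypothetical triage-then-escalate flow; model names are placeholders,
  # check the current model list before using
  import anthropic

  client = anthropic.Anthropic()

  def triage(user_prompt: str, repo_context: str) -> str:
      # cheap, fast model reframes the ask into a precise, self-contained task
      resp = client.messages.create(
          model="claude-3-5-haiku-latest",
          max_tokens=512,
          messages=[{
              "role": "user",
              "content": f"Repo context:\n{repo_context}\n\n"
                         f"User ask:\n{user_prompt}\n\n"
                         "Rewrite this as a precise, self-contained task description.",
          }],
      )
      return resp.content[0].text

  def solve(approved_task: str) -> str:
      # only the approved, reframed task goes to the expensive model
      resp = client.messages.create(
          model="claude-opus-4-1",
          max_tokens=4096,
          messages=[{"role": "user", "content": approved_task}],
      )
      return resp.content[0].text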
If we are operating within the Anthropic ecosystem with Haiku and Opus -- this sort of logic should ideally be doable within Claude Code as the harness. Currently skills cannot be tagged to different models. Ideally we should be able to say -- for trivial tasks, the skill should always use Haiku even if invoked from a session with Opus xhigh.
> Currently skills cannot be tagged to different models. Ideally we should be able to say -- for trivial tasks, the skill should always use Haiku even if invoked from a session with Opus xhigh.
You can set the model for a skill: just set model: haiku at the top and it will use Haiku! You can even set the effort level; look for “Frontmatter reference” in this doc article: https://code.claude.com/docs/en/skills
The same works for subagents: .claude/agents/triager.md with model: haiku plus a Task call from the main loop. The reason to roll your own was the sandbox, not the routing.
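For reference, a subagent definition could look something like this (minimal sketch; the name and description here are made up, and any field beyond model should be checked against the frontmatter reference above):

  ---
  name: triager
  description: Reframes trivial asks into precise, self-contained tasks
  model: haiku
  ---
  You are a triager. Rewrite the user's ask as a precise, self-contained task.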
We considered wrapping Claude Code when we started building Mendral (the agent in the article). We ended up building our own agent instead. It's a lot more work, because we kept up with all the right patterns as the models evolved (sub-agents, proper token caching, redoing basic tools like read, write, edit, bash, etc.). But it paid off over time, since we're building an agent focused on a specific task (not a general coding agent).
The main driver for writing our own agent was to keep the agent loop out of the sandbox (the loop runs on our backend; we call the sandbox only when needed). We wrote another post about that (it's the latest post on the blog).
However, I am curious how you would implement the triager pattern using only Claude Code as the harness.
Is the $10 Pro monthly subscription a prerequisite before I can purchase $10 in API credits?
PS: I would have loved to be able to directly buy $10 in credits and be free to spend it as quickly or as leisurely as I want -- without any monthly expiry or fixed recurring payments.
My time to shine! I spent yesterday morning tracking down the photo and answering this question.
The APOD description is lacking.
Yes, this was an exaggerated stack of 153 four-second exposures (the rejection map of the satellite trails was added on top of the image), and the gaps come from the time the camera spent saving each frame between exposures.
Probably exactly that. If you take a single 10-minute exposure (or really, anything more than a few seconds) you'll get noticeable star trails if you don't put your camera on a tracking mount. Stacking multiple exposures also has other nice benefits, such as noise averaging out and the ability to remove satellite trails.
Last time I did astrophotography was a few years ago, before Starlink made the problem considerably worse, but satellite trails were relatively easy to remove with stacking. I'm sure it's harder now but definitely still possible, so I'm assuming in this case leaving them in was done on purpose to highlight the problem.
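For the curious, the rejection step in stacking is conceptually simple. Here's a toy sketch in numpy (real stacking software is far more careful about alignment and statistics): a satellite streak only shows up in one frame at a given pixel, so a per-pixel outlier test throws it away.

  import numpy as np

  def stack(frames: np.ndarray, kappa: float = 3.0) -> np.ndarray:
      # frames: (N, H, W) array of aligned subexposures
      med = np.median(frames, axis=0)
      std = frames.std(axis=0) + 1e-9
      keep = np.abs(frames - med) < kappa * std   # drop per-frame outliers (streaks, cosmic rays)
      summed = np.where(keep, frames, 0.0).sum(axis=0)
      counts = keep.sum(axis=0).clip(min=1)
      return summed / counts                      # average over the surviving pixels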
EDIT: Looking more closely at the picture, I believe this was taken with a star tracker and then composited with a shorter exposure of the foreground. Notice how the foreground, even far away, looks considerably blurrier than the stars, and how the tower in the background has some light streaks. This is exactly what you'll see if you use a star tracker: rather than star trails, you'll have "foreground trails". This would explain why there are relatively few gaps in the satellite trails, since the exposures can be much longer.
Update: I was wrong, check max-m's sibling comment! The satellites just move really fast across the frame because they're in LEO, so they cover a fair distance during the short pause before the next exposure, which shows up as a small gap.
I am guessing, but I think it likely has to do with the shape and orientation of the satellite with respect to the sun and the camera. Depending on the relative positions, the brightness reflected off the satellite and reaching the camera will change over time.
I've taken long exposures using film (analog, so no stacking or any other funny business) and saw the same thing. I always thought they were planes but now it seems they may have been satellites. I'm curious if someone knows why this happens
Pretty much every DSLR/DSLM camera out there has a "bulb" mode that keeps the shutter open as long as you hold down the shutter button. I think my personal record is a 20-minute exposure.
As for actually holding down the button, you can either use an external wired shutter release with a mechanical lock to hold it down, use a wired controller with an electronic timer, or use a software feature in the camera to set the bulb timer.
For anybody wondering, the reason not to do a single ultra-long exposure is noise.
There's an equilibrium between exposure duration, aperture, and ISO that gives the best results for the conditions with a minimum amount of sensor noise, and getting close to the equilibrium and stacking the images typically gives better results than one massive exposure.
I believe your claim about noise and long exposures is false. To start, I posit that there are three sources of noise:
0) Photon shot noise from the object that you want to photograph. This is an inherent and unchangeable quantum-mechanical fact.
1) Sensor read noise per photo taken. This increases with the number of subexposures.
2) Dark current noise per time and per temperature.
#0 and #2 only depend on the total exposure time, not the number of subexposures. #1 actually gets worse with more subexposures, but what you gain is the ability to reject satellite trails, bad mount tracking, cosmic rays, wind gusts, rolling clouds, and other transient artifacts. Whereas if you took a single hour-long exposure, it's essentially guaranteed to be ruined by something.
As for ISO, it is very commonly misunderstood. ISO amplifies photon noise and dark current noise, and changing the ISO doesn't make your images better or worse in those respects. ISO in the form of analog gain can help boost the signal above the analog-to-digital converter noise, and that's what it's useful for. The MinutePhysics video explains this excellently: https://www.youtube.com/watch?v=ZWSvHBG7X0w . More and more sensors these days approach "ISO invariance", where analog amplifier gain has about the same effect as digital gain (i.e. multiplying the measured numbers on a computer).
Exactly what I'm refuting:
> exposure duration
In astronomy, more is better. Get as much total exposure time as you can afford (e.g. time spent at a suitable location, time spent monitoring the equipment, time under clear skies).
> aperture
In astronomy, more is better. Buy the biggest aperture you can afford - obviously, subject to constraints such as cost, weight, mountability, focal length. Also, telescopes don't have adjustable aperture blades, unlike general photographic lenses. You could put a disc cut-out in front of the telescope to close down the aperture, but that's just a waste of light.
> minimum amount of sensor noise
You get the least amount of sensor noise by reducing the exposure time and reducing the temperature (dedicated astro cameras have Peltier cooling). Note that although noise increases with time, signal increases with time faster, so the signal-to-noise ratio is proportional to the square root of time. So 100× more exposure time gives you a 10× better SNR.
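Quick sanity check on that last claim:

  signal ∝ T,  shot noise ∝ sqrt(T)  ⇒  SNR ∝ T / sqrt(T) = sqrt(T)  ⇒  sqrt(100) = 10× better SNR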
> stacking the images typically gives better results than one massive exposure
This is the main falsehood that I wanted to address. Taking multiple images actually gives more noise overall, even if only by a tiny bit. But multiple images give you much more processing flexibility and the ability to selectively reject things.
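To put the "tiny bit" in numbers: for a fixed total exposure time T split into N subexposures, with the three noise sources above added in quadrature,

  σ_single = sqrt( σ_shot(T)² + σ_dark(T)² + σ_read² )
  σ_stack  = sqrt( σ_shot(T)² + σ_dark(T)² + N·σ_read² )

The only extra term is the N-fold read noise, which for reasonably long subexposures is often a small price for being able to throw away ruined frames.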
My Canon can do this without modification and it's 8 years old. Switch to bulb mode and use an external mini device connected via a microphone-style cable; it sends the signal to close the shutter after x minutes.
For extra-long exposures it's also recommended to use a stable power source.
How is a 10 minute continuous exposure functionally different from 10 minutes of video with every frame stacked? In the former, each photodiode acts as a compositor for each pixel instead of whatever algorithm is chosen to combine frames in the latter?
You pay the read noise every time you read out the sensor and digitize the values. Also, you lose a tiny bit of time between exposures as the sensor resets itself. And you might have a bottleneck in moving the data off the sensor and saving the image. Furthermore, if you perform lossy compression on the video, then your digitally stacked image will differ significantly from analog stacking on the silicon sensor.
> The challenge is: when you let a session idle for >1 hour, when you come back to it and send a prompt, it will be a full cache miss, all N messages. We noticed that this corner case led to outsized token costs for users.
I don't agree with this being characterized as a "corner case".
Isn't this how most long-running work will happen for all serious users?
I am not at my desk babysitting a single CC chat session all day. I have other things to attend to -- and that was the whole point of agentic engineering.
Don't CC users take lunch breaks?
How are all these utterly common scenarios being labeled corner cases -- as if they were wildly out of the norm and UX can be sacrificed for them?
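To make the cost jump concrete (very rough numbers on a hypothetical 200k-token session; this assumes Anthropic's published prompt-caching multipliers of roughly 0.1× the base input price for cache reads and 1.25× or more for cache writes):

  warm turn, cache hit:            200k × 0.1  ≈ the cost of  20k uncached input tokens
  post-lunch turn, cache expired:  200k × 1.25 ≈ the cost of 250k uncached input tokens

That one resumed prompt costs an order of magnitude more than the same prompt sent while the cache was still warm.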
This largely appears to be an HTML generator at its core, not necessarily what Figma does with layers/canvases etc. There's no collaborative aspect to it either.
It feels like a lightly designed product that moves the claude CLI to their backend, generates the HTML, and renders it in the browser on the claude.ai website for you. Sure, it accepts your design system as input or imports it from your repo, but couldn't you feed the same into the claude CLI yourself?
I'm curious what exactly it gives you beyond the claude CLI plus prompting it well with your design system and skills.
The IBM/Microsoft analogy is a classic. It’s always fascinating to watch these 'frenemy' dynamics play out. In these cases, the one who owns the direct interface with the end user usually wins the long game, while the 'infrastructure' partner risks becoming just another utility. It will be interesting to see if Canva can maintain its identity or just become a shell for Claude's output.
Yep, agree: it looks like it’s taking the existing generated artefact, parameterising it within an inch of its life, exposing a pseudo-WYSIWYG for the parameters, and calling it a day with a few export options. Not a huge leap from what they’ve got already, but it’s a clever adjacent step for sure. Same product, new chrome.
What dangers lurk beneath the surface.
This is not funny.