rohtashotas's comments

rohtashotas · on Feb 22, 2024

It's not a silly mistake. It was rlhf'd to do this intentionally.

When the results are more extremist than the unfiltered model, it's no longer a 'small mistake'

wepple · on Feb 22, 2024

rlhf: Reinforcement learning from human feedback

gnicholas · on Feb 22, 2024

How is this pronounced out loud?

wepple · on Feb 22, 2024

I was just saving folks a google, as I had no idea what the acronym was.

I propose rill-hiff until someone who actually know what they’re doing shows up!

KTibow · on Feb 22, 2024

Realistically it was probably just how Gemini was prompted to use the image generator tool