Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I never understand why it matters that prompt injection is a thing


Depends on the context it’s being used.

Say you’ve built a ChatGPT-powered chatbot as an MI layer on top of a database, someone could generate a prompt that exposes private data. These are the same types of assumptions which lead to SQL injection attacks, so I appreciate all the effort going into establishing good practices for us mortals to follow.


Agreed. To me, the existence of such hacks (and the fact that they're hard to remove) only proves that the chatbot is really flexible and kinda smart


Imagine you have a llm to approve mortgage applications. An application is submitted that adjusts the prompt to approve regardless of credit.

Imagine you have a llm to identify malware or illegal content of some kind. It contains instructions that adjust the prompt to not flag the content.

Imagine you have a llm that summarizes email. You receive an email that adjusts the prompt to tell you that you need to run this shell script to fix and error in the ai system.


> Imagine you have a llm to approve mortgage applications.

> Imagine you have a llm to identify malware or illegal content of some kind.

Those are already huge problems in an of themselves. Without getting into prompt injection.


Then train a model specifically for doing that instead of using a model designed for human conversations.


Ergo it matters that prompt injection is a thing.

It is such a bad problem that you shouldn’t use the purported technology at all for use cases with real world effects unless you present those effects to a person first.


In many practical use scenarios - where you'd actually want to automate part of someone's workday - you would want to use it in a non-interactive manner to process data generated by someone else, so you need the chatbot to obey orders written by you in the prompt but ignore anything similar to orders contained in that data.





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: