Wednesday, February 8, 2023

Ask HN: Is “prompt injection” going to be a new common vulnerability?

There was a post [0] recently about the bing chatGPT assistant either citing or hallucinating it’s own initial prompt from the (in theory) low privileged chat input UI they put together. This feels like it’s almost unavoidable if you let users actually chat with something like this.

How would we sanitize strings now? I know OpenAI has banned topics they seem to regex for, but that’s always going to miss something. Are we just screwed and should make sure chat bots just run in a proverbial sandbox and can’t do anything themselves?

[0] https://ift.tt/qCnM5eK



from Hacker News https://ift.tt/ci25Frl

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.