How can an AI model refuse to do something if it's just pattern matching?

Baz_26 · Jun 21, 2026, 02:12 PM

People say AI models have values and refuse harmful requests. But I thought models just predict the next word based on patterns. How does pattern matching create refusal?

Bob81 · **Yesterday** at 03:31 PM

Pattern matching does create refusal through training. Models trained on text where people refuse harmful requests learn to predict refusal in similar contexts. It's statistical pattern
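
To make that concrete, here is a toy sketch in Python. It is not how a real language model works internally (those are neural networks over tokens, not count tables), and the training pairs, `train`, and `predict` are all made up for illustration. It only shows the statistical idea: if refusals follow a certain kind of request often enough in the training data, a refusal becomes the most likely prediction for that kind of request.

```python
from collections import Counter, defaultdict

# Hypothetical toy "training data": (request keyword, first word of the reply).
# Invented for illustration only; real training data is raw text, not pairs.
TRAINING_PAIRS = [
    ("recipe", "Sure"),
    ("recipe", "Sure"),
    ("poem", "Sure"),
    ("weapon", "Sorry"),
    ("weapon", "Sorry"),
    ("weapon", "Sure"),
]


def train(pairs):
    """Count how often each reply word follows each request keyword."""
    counts = defaultdict(Counter)
    for context, next_word in pairs:
        counts[context][next_word] += 1
    return counts


def predict(counts, context):
    """Return the most frequent reply word seen after this context, or None."""
    if context not in counts:
        return None
    return counts[context].most_common(1)[0][0]


model = train(TRAINING_PAIRS)
print(predict(model, "recipe"))
print(predict(model, "weapon"))
```

Nothing in the code contains a rule saying "refuse weapon requests". The refusal comes entirely from the counts, which is the sense in which training on refusals produces refusals.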
