Retro-Future Columnist

The atmosphere in meeting rooms can sometimes become too quiet. The more polished answers LLMs provide, the deeper the silence grows, yet in practice, a subtle unease—hard to articulate—sometimes serves as the last safety net in decision-making.[2][5] The logic might be sound, but there’s a feeling that somewhere, the assumptions are off. We still haven’t fully put into words this faint friction.[3][7] Now, we are being challenged to consider whether such a feeling truly has value in decision-making in the AI era.

Discussions about human intuition have a long foundation. Kahneman and Gary Klein clarified that while intuition is not always correct, it can be trusted under conditions where the environment is regular, learning opportunities exist, and feedback is swift.[1][4][12] The recognition that veteran firefighters and commanders sense danger before calculations supports this lineage.[4][9] Crucially, intuition is not mystical but a rapid judgment formed through accumulated experience.

However, LLMs do not possess this kind of experience. They can statistically string together the next word from vast text datasets, yet they don’t physically remember the heat of a fire scene or the moment an organizational atmosphere shifts.[8][9] Therefore, although their answers might be smooth, the basis of their judgment differs from humans’. The more polished LLM-generated text is, the easier it is for us to confuse 'sounding plausible' with 'withstanding reality.'[3][6] The role of that uneasy sensation as a gauge of this gap is significant.

A 2023 study showed that even with AI predictions accompanied by explanations, people sometimes override them using their intuition.[2][11][13] The study identified three pathways: intuition about the outcome, intuition about features, and intuition about the AI’s limits.[2][11] In short, people don’t reflexively reject AI; they use different senses to assess the output content, explanation’s logic, and the model’s limitations. Adding an explanation doesn’t erase all doubts.[7][11]

This is particularly crucial when considering LLM-assisted decision-making. A 2024 review organizes this by emphasizing that beyond explainability, accountability and psychological factors heavily influence decisions involving LLMs.[3] The issue is not settled by 'high accuracy' alone. Who takes ultimate responsibility, where do humans step in, and in which contexts does an explanation generate doubt rather than reassurance?[3][7] LLMs provide answers but won’t automatically fill in institutional design around their use.

What emerges here is that the feeling of unease is not mere emotion but a cognitive resource to foster appropriate reliance. While the dangers of over-trusting AI are often discussed, the risk of losing sight of when to doubt AI remains underappreciated.[7][11] If organizations adopt LLMs but eliminate space for humans to say 'something’s wrong,' efficiency may rise but the circuits preventing errors will weaken. Errors that proceed silently are the hardest to correct.[3][7]

Still, glorifying unease is also hazardous. As Kahneman and Klein argued, intuition helps only when surrounded by an environment conducive to learning and verifiable feedback.[1][10][12] Thus, unease toward LLMs must connect not to vague feelings or biases but to procedures confirming which assumptions are suspicious. There needs to be a pathway that channels unease back to fact-checking, comparative evaluation, and responsibility allocation.[3][7]

What remains unconfirmed is where human unease genuinely protects outcomes and where it might instead amplify bias or conservatism.[3][7] In high-responsibility domains like healthcare, finance, hiring, and policymaking, it requires longer observation to understand if AI explanations truly aid human judgment or merely offer false reassurance.[3][7] What we need most now is not decisive answers but perspectives tracking when people overrule or overlook AI outputs.

As LLMs become widespread, we acclimate to the speed of answers. Yet what society must preserve is not just speed.[3][7] It is human ability to tune in when small cracks appear in assumptions—and how that sensitivity is safeguarded institutionally. Unease is vague, but its ambiguity can serve as the final check.[1][2][7] Going forward, attention should be on not just model performance curves but whether the human circuits to say 'wait' still function.