Wednesday, March 4, 2026

Phasic Dopamine, Cue Sensitization, and the Emergence of Addiction: A Computational Exploration

For a list of all posts go here.

Introduction

In the previous post, we examined how tonic dopamine shapes motivation, curiosity, and behavioral vigor. We saw that different motivational regimes — apathy, healthy goal pursuit, compulsion, and oscillatory instability — emerge from the interaction between tonic dopamine and hedonic adaptation.

However, none of those regimes produced addiction.

The agent never pursued the target in the absence of pleasure.
It never developed persistent attraction to neutral stimuli.
It never exhibited craving.

This limitation was intentional.

To understand addiction-like behavior, we must introduce a second dopaminergic dynamic: phasic dopamine.

While tonic dopamine energizes behavior globally, phasic dopamine encodes rapid, transient bursts linked to reward prediction errors. In this post, we extend the previous model by adding phasic dopamine and cue sensitization — and examine how addiction-like dynamics emerge.


Conceptual framing

The extended model preserves all assumptions from the tonic dopamine simulation, including:

  • An interoceptive hedonic target.

  • Hedonic adaptation via exponential decay of target gain.

  • Tonic dopamine (DATonic) scaling global motivational vigor.

The critical additions are:

  1. A cue colocated with the target.

  2. A phasic dopamine burst (DAPhasic) triggered upon unexpected reward.

  3. A sensitization mechanism, whereby repeated phasic bursts increase the gain of the cue representation.

In this framework:

  • DATonic scales overall motivation.

  • DAPhasic updates the salience of specific states.

  • Addiction emerges when cue salience persists despite declining hedonic value.


The extended simulated environment

The environment is identical to the previous simulation, with one addition:

  • A neutral cue is colocated with the hedonic target.

Initially:

  • The cue has no or little initial intrinsic value.

  • It merely marks the spatial location of reward.

Over time:

  • Phasic dopamine bursts strengthen the cue’s salience.

This creates a divergence between:

  • Hedonic value (which decays),

  • Cue salience (which can increase).


Phasic dopamine as reward prediction error

Phasic dopamine bursts are modeled as transient increases triggered when:



This mechanism produces incentive sensitization.

Crucially, hedonic adaptation continues independently.


Dissociation between “liking” and “wanting”

As the agent repeatedly reaches the target:

  • Hedonic value decays exponentially.

  • Cue salience increases incrementally.

Eventually:

  • Pleasure decreases.

  • Cue-driven motivation increases.

This produces a state where:

The agent strongly “wants” what it increasingly fails to “like.”

This is the computational signature of addiction.


Emergent behavioral regimes

As in the tonic dopamine model, distinct regimes emerge from parameter interactions.

Here, the key dimensions are:

  1. DATonic (global motivation)

  2. Cue sensitization rate (α)

  3. Hedonic adaptation rate


(a) Low sensitization

If cue gain increases, but remains low:

  • Behavior resembles the tonic dopamine regimes.

  • No addiction occurs.

  • Hedonic decay eventually extinguishes pursuit.

This replicates the previous post’s findings.


(b) Strong sensitization

Observed behavior:

  • Agent continues to approach the cue even after hedonic value declines to negligible values.

  • Seeking becomes decoupled from pleasure.

  • Reduced behavioral flexibility.

Interpretation:

  • Full addiction-like behavior.

  • Cue-driven motivation dominates.

  • Cue salience overrides hedonic feedback.









Figure 1. Divergence of hedonic value and cue gain.

Caption:
The hedonic target (blue) is colocated with a neutral cue. Phasic dopamine bursts occur when the agent unexpectedly receives reward, increasing cue salience. Hedonic value decays with repeated consumption, while cue gain increases through phasic dopamine bursts, producing dissociation between liking and wanting. However, since cue salience remains low, no addiction is observed.









Figure 2. Addiction-like regime.

Caption:
Persistent approach behavior driven by cue salience, despite declining hedonic value.


Comparison with tonic dopamine regimes

The contrast with the previous post is illuminating:

Tonic-only modelPhasic-extended model
Motivation tied to hedonic valueMotivation can decouple from hedonic value
No persistent seeking without pleasurePersistent seeking despite low pleasure
Compulsion without addictionCue-driven addiction-like behavior


Tonic dopamine energizes behavior.

Phasic dopamine assigns salience.

Addiction requires both.


Why tonic dopamine alone cannot produce addiction

In the tonic-only model:

  • Motivation collapses when hedonic value decays.

  • No cue becomes intrinsically attractive.

  • Behavior remains tied to interoceptive reward.

Therefore:

Tonic dopamine explains vigor, not craving.

Only when phasic dopamine strengthens cue representations does persistent seeking emerge.


Addiction as maladaptive precision

In active inference terms, addiction can be interpreted as:

  • Excessively precise priors over cue-related policies.

  • Overweighting of exteroceptive salience.

  • Reduced influence of interoceptive negative feedback.

This reframes addiction as a disorder of precision weighting rather than simple reward excess.


What this model does not explain

This minimal model does not include:

  • Withdrawal states,

  • Stress modulation,

  • Trauma interactions,

  • Noradrenergic arousal systems.

These elements are critical in real addiction and will require further extensions.


Looking ahead

In the next step of this research, we will explore how threat, arousal, and avoidance systems interact with dopaminergic motivation.

If phasic dopamine explains why we pursue cues despite declining pleasure, understanding trauma will require modeling how certain cues become associated not with reward, but with threat — and how avoidance becomes compulsive.

Addiction and trauma may ultimately emerge as dual pathologies of precision in opposite motivational directions.


Conclusion

By extending the tonic dopamine model to include phasic bursts and cue sensitization, we observe the emergence of addiction-like dynamics.

The critical mechanism is not increased pleasure, but the divergence between:

  • hedonic value (which declines),

  • and cue salience (which increases).

This dissociation between liking and wanting transforms motivated behavior into persistent, inflexible seeking.

Tonic dopamine energizes behavior.
Phasic dopamine reshapes what behavior is directed toward.

Together, they form the computational backbone of addiction.



Building Proto-Affective Agents with Active Inference

For a list of all posts go here . Building Proto-Affective Agents with Active Inference Most artificial agents today are built around a simp...