Healthy VolunteersLSDLSD

Effect of lysergic acid diethylamide (LSD) on reinforcement learning in humans

In a within-subjects study using a probabilistic reversal learning task and computational modelling, intravenous LSD (75 μg) increased reinforcement learning rates—more for reward than punishment—and decreased stimulus stickiness (greater exploratory behaviour), while leaving reinforcement sensitivity and win‑stay/lose‑shift probabilities unchanged. These effects suggest LSD induces a state of heightened plasticity that could facilitate revision of maladaptive associations relevant to psychiatric treatment.

Authors

Cardinal, R. N.
Carhart-Harris, R. L.
den Ouden, H. E. M.

Published

December 9, 2020

Psychological Medicine

individual Study

Links

Read Paper DOI Google Scholar

Abstract

Abstract The non-selective serotonin 2A (5-HT 2A ) receptor agonist lysergic acid diethylamide (LSD) holds promise as a treatment for some psychiatric disorders. Psychedelic drugs such as LSD have been suggested to have therapeutic actions through their effects on learning. The behavioural effects of LSD in humans, however, remain largely unexplored. Here we examined how LSD affects probabilistic reversal learning in healthy humans. Healthy volunteers received intravenous LSD (75μg in 10 mL saline) or placebo (10mL saline) in a within-subjects design and completed a probabilistic reversal learning task. Participants had to learn through trial and error which of three stimuli was rewarded most of the time, and these contingencies switched in a reversal phase. Computational models of reinforcement learning were fitted to the behavioural data to assess how LSD affected the updating (“learning rates”) and deployment (“reinforcement sensitivity”) of value representations during choice, as well as “stimulus stickiness”, which assays choice repetition irrespective of reinforcement history. Conventional measures assessing sensitivity to immediate feedback (“win-stay” and “lose-shift” probabilities) were unaffected, whereas LSD increased the impact of the strength of initial learning on perseveration. Computational modelling revealed that the most pronounced effect of LSD was enhancement of the reward learning rate. The punishment learning rate was also elevated. Stimulus stickiness was decreased by LSD, reflecting heightened exploratory behaviour, while reinforcement sensitivity was unaffected. Increased reinforcement learning rates suggest LSD induced a state of heightened plasticity. These results indicate a potential mechanism through which revision of maladaptive associations could occur in the clinical application of LSD. Significance statement The psychedelic (“mind-manifesting”) drug LSD holds promise for the treatment of some psychiatric disorders. Theories have postulated its therapeutic potential centres on enhancing learning and flexible thinking. Here we provide substantiating empirical evidence by examining the computations underlying behaviour as healthy volunteers learned through trial and error under LSD. Viewing choice as based on representations of an action’s value, LSD increased the speed at which value was updated following feedback, which was more pronounced following reward than punishment. Behaviour was also more exploratory under LSD, irrespective of the outcome of actions. These results indicate that LSD impacted a fundamental belief-updating process inherent in the brain which can be leveraged to revise maladaptive associations characteristic of a range of mental disorders.

Unlocked with Blossom Pro

Research Summary of 'Effect of lysergic acid diethylamide (LSD) on reinforcement learning in humans'

Introduction

LSD is a non-selective 5-HT2A receptor agonist whose therapeutic potential in psychiatry has been increasingly discussed. Theoretical accounts posit that psychedelics promote plasticity and the revision of maladaptive beliefs, but objective tests of how LSD alters learning and cognitive flexibility in humans are limited. Serotonin and dopamine systems, both implicated in feedback-driven learning and flexibility, are candidate neurochemical mediators; translational work (including serotonergic depletion and dopaminergic manipulations in animals) shows these systems influence perseveration and sensitivity to reward and punishment. Kanen and colleagues set out to test whether acute LSD alters probabilistic reversal learning (PRL) in healthy volunteers and to characterise any effects using computational reinforcement learning models. The study specifically probed whether LSD changes sensitivity to immediate feedback (win-stay/lose-stay), the influence of previously acquired values on perseveration, and the rates at which value is updated after reward or punishment; it also examined a parameter indexing stimulus-driven choice independent of outcome ("stimulus stickiness").

Methods

The study used a within-subject, placebo-controlled design in which 19 healthy volunteers attended two sessions at least two weeks apart and received intravenous LSD (75 μg in 10 mL saline) on one session and placebo on the other. Participants were blinded to condition but experimenters were not. Injection was given over two minutes via an antecubital cannula; subjective effects were reported 5–15 minutes after dosing and the probabilistic reversal learning task was administered approximately five hours post-injection. A psychiatrist assessed participants for suitability for discharge once subjective effects subsided. The behavioural task comprised 80 trials split into an acquisition phase (first 40 trials) and a reversal phase (last 40 trials). On each trial participants chose one of three visual stimuli. In acquisition one stimulus yielded positive feedback on 75% of trials, a second on 50% of trials (a neutral stimulus added in this variant), and a third on 25% of trials; after 40 trials the 75% and 25% contingencies were swapped. Positive feedback was a green smiling face, negative feedback a red frowning face. The design allowed separate assessment of learning to select a rewarding stimulus and learning to avoid a punishing stimulus. Conventional analyses examined overall task performance (number of choices per stimulus, acquisition and reversal correct responses), win-stay and lose-stay probabilities (stay rate after wins or losses; in a three-choice task the base-rate stay is 33%), and perseverative errors during reversal (defined as two or more responses to the previously correct stimulus, excluding the first reversal trial). Null-hypothesis tests used α = 0.05. Computationally, three hierarchical reinforcement learning models were fitted to the choice data using Hamiltonian Markov chain Monte Carlo in Stan. Models were compared by bridge sampling to estimate marginal likelihoods. The winning model included four parameters: reward learning rate (α_rew), punishment learning rate (α_pun), reinforcement sensitivity (τ_reinf, comparable to inverse temperature), and stimulus stickiness (τ_stim). The hierarchical Bayesian structure treated drug condition at the top level and subjects below, with parameter estimates compared across conditions using highest posterior density intervals (HDIs). Convergence was checked with R-hat (<1.2) and parameter differences were assessed across the posterior samples to test whether HDIs excluded zero.

Results

Overall task performance was preserved under LSD. Repeated-measures ANOVA on the number of times each stimulus was chosen revealed a strong main effect of stimulus and a stimulus × phase interaction, but no interactions involving drug; paired t-tests showed no difference in number of correct responses between placebo and LSD during acquisition (t18 = 0.84, p = .4, d = .19) or reversal (t18 = 0.23, p = .8, d = .05). Perseverative errors as a subset of reversal errors did not differ between conditions (t18 = 0.03, p = .98, d = .01). The extracted text contains an incomplete report of the analysis relating initial learning to perseveration; the precise regression results are not clearly reported in the extraction. Immediate feedback sensitivity was unaffected by LSD. There was a robust main effect of valence—participants were more likely to stay after wins than after losses (F1,18 = 37.76, p = 8.0 × 10^-6)—but no main effect of drug (F1,18 = 0.20, p = .66) and no valence × drug interaction (F1,18 = 0.63, p = .44). Model comparison favoured the four-parameter reinforcement learning model (separate reward and punishment learning rates, reinforcement sensitivity, and stimulus stickiness). Convergence diagnostics were acceptable (R-hat < 1.2). Computational parameter estimates showed that LSD markedly increased the reward learning rate (LSD mean = 0.87; placebo mean = 0.28); the posterior 99.9% highest posterior density interval for the difference excluded zero. The punishment learning rate was also elevated under LSD (LSD mean = 0.48; placebo mean = 0.39), with the drug difference excluding zero at the 99% HDI. The effect on reward learning rate was larger than on punishment learning rate (the extracted text reporting the exact statistic is truncated). Stimulus stickiness was reduced by LSD (LSD mean = 0.23; placebo mean = 0.43), with the drug difference excluding zero at the 90% HDI. Reinforcement sensitivity (τ_reinf) was not credibly altered by LSD (LSD mean = 4.70; placebo mean = 5.57; 0 within the 95% HDI).

Discussion

The investigators interpret their findings as evidence that LSD increases the rate at which humans update value representations following prediction errors, particularly for rewards. They argue this pattern is consistent with the idea that psychedelics promote psychological plasticity by "relaxing priors," making individuals more sensitive to new evidence and thereby facilitating belief updating. The asymmetric enhancement—greater for reward than punishment—suggests that LSD differentially affects appetitive learning processes. Mechanistically, the authors note that a specific neurochemical attribution is not possible given LSD's broad pharmacology, but highlight 5-HT2A and D2 receptor systems as plausible contributors. They reference supporting animal data: optogenetic stimulation of dorsal raphe serotonin neurons increased reinforcement learning rates and tracked prediction errors in mice, and prior work in rodents showed LSD effects on reversal learning that were blocked by the 5-HT2A antagonist ketanserin. Dopaminergic influences are also considered, given dopamine's established role in reward-driven updating and prior human genetic associations implicating dopamine in acquisition–perseveration relationships. Interactions between serotonin and dopamine systems are proposed as another candidate mechanism. The authors reconcile an apparent paradox—enhanced learning rates alongside observations of cognitive inflexibility under LSD—by emphasising timing. When acquisition and reversal both occur under LSD, enhanced learning during acquisition may be "stamped in," making subsequent reversal harder; conversely, if acquisition occurs before drug administration, LSD can facilitate reversal. This timing-dependent effect is discussed as clinically relevant because maladaptive beliefs targeted in therapy are typically formed prior to treatment. Limitations acknowledged in the extracted text include the inability to isolate the exact neurochemical pathway mediating effects and the relevance of the timing between drug administration and behavioural testing. The authors present the study as one of few objective assessments of basic cognitive processes under LSD in humans and suggest the findings have implications for understanding how LSD might aid revision of deleterious associations in therapeutic contexts.

View full paper sections

CONCLUSION

There has been a recent surge of interest in potential therapeutic effects of psychedelics, particularly LSD. Theorising on the mechanisms of such effects centres on their role in enhancing learning and plasticity. In the current study we tested these postulated effects of LSD in flexible learning in humans and find that LSD increased learning rates as well as the impact of previously learnt values on subsequent perseverative behaviour. Specifically, LSD increased the speed at which value representations were updated following prediction error (the mismatch between expectations and experience). Whilst LSD enhanced the impact of both positive and negative feedback, it augmented learning from reward significantly more than it augmented learning from punishment. The observation that LSD enhanced learning rates may be particularly important for understanding the mechanisms through which LSD might be therapeutically useful. Psychedelic drugs have been hypothesised to destabilise pre-existing beliefs (i.e. relax prior beliefs or "priors"), making them amenable to revision (Carhart-Harris and Friston 2019). The notion of relaxed priors is directly compatible with increased reinforcement learning rates: in our study, LSD rendered subjects more sensitive to prediction errors, which naturally implies downweighting of prior beliefs (Carhart-Harris and Friston 2019). That LSD affected a fundamental belief-updating process is notable given that psychedelics are under investigation trans-diagnostically for diverse clinical phenomena including depression, anxiety, alcoholand nicotine abuse, OCD, and eating disorders); a unifying feature of these conditions is maladaptive associations in need of revision. Given the broad effect of LSD on a range of neurotransmitter systems, it is not possible to determine the specific neurochemical mechanism underlying the observed LSD effects on learning. Nonetheless, obvious possibilities involve the serotonin and dopamine system, in particular 5-HT 2A and D 2 receptors. Specifically, the psychological plasticity purportedly promoted by psychedelics is believed to be mediated through action at 5-HT 2A receptors (Carhart-Harris and Nutt 2017) via downstream enhancement of NMDA (N-methyl-D-aspartate) glutamate receptor transmission) and brainderived neurotrophic factor (BDNF) expression. The hypothesis that the present results are driven by serotonergic effects of LSD is supported by two recent studies in mice. Optogenetically stimulating dorsal raphé serotonin neurons enhanced reinforcement learning rates, whilst activation of these neurons tracked prediction errors during reversal learning). In addition to affecting the serotonin system, however, LSD also acts at dopamine receptors. Dopamine has long been known to play a crucial role in belief updating following reward, and more recent evidence shows that dopaminergic manipulations may alter learning rates. A dopaminergic effect would be in line with our previous study where genetic variation in the dopamine, but not serotonin transporter polymorphism, was associated with the same enhanced relationship between acquisition and perseveration as reported here under LSD (den Ouden et al. 2013). Serotonin-dopamine interactions represent another candidate mechanism that could underlie the present findings. For example, stimulation of 5-HT 2A receptors in the prefrontal cortex of the rat, enhanced ventral tegmental area (VTA) dopaminergic activity. Indeed, the initial action of LSD at 5-HT 2A receptors has been proposed to sensitise dopamine neuron firing, which subsequently potentiates the direct dopaminergic effects of LSD. LSD action at D 2 receptors, consequently, appears to be especially pronounced at a later time following LSD administration, which is relevant given the relatively long delay between LSD administration and performance of the current task (see Methods). However, arguing against a late dopaminergic effect is a previous study in rodents where the effects of LSD on reversal learning were consistent across four different time lags between drug administration and behavioural testing. LSD impaired flexibility such that under LSD, better initial learning led to more perseverative responding. This result was in line with a recent study showing that LSD induced spatial working memory deficits and higher-order cognitive inflexibility in a setshifting paradigm). Importantly, these effects were blocked by coadministration of the 5-HT 2A antagonist ketanserin, showing that the LSD-induced impairments were mediated by 5-HT 2A agonism, consistent with a 5-HT 2A mechanism underlying the present results. These data collectively suggest that LSD "stamps in" new learning following drug administration, which may subsequently be harder to update. LSD's effects to induce cognitive inflexibility are ostensibly at odds with the observation that LSD enhanced plasticity (through enhanced learning rates). However, these results can be reconciled by considering the timing of drug administration with respect to initial learning and tests of cognitive flexibility. In both the present experiment and the previous set-shifting study), all phases of learning (acquisition and reversal) were conducted after LSD administration. In contrast, when acquisition learning was conducted prior to LSD administration, LSD resulted in improved reversal learning (using a reversal paradigm in rats;. Likewise, when acquisition learning was conducted prior to administration of a 5-HT 2A antagonist, reversal learning was impaired). Collectively, these findings suggest that whether a prior belief is down-or up-weighted under LSD may depend on whether the prior is formed before or during drug administration, respectively. This observation is of great relevance for a putative therapeutic setting, where maladaptive beliefs will have been formed before treatment. In summary, LSD enhanced the rate at which humans updated their beliefs based on feedback. Learning rate was most enhanced by LSD when receiving reward, and to a lesser extent following punishment. This study represents one of the few applications of objective measures to investigate fundamental cognitive processes in humans under LSD. These findings have implications for understanding the mechanisms through which LSD might be therapeutically useful for revising deleterious associations.

Study Details

Study Type
individual
Population
humans
Characteristics
placebo controlledsingle blindcrossoverre analysis
Journal
Psychological Medicine
Compounds
LSD LSD
Topic
Healthy Volunteers