Skip to main content

Altered trial-to-trial responses to reward outcomes in KCNMA1 knockout mice during probabilistic learning tasks

Abstract

The large-conductance calcium- and voltage-activated potassium (BK) channels, encoded by the KCNMA1 gene, play important roles in neuronal function. Mutations in KCNMA1 have been found in patients with various neurodevelopmental features, including intellectual disability, autism spectrum disorder (ASD), or attention deficit hyperactivity disorder (ADHD). Previous studies of KCNMA1 knockout mice have suggested altered activity patterns and behavioral flexibility, but it remained unclear whether these changes primarily affect immediate behavioral adaptation or longer-term learning processes. Using a 5-armed bandit task (5-ABT) and a novel Δrepeat rate analysis method that considers individual baseline choice tendencies, we investigated immediate trial-by-trial Win-Stay-Lose-Shift (WSLS) strategies and learning rates across multiple trials in KCNMA1 knockout (KCNMA1−/−) mice. Three key findings emerged: (1) Unlike wildtype mice, which showed increased Δrepeat rates after rewards and decreased rates after losses, KCNMA1−/− mice exhibited impaired WSLS behavior, (2) KCNMA1−/− mice displayed shortened response intervals after unrewarded trials, and (3) despite these short-term behavioral impairments, their learning rates and task accuracy remained comparable to wildtype mice, with significantly shorter task completion times. These results suggest that BK channel dysfunction primarily alters immediate behavioral responses to outcomes in the next trial rather than affecting long-term learning capabilities. These findings and our analytical method may help identify behavioral phenotypes in animal models of both BK channel-related and other neurodevelopmental disorders.

Introduction

The large-conductance calcium- and voltage-activated potassium (BK) channels, encoded by the KCNMA1 gene, play a crucial role in regulating neuronal excitability through negative feedback mechanisms. These channels are located in both pre- and post-synaptic compartments, where they respond to membrane depolarization and increases in intracellular calcium by generating large outward potassium currents [24]. At the presynaptic terminal, BK channels limit calcium influx and neurotransmitter release by promoting rapid repolarization. Postsynaptically, they contribute to spike repolarization and afterhyperpolarization, thereby controlling neuronal firing patterns and excitability. This precise regulation of neuronal activity by BK channels is essential for normal brain function, and its disruption through mutations in KCNMA1 has been linked to various neurological and psychiatric conditions, including motor deficits, autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), and intellectual disability [3, 11, 31, 38, 41, 43,44,45, 58, 74]. A substantial number of patients with KCNMA1 mutations, encompassing both loss-of-function and gain-of-function variants, have been reported to exhibit neurodevelopmental disorders, with a subset displaying autistic features [37, 38, 41, 43, 44].

Notably, reduced BK channel expression has been implicated in other neurodevelopmental disorders that present with autistic features. In Fragile X syndrome (FXS), a disorder characterized by intellectual disability and autism-like behaviors, the fragile X mental retardation protein (FMRP) in the central nervous system directly modulates BK channel function to regulate action potential duration [16, 23]. Indeed, BK channels have emerged as a promising therapeutic target for FXS, with channel openers showing behavioral improvements in animal models [26, 27, 33, 75]. Similarly, Williams-Beuren syndrome, another neurodevelopmental disorder with autistic features, exhibits decreased KCNMA1 expression [36]. Additionally, Angelman syndrome, which is characterized by hyperactivity, has been shown to involve degradation of BK channels [63].

Studies of KCNMA1 knockout mice have revealed complex behavioral phenotypes. While these mice exhibit moderate ataxia and motor deficits [39, 50, 68, 69] and reduced wheel running [40], their open field locomotor activity remains normal [68, 69]. Intriguingly, they show increased home cage activity and Y-maze exploration [40, 68, 69], along with marked alterations in acoustic response and habituation [68, 69]. In comparison, Fmr1 knockout mice modeling FXS display hyperactivity, sensory hyperexcitability, and impaired short-term spatial memory-deficits that can be ameliorated by BK channel openers [8, 27, 33, 75]. While these findings demonstrate the broad impact of BK channels on behavior, they do not fully characterize how BK channel dysfunction might relate to the impaired behavioral adaptation to changing environments—a hallmark feature of ASD and ADHD in human individuals.

Our recent study using the 3-choice serial reaction time task (3-CSRTT), found that, while KCNMA1 knockout did not severely impair task acquisition, it did increase premature responses when cue waiting times are randomized [4]. This suggests that BK channels play a critical role in modulating the timing control of actions, in response to unpredictable stimulus patterns. Additionally, perseverative responses (repeating the choice behavior even after receiving a reward) decreased significantly in KCNMA1−/− mice compared to the wild type [4], indicating that their outcome recognition or in action-chain executions are altered. While the 3-CSRTT was used to examine the ability to adapt to tasks that require attention, further research using other tasks might clarify adaptation in highly uncertain environments and responses to outcomes.

Impaired behavioral flexibility, a characteristic of neurodevelopmental disorders, is often studied using reversal learning tasks [59, 66]. These tasks, which require remapping stimulus-action-outcome associations, are negatively affected by various neurological and mental disorders [51,52,53, 59]. Win-Stay-Lose-Shift (WSLS) strategies are used to analyze trial-by-trial decisions in reversal learning tasks [7, 12, 18, 19, 34, 35]. Win-Stay repeats rewarded choices, while Lose-Shift changes after no reward [25, 72]. Probabilistic reward-related outcomes increase task difficulty by the requiring integration of information from multiple trials. These designs, known as "probabilistic reversal learning” or "bandit tasks”, represent a reinforcement learning paradigm [13, 34, 47, 48, 65].

Alterations in WSLS behaviors are common in neurodevelopmental disorders [6, 62]. Individuals with ASD often exhibit reduced cognitive flexibility and difficulty in coping with uncertainty [55]. In probabilistic reversal learning tasks, ASD patients tend to persist with the same choices, instead of adjusting responses according to the outcomes [17, 21, 57]. However, few studies reported that adults with ASD exhibited extensive choice switching and a lack of Win-Stay, during repeated experiential tasks [73] and probabilistic learning tasks [62]. This pattern is also characteristic of ADHD, where task adaptation is hindered by a tendency to switch choices regardless of the outcome [6].

Alterations in the temporal aspects of behavior, such as post-error slowing (where reaction time is prolonged after an unrewarded trial), are also often reported in neurodevelopmental disorders. For instance, children with ADHD show shortened post-error slowing in challenging tasks, which is considered to indicate reduced error recognition [5].

The k-armed bandit task (k-ABT, k = number of choices) has been a valuable tool to investigate these behavioral characteristics in animal models, as it allows to examine several quantitative behavioral indicators, such as accuracy, behavioral flexibility, WSLS strategies, and temporal response characteristics in uncertain environments [13, 30, 34, 59]. While previous studies using the 2-ABT (k = 2) have revealed abnormalities in mouse models of neurodevelopmental disorders [1, 2, 56], the potential role of BK channels in these behaviors has not been explored using this paradigm. Therefore, we hypothesized that examining the WSLS strategies and temporal response characteristics of KCNMA1−/− mice in probabilistic learning tasks may reveal some kind of abnormalities or indication of these phenotypes. This study is expected to contribute to a deeper understanding of the behavioral characteristics of neurodevelopmental disorders originating from KCNMA1 mutations. Additionally, it may potentially be useful to classify subtypes of clinical symptoms, thereby enhancing our ability to identify and assess these neurodevelopmental disorders.

This study uses the 5-ABT, as described by Ohta et al. [48], to explore the behavioral responses of KCNMA1−/− mice to unexpected events. In this task, the mice are required to nose-poke into one of five holes, each associated with a predefined probability of delivering food, thereby forcing the mice to learn from binary outcomes—reward or no reward. This experimental setup allows the quantitative evaluation of WSLS behaviors observed in subsequent trials. In particular, we propose a method to calculate the difference between the repeat probability and the baseline probability of selecting the same choice in WSLS analysis, based on our previous work [48], which was considered useful to detect behavioral differences in WSLS strategy. By reversing the reward probabilities, behavioral flexibility can also be assessed. While the 2-ABT has been successfully used in many studies [56, 61, 65], its design inherently limits shifting choices to a single alternative option, making it difficult to distinguish between learned alternation behavior and true choice flexibility. In contrast, the 5-ABT allows the animals to choose among four remaining options, enabling a broader evaluation of choice variation characteristics with reduced influence of simple alternation tendencies. The positive and negative learning rates and exploration tendency indexes can also be calculated by fitting a Q-learning model to the behavioral data. This provides insights into the long-term learning efficacy and choice policy [48, 49].

This study addresses three key aspects: (1) the nature of WSLS behavior in KCNMA1−/− mice, (2) their performance in a reversal learning task, and (3) the temporal characteristics of their responses. Through this comprehensive approach, we aim to elucidate the role of BK channels on the immediate behavioral adaptation and temporal aspects of operant learning, which carries implications for the understanding of KCNMA1-related neurodevelopmental disorders.

Materials and methods

Animals

The KCNMA1 knockout strain was originally provided by Dr. Andrea L. Meredith at Department of Physiology, University of Maryland School of Medicine [39]. KCNMA1+/− mutant mice were backcrossed with C57BL/6 J mice for more than 10 generations and maintained at National Defense Medical College. Male BK channel-deficient mice (KCNMA1−/−, n = 16) and their male wild-type (KCNMA1+/+, WT) littermates (n = 16) were used for the behavioral experiments. Genotyping was conducted at 2–3 weeks of age using primers: (F) ATAGCCTGAAGAACGAGATCAGC; (R) CCTCAAGAAGGGGACTCTAAAC for detecting KCNMA1−/− and (F) TTCATCATCTTGCTCTGGCGGACG; (R) CCATAGTCACCAATAGCCC for WT. After genotyping, mice were housed in groups of 2 or 3 per cage, with each cage containing both WT and KCNMA1−/− littermates. Animals were maintained in a temperature-controlled room (22–25 °C) with a 12-h light/dark cycle. Mice were tested in the 5-ABT from 11 to 28 weeks of age, and were food-restricted for the 24 h hours preceding the test. Average group weights were as follows: WT, 27.6 ± 1.6 g (mean ± SD) at baseline and 24.7 ± 1.9 g (89.5% of baseline) after food restriction; KCNMA1−/−, 22.3 ± 2.2 g at baseline and 19.6 ± 2.1 g (87.9%) post-restriction. Throughout the experiment, mice could obtain food reward pellets (Dustless Precision Pellets, 20 mg, Bio-Serve, Frenchtown, NJ, USA) by performing the 5-ABT.

Apparatus

The 5-ABT was conducted individually in an operant chamber measuring 18.8 × 23.0 × 17.5 cm (MED-NP9M-B1, Med Associates Inc., VT, USA). This chamber featured a curved rear panel fitted with a horizontal array of nine round holes (diameter: 1.2 cm), each equipped with an infrared photocell beam sensor for nose-poke detection and an LED light. From the nine holes, five were used for the test and four were covered with plastic plates for the duration of the task. The front panel included a food magazine outfitted with an infrared sensor and an LED light. An acrylic nest box measuring 13.6 × 20.8 × 11.5 cm (SN-798; AS ONE Corp., Japan) was connected to the operant box by a 3-cm diameter passage linking the two box walls. This setup allows the continuous monitoring of the mice’s choices in response to changes in reward probabilities, across several days [28, 48, 54]. Water was freely available from a bottle mounted on the nest box.

Experimental procedures

The experiment comprised two stages, training and testing, as previously described by Ohta et al. [48]. Initially, during the training stage, the process was divided into four progressive phases (i.e., Phase 0 to Phase 3). In Phase 0, magazine training, mice were trained to nose-poke the magazine to receive a pellet, with the phase ending after 50 pellets were dispensed. In Phase 1, mice were allowed to nose-poke any of five available holes to receive a pellet, and this phase also concluded after 50 pellets. Phase 2 began with the first nose-poke to the magazine (task call), and the mice received a pellet with every nose-poke, ending once 25 pellets were dispensed. In Phase 3, the reward probability at each of the five holes was set to 70%, and this phase was completed once another 25 pellets were delivered.

After completing all training phases, mice performed the testing stage, which involved three distinct tasks: ALL, BIT and REV (Reversal). In the ALL task, each hole had a 30% chance of receiving a pellet; in the BIT task, only the third hole offered a 50% chance of a pellet, with all other holes having zero probability; and in the REV task, every hole except the third offered a 30% chance of receiving a pellet. Each task continued until 151 pellets were collected. If mice failed to collect at least 75 pellets within 24 h, they were manually fed 75 pellets. The allotted time from task call to nose-poke was 15 s, and failures to act within this window were recorded as timeouts. The inter-trial interval (ITI) was consistently maintained at 4 s.

All mice successfully progressed through the training phases and completed the experimental protocol, with no exclusions necessary due to training failures or incomplete task performance.

Indexes

The outcomes measured in the study were as follows: Accuracy, which corresponded to the number of times a mouse selected the correct hole (i.e., the option with a non-zero reward probability), divided by the number of selection attempts for each task (BIT, REV). Timeout, which corresponded to the number of trials in which a mouse failed to nose-poke a hole within the 15-s time limit following a task call. Repeat rate, which represented the rate at which a mouse chose the same hole after obtaining (repeat rate after Win) or not obtaining (repeat rate after Lose) a reward, across all subtasks. Δrepeat rate, which represented the rate of change in the repeat rate after win (Δrepeat rate after Win) or repeat rate after lose (Δrepeat rate after Lose) across all subtasks. Reaction time, which corresponded to the median time elapsed between a task call after a rewarded (i.e., reaction time after Win) or an unrewarded (i.e., reaction time after Lose) trial, and a nose-poke. Reward latency, which corresponded to the median time elapsed from the correct nose-poke to the magazine nose-poke. Task call latency after Lose, which indicated the median time taken between an unrewarded nose-poke and the start of a new task with a magazine nose-poke. Inter-response interval (IRI), which corresponded to the median time elapsed between a rewarded (i.e., IRI after Win) or an unrewarded (i.e., IRI after Lose) nose-poke, and the next nose-poke. The IRI was calculated across the trials of each subtask. Entropy, was calculated based on the probability distribution of nose-pokes for each hole \(i\) for the first 300 trials of each task, using the definition by Shannon [60]: \(H=-\Sigma {p}_{i}{\mathit{log}}_{2}{p}_{i}\).

Q-learning-based analysis

Behavioral data were analyzed in the assumption that each mouse continuously attributed an action value to each hole, as previously described [13, 34, 48]. Action values were determined according to the Differential Learning Rate Q-learning model (DLR-Q) [9, 14, 29, 48], which can be summarized as follows:

Let \(A\) be the set actions, where \(A=\{1, 2, 3, 4, 5\}\), with each number corresponding to choosing a specific hole. On each trial \(t\), where an action \(a\in A\) was selected and an outcome \(r\in \{\text{0,1}\}\) was obtained, the action value \({Q}_{t}\) of choosing action \(a\) was calculated/updated as:

$${Q}_{t+1}\left(a\right)={Q}_{t}\left(a\right)+ \left\{\begin{array}{c}{\alpha }^{+}\left(r-{Q}_{t}\left(a\right)\right)\,if\,r=1 \\ {\alpha }^{-}\left(r-{Q}_{t}\left(a\right)\right)\,if\,r=0\end{array}\right.$$

where \({\alpha }^{+}\) and \({\alpha }^{-}\) are free parameters that control the magnitude of value updates based on the presence or absence of reward. They are referred to as the positive and negative learning rates, respectively. The probability of selecting action \(a\) on trial \(\text{t}\) was predicted using the soft-max policy function:

$$P\left(a\right)=\frac{\mathit{exp}\left({Q}_{t}\left(a\right)\beta \right)}{{\sum }_{b\in A}\mathit{exp}\left({Q}_{t}\left(b\right)\beta \right)}$$

where \(\upbeta\) is a free parameter called the inverse temperature, which governs the exploration–exploitation trade-off [64]. These three free parameters (\({\alpha }^{+}, {\alpha }^{-} \text{and }\upbeta\)) were estimated using the importance sampling method [29].

Statistical analysis

Duration was compared between both groups across all seven phases (training phases 0–3 and test phases ALL, BIT, and REV) using a two-way repeated measures ANOVA, with Sidak’s post-hoc test for multiple comparisons. For other measures, comparisons were made as follows: Accuracy was compared between both groups and two tasks (BIT, REV) using a two-way repeated measures ANOVA with Sidak’s post-hoc test. Entropy, timeout, IRIs after Win and Lose, reward latency, task call latency after Lose, and reaction times after Win and Lose were compared between both groups across three test tasks (ALL, BIT, REV) using a two-way repeated ANOVA with Sidak’s test for post-hoc multiple comparisons. Repeat rates after Win and Lose, Δrepeat rates after Win and Lose, positive and negative learning rates and inverse temperature were analysed using the two-tailed t-test with Welch’s correction. All statistical analysis were performed with GraphPad Prism 10 (GraphPad Software, USA).

Results

We investigated the reinforcement learning abilities of WT and KCNMA1−/− mice using the 5-ABT (Fig. 1). Figure 2A, B show choice rate changes of individual representative mice (one WT, and another KCNMA1−/− mouse, respectively) for the five-hole options in the subtasks ALL, BIT, and REV. In BIT, hole 3 had a 50% reward probability while others were 0%; both WT (Fig. 2A) and KCNMA1−/− (Fig. 2B) mice gradually increased their selection of the hole 3. In REV, all holes except hole 3 had 30% reward probability, and both genotypes correspondingly decreased their selection of hole 3. The average choice rates for each group are presented separately for BIT (Fig. 2C, D) and REV (Fig. 2E, F), aligned to the start of each subtask to account for individual differences in task completion times. Both groups successfully identified the correct hole choice in BIT and its reversal in REV.

Fig. 1
figure 1

The five-armed bandit task (5-ABT). A nose-poke into the magazine initiated a trial (task-call), after which the mouse was required to nose-poke into one of the five holes within 15 s. Each hole was associated with a preset probability of food pellet delivery. The experiment consisted of three consecutive tasks—ALL, BIT (only one choice yields reward), and REV (reverse)—each featuring distinct reward probabilities for each hole. Each task ended after 151 food pellets had been dispensed

Fig. 2
figure 2

5-ABT responses of WT and KCNMA1−/− mice. A and B A representative choice history of WT and KCNMA1−/− mice in subtasks ALL, BIT and REV. Blue, green, orange, magenta, and yellow lines represent selection rates for holes 1, 2, 3, 4, and 5, respectively, plotted as 20-step moving averages of choice ratios. Vertical dotted lines indicate task switching. C and D Group average of 20-step moving averages of hole selection for each mouse in the subtask BIT. Since mice completed BIT in different numbers of trials, averages are shown for the first 500 steps. E and F Group average in REV. The horizontal, blue, dotted lines indicate a 50% choice ratio

No significant group differences were found for accuracy in the BIT and REV tasks (task × group: p = 0.274, F [1, 30] = 1.24; task: p < 0.001, F [1, 30] = 44.7; group: p = 0.638, F [1, 30] = 0.227; Fig. 3A). Analysis of choice entropy showed only a significant main effect of task (Fig. 3B; task × group: p = 0.142, F [2, 60] = 2.02; task: p < 0.001, F [1.741, 52.22] = 9.99; group: p = 0.440, F [1, 30] = 0.612), indicating that all mice altered their exploration patterns across different task phases, regardless of genotype. A significant interaction between task and genotype was found in the duration to complete all task phases, including training phases 0–3 and the three test phases (Fig. 3C; task × group: p < 0.001, F [6, 180] = 5.58; task: p < 0.001, F [3.52, 106] = 149; group: p = 0.377, F [1, 30] = 0.80). Post-hoc analyses did not show significant difference in each task phase (Phase 0: WT 2.36 h vs KCNMA1−/− 2.96 h, p = 0.998; Phase 1: 3.49 vs 7.38, p = 0.085; Phase 2: 2.51 vs 3.01, p = 0.996; Phase 3: 5.91 vs 13.0, p = 0.225; ALL: 46.4 vs 36.2, p = 0.284; BIT: 35.1 vs 26.5, p = 0.131; REV: 32.7 vs 30.4, p = 0.976). The groups did not differ significantly in the number of timeouts (Fig. 3D; task × group: p = 0.108, F [2, 60] = 2.31; task: p < 0.001, F [1.477, 44.32] = 28.9; group: p = 0.075, F [1, 30] = 3.39; WT: mean = 179.2; KCNMA1−/−: mean = 119.6).

Fig. 3
figure 3

Behavioral characteristics exhibited in the 5-ABT. A Accuracy, the ratio of correct choices in BIT and REV. B Choice entropy. C Duration, time required to complete each task. D Number of timeout trials per task. No significant differences were found between groups

We examined group differences in the Win-Stay-Lose-Shift (WSLS) strategy. We calculated the probability of both a rewarded (Fig. 4A, Win-Stay) and an unrewarded (Fig. 4D, Lose-Shift) choice being re-selected after n trials (repeat rate). Our previous research showed that, after a rewarded trial, the probability of selecting the same option was higher only for the immediately subsequent trial [48]. Based on this, we used the mean probability of the same-option selection from the 6th to the 10th trial, following either Win or Lose as a baseline. We then established the Δrepeat ratio by computing the difference between this baseline and the repetition rate in the immediately subsequent trial for each individual mouse (Fig. 4B, E). The Δrepeat ratio after Win differed significantly between groups (p < 0.001, t [27.18] = 4.51; Fig. 4B), with an average of 0.0490 ± 0.034 (SD) for WT mice and 0.0020 ± 0.024 for KCNMA1−/− mice. Similarly, a significant difference was found for Δrepeat ratio after Lose (p < 0.001, t [28.05] = 5.65; Fig. 4E), with an average of -0.050 ± 0.025 for WT mice and -0.0050 ± 0.019 for KCNMA1−/− mice. For comparison with previous studies using other ASD mouse models [1, 56], the repeat rates for the trial that followed a rewarded and an unrewarded trial, were also calculated and compared between groups (Fig. 4C, F). In contrast to our findings using the Δrepeat rates analysis, conventional repeat rate calculations showed that KCNMA1−/− mice had significantly higher repeat rates after Lose (p = 0.047, t [27.19] = 2.08) but not after Win (p = 0.053, t [29.91] = 2.02).

Fig. 4
figure 4

Win-Stay-Lose-Shift behaviors in WT and KCNMA1−/− mice. A and D Probability of repeating choices. Black (WT) and red (KCNMA1−/−) lines show the probability (0–1.0) of selecting the same hole in subsequent trials after a rewarded (A) or unrewarded (D) choice at trial 0. Dotted lines represent the baseline probability, calculated as the mean probability of selecting the same hole across trials 6–10. Error bars represent the standard error of the mean. B and E Change in repeat probability (Δrepeat rate). The difference between immediate repeat probability (trial 1) and baseline probability (trials 6–10 average) after Win (B) or Lose (E). Positive values indicate increased tendency to repeat choices compared to baseline, while negative values indicate decreased tendency. C and F Absolute repeat probability. The probability (0–1.0) of selecting the same hole in the immediately following trial after a rewarded choice (C) or unrewarded choice (F). Asterisks indicate statistical significance at p < 0.05 (*) and p < 0.001 (***)

To explore the temporal characteristics of the behavior that followed the outcome of each choice, we evaluated the inter-response interval (IRI), which measured the time elapsed between two-hole choices. No significant difference was found between groups for IRI after a Win (Fig. 5A; task × group: p = 0.796, F [2, 60] = 0.229; task: p < 0.011, F [1.188, 35.63] = 6.63; group: p = 0.056, F [1, 30] = 3.96; WT: mean = 62.8; KCNMA1−/−: mean = 46.2). However, the groups differed significantly in the IRI after a Lose (Fig. 5B; task × group: p = 0.505, F [2, 60] = 0.691; task: p < 0.001, F [1.346, 40.39] = 22.0; group: p = 0.002, F [1, 30] = 11.4; WT: mean = 20.0; KCNMA1−/−: mean = 14.5). These findings suggest a temporal response alteration, specific to trials in which no reward is obtained, which prompts a detailed breakdown of the IRI components.

Fig. 5
figure 5

Temporal characteristics of 5-ABT responses. A and B Inter response interval (IRI) after a Win (A) or a Lose (B), representing the time between consecutive hole nose-pokes. Significant genotype differences were observed in IRI after Lose. C Reward latency, time elapsed from successful hole selection to magazine nose-poke to obtain a pellet. D Task call latency after Lose, time elapsed from unsuccessful hole selection to magazine nose-poke to initiate the next trial, showing significant differences between genotypes. E and F Reaction time after a Win (E) and a Lose (F), representing the time elapsed from a task call to the next hole selection, with significant genotype differences in reaction time after Lose (F). Asterisks indicate statistical significance between genotypes, p < 0.05 (*) and p < 0.005 (**)

We measured the time from the rewarded hole nose-poke to the magazine nose-poke to obtain a reward, known as reward latency, and the time from the hole nose-poke to the magazine nose-poke to task call when no reward was given. No significant difference was found in the reward latency of the two groups (Fig. 5C; task × group: p = 0.214, F [2, 60] = 1.58; task: p < 0.001, F [1.856, 55.67] = 18.6; group: p = 0.258, F [1, 30] = 1.33; WT: mean = 2.28 s; KCNMA1−/−: mean = 2.12 s). However, the groups differed significantly on the latency to a new task call following a Lose (Fig. 5D; task × group: p = 0.127, F [2, 60] = 2.14; task: p < 0.001, F [1.481, 4.44] = 23.3; group: p = 0.005, F [1, 30] = 9.46; WT: mean = 12.8 s; KCNMA1−/−: mean = 9.1 s). Regarding the reaction time after initiating the task, no significant difference was found between the groups in the trial that followed a Win (Fig. 5E; task × group: p = 0.216, F [2, 60] = 1.58; task: p < 0.001, F [1.960, 58.79] = 21.8; group: p = 0.875, F [1, 30] = 0.0251; WT: mean = 6.79 s; KCNMA1−/−: mean = 6.69 s), while the groups differed significantly in the trial that followed a Lose (Fig. 5F; task × group: p = 0.190, F [2, 60] = 1.71; task: p < 0.001, F [1.435, 43.04] = 59.1; group: p = 0.018, F [1, 30] = 6.22; WT: mean = 5.50 s; KCNMA1−/−: mean = 4.63 s).

Taken together, our results indicate that KCNMA1 knockout leads to altered choices and temporal control of behavior, on a scale from seconds to tens of seconds for the next trial. However, since the overall learning of the task was still achieved (Fig. 2 and 3A), we further investigated long-term reinforcement learning capabilities across multiple trials using Q-learning model-based analysis. We found no significant differences between the groups in both positive (Fig. 6A; p = 0.779, t [27.14] = 0.284) and negative learning rates (Fig. 6B; p = 0.223, t [28.00] = 1.25). Additionally, no significant differences were found for inverse temperature, a parameter related to exploratory behavior (Fig. 6C; p = 0.932, t [25.88] = 0.0867).

Fig. 6
figure 6

Estimated reinforcement learning meta-parameters. A Positive learning rate estimated from the action-selection data of each mouse. B Negative learning rate. C Inverse temperature. No significant group differences were found for any of these parameters

Discussion

This study demonstrates three key characteristics of BK channel dysfunction of mice, in the context of the 5-ABT: (1) it abolishes the WSLS strategy, (2) it shortens the time to retry after an unrewarded trial, and (3) despite these short-term behavioral impairments, their learning rates and task accuracy remained comparable to wild type mice.

Widely used analyses of WSLS behavior in other studies have investigated the probability of repeating the same choice in the trial that follows a Win or a Lose outcome [56, 61, 65]. In the present study, using the same method, we initially hypothesized that, similar to other ASD models such as Fmr1-KO and BTBR mice [1, 56], KCNMA1−/− mice would increase Lose-Shift behavior (Fig. 4F). However, acknowledging the possibility of individual variations in the baseline probability of repeating the same choice, we refined the analysis method, based on our previous work [48]. In this refined approach, we calculated the difference between an individual’s average repeat probability and the repeat rate immediately after a Win/Lose outcome. Intriguingly, this new analysis revealed that KCNMA1−/− mice actually showed a decrease in both Lose-Shift (Fig. 4E) and Win-Stay behaviors (Fig. 4B), thus contradicting the previous conclusions that KCNMA1−/− mice increased Lose-Shift behavior (Fig. 4F) and did not change in Win-Stay behavior (Fig. 4C). By pooling up to 10 subsequent trials, instead of considering only the first trial after a Win or a Lose, and assuming the number of trials after which the effects of Win or Lose would disappear, the expected value could be calculated based on the average repeat rate of each individual mouse. This, in turn, allowed to calculate the Δrepeat rate. This analytical method can potentially be useful to identify short-term behavioral characteristics in probabilistic learning task, both for animal models of various neurodevelopmental disorders and for patients with those disorders.

Human studies use multi-choice tasks like Iowa Gambling Task [46], while animal studies often use two-choice paradigms (2-ABT). In the case of 2-ABT, after animals are trained in alternating behavior, the probability patterns are changed or reversed. However, this makes it difficult to distinguish between WSLS strategies and the learned alternating behavior. To enhance translational studies between human neurodevelopmental/mental disorders and animal models, it is crucial to investigate using Δrepeat rate how the number of available choices influences the sensitivity in detecting WSLS strategy differences.

Despite the significant loss of WSLS strategy observed in KCNMA1−/− mice, only partial changes were found in their long-term learning ability. Since the positive and negative learning rates did not differ significantly between groups (Fig. 6A, B), it is concluded that their ability to update the action value of each choice over the long term was not significantly impaired. This indicates the lack of a direct correlation between the short-term WSLS strategy and the update of the long-term action value, which suggests that they function through independent mechanisms.

In terms of reversal learning, KCNMA1−/− mice did not show severely impaired behavioral flexibility, as evidenced by similar accuracy to WT mice in both BIT and REV tasks (Fig. 3A). The inverse temperature parameter indicates the balance between exploration and exploitation: lower values suggest a stronger tendency to explore regardless of past outcomes, while higher values indicate a preference for choices that previously yielded rewards [15, 20, 65]. The absence of significant differences in inverse temperature and choice entropy between groups (Figs. 6C, 3B) suggests that their overall exploration tendencies were similar.

The KCNMA1−/− mice exhibit a unique form of hyperactivity that becomes apparent under specific circumstances. While these knockout mice demonstrated comparable task performance metrics (accuracy, duration, and timeouts) to WT mice (Fig. 3A, C, D), they showed distinctive temporal patterns in their inter-trial behavior. Analysis of inter-response intervals (IRI) revealed that KCNMA1−/− mice had shorter retry times following unrewarded trials (Fig. 5B). They also demonstrated reduced task call times after non-reward (Fig. 5D) and decreased reaction times, specifically after Lose (Fig. 5F). These findings point to a distinctive tendency in KCNMA1−/− mice to rapidly engage in subsequent actions, particularly following negative outcomes. This behavioral pattern is further corroborated by Y-maze experiments, where KCNMA1−/− mice exhibited twice the number of explorations compared to WT mice, while maintaining similar alternation rates [68, 69]. This suggests a propensity for quick exploration of alternative paths without perseveration. Additionally, in the 3-CSRTT, KCNMA1−/− mice showed significantly fewer perseverative responses [4], contrasting with other ASD model mice. The hyperactivity observed in KCNMA1−/− mice is characterized by increased actions per unit time, especially after unsuccessful experiences. This behavior is distinct from simple repetitive actions and is not marked by a fixation on specific choices, as evidenced by their diminished Win-Stay strategy (Fig. 4B). Instead, it reflects a rapid transition to new actions, particularly in response to negative outcomes, similar to the hyperactivity observed after stress induction in KCNMA1−/− mice [50]. However, while these behavioral changes might appear to reflect cognitive alterations, they could potentially arise from more fundamental sensory processing deficits. Given that BK channels play crucial roles in both locomotor function and auditory and visual systems [26, 67,68,69] abnormalities in sensory processing might secondarily lead to these apparent changes in trial-to-trial behavior.

Our findings have significant implications for the understanding of various neurodevelopmental disorders. Recent studies in humans using probabilistic reversal learning, have revealed distinct behavioral patterns in patients with different neurodevelopmental issues. For example, ASD patients aged 8–44 years old have exhibited Lose-Shift errors following reversal of task probability conditions, maintaining the choice for the previously learned option, which lead to deteriorated performance [21]. Additionally, adolescents with ASD were found to have significantly lower Win-Stay probabilities and positive learning rates compared to typically developing subjects [17]. Adults with ASD have also been reported to show lower Win-Stay probabilities, while maintaining intact Lose-Shift probabilities [62]. Fragile X syndrome (FXS) patients exhibiting ASD symptoms presented slower task learning and significant deficits in Lose-Shift learning [56]. In contrast, young adults with ADHD tended to switch choices after both Win and Lose outcomes and displayed a decreased negative learning rate [6]. While the exact relationship between BK channels and these disorders remains to be clarified [22, 37, 38, 74] it is noteworthy that both KCNMA1−/− mice and patients with BK channel-related disorders [38] share at least one common behavioral/phenotypic feature: altered WSLS strategy.

Post-error slowing is typically observed in probabilistic learning tasks where reaction time increases after a Lose, and differs among distinct cohorts. Notably, children with ADHD exhibit shorter post-error slowing in more challenging tasks [5], which indicates diminished error recognition. This feature may share common underlying factors with the decreased reaction time after Lose and the absence of the WSLS strategy observed in KCNMA1−/− mice.

The impaired WSLS strategy observed in KCNMA1−/− mice indicates a significant dysfunction in the neural circuits responsible for rapid behavioral adaptation to positive and negative outcomes. Previous animal and human studies have identified the specific brain regions associated with WSLS behavior [42, 61], Van Den [70, 71]. However, since our study was based on a non-selective, whole-body KCNMA1−/− model, it was not possible to pinpoint the exact brain regions involved, which highlights important directions for future research. Given that BK channels play a critical role in suppressing neuronal hyperexcitability [24], the loss of this regulatory mechanism in KCNMA1−/− mice may lead to excessive neural activity, ultimately resulting in impaired behavioral adjustments following outcome recognition. Subsequent studies are needed to identify specific brain regions and neuronal cell types that contribute for a non-zero Δrepeat ratio, which quantifies WSLS behavior.

A limitation of this study is the inclusion of only male mice, as sex differences in WSLS strategies have been reported [10]. Our decision to focus on male mice was driven by two key considerations. First, we aimed to establish and validate our novel Δrepeat ratio analysis method as a sensitive tool for detecting subtle or uncharacterized behavioral alterations in various types of knockout mouse models. Second, we sought to minimize potential confounding variables related to food restriction required for operant conditioning. Female BK KO mice are known to show different metabolic responses compared to males, including distinct patterns of body composition and weight regulation [32], suggesting they may be more sensitive to food restriction protocols. While this approach allowed us to demonstrate the utility of our analytical method and identify specific behavioral phenotypes, we acknowledge that extending this investigation to include female mice is crucial for future research. Such studies would not only provide a more comprehensive understanding of BK channel function across sexes but would also better align with the current emphasis on sex-inclusive research in pre-clinical studies for neurodevelopmental disorders.

Conclusion

This study demonstrates that KCNMA1−/− mice present lower Win-Stay-Lose-Shift (WSLS) behavior and shorter retry time after unrewarded trials than WT mice, indicating that BK channels are crucial for short-term behavioral adjustment and temporal control. This study therefore shows a strong correlation between KCNMA1 gene mutations and specific indications or symptoms of neurodevelopmental disorders.

Availability of data and materials

No datasets were generated or analysed during the current study.

References

  1. Alvarez BD, Morales CA, Oliver BL, Cavazos C, Amodeo LR, Amodeo DA. Impairments in operant probabilistic reversal learning in BTBR T+tf/J male and female mice. Behav Brain Res. 2023;437:114111. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.bbr.2022.114111.

    Article  PubMed  Google Scholar 

  2. Amodeo DA, Jones JH, Sweeney JA, Ragozzino ME. Differences in BTBR T+ tf/J and C57BL/6J mice on probabilistic reversal learning and stereotyped behaviors. Behav Brain Res. 2012;227(1):64–72. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.bbr.2011.10.032.

    Article  PubMed  Google Scholar 

  3. Ancatén-González C, Segura I, Alvarado-Sánchez R, Chávez AE, Latorre R. Ca2+- and voltage-activated K+ (BK) channels in the nervous system: one gene, a myriad of physiological functions. Int J Mol Sci. 2023;24(4):3407. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/ijms24043407.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Arake M, Ohta H, Nozawa T, Satoh Y, Fujita M, Nakata T, Meredith AL, Shinomiya N, Ishizuka T, Morimoto Y. BK channel dysfunction disrupts attention-controlled behaviors and altered perseverative responses in murine instrumental learning. Behav Brain Res. 2024;468:115015. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.bbr.2024.115015.

    Article  CAS  PubMed  Google Scholar 

  5. Arnett AB, Rhoads C, Rutter TM. Reduced error recognition explains post-error slowing differences among children with attention deficit hyperactivity disorder. J Int Neuropsychol Soc. 2022;28(8):810–20. https://doiorg.publicaciones.saludcastillayleon.es/10.1017/S1355617721001065.

    Article  PubMed  Google Scholar 

  6. Aster H-C, Waltmann M, Busch A, Romanos M, Gamer M, Maria Van Noort B, Beck A, Kappel V, Deserno L. Impaired flexible reward learning in ADHD patients is associated with blunted reinforcement sensitivity and neural signals in ventral striatum and parietal cortex. NeuroImage Clin. 2024;42:103588.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Bari A, Theobald DE, Caprioli D, Mar AC, Aidoo-Micah A, Dalley JW, Robbins TW. Serotonin modulates sensitivity to reward and negative feedback in a probabilistic reversal learning task in rats. Neuropsychopharmacology. 2010;35(6):1290–301. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/npp.2009.233.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Carreno-Munoz MI, Martins F, Medrano MC, Aloisi E, Pietropaolo S, Dechaud C, Subashi E, Bony G, Ginger M, Moujahid A, Frick A, Leinekugel X. Potential involvement of impaired BK Ca channel function in sensory defensiveness and some behavioral disturbances induced by unfamiliar environment in a mouse model of fragile X syndrome. Neuropsychopharmacology. 2018;43(3):492–502. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/npp.2017.149.

    Article  CAS  PubMed  Google Scholar 

  9. Cazé RD, Van Der Meer MAA. Adaptive properties of differential learning rates for positive and negative outcomes. Biol Cybern. 2013;107(6):711–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00422-013-0571-5.

    Article  PubMed  Google Scholar 

  10. Chen CS, Knep E, Han A, Ebitz RB, Grissom NM. Sex differences in learning from exploration. Elife. 2021;10:e69748. https://doiorg.publicaciones.saludcastillayleon.es/10.7554/eLife.69748.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Cheng P, Qiu Z, Du Y. Potassium channels and autism spectrum disorder: an overview. Int J Dev Neurosci. 2021;81(6):479–91. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/jdn.10123.

    Article  CAS  PubMed  Google Scholar 

  12. Chierchia G, Soukupová M, Kilford EJ, Griffin C, Leung J, Palminteri S, Blakemore S. Confirmatory reinforcement learning changes with age during adolescence. Dev Sci. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/desc.13330.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Cinotti F, Fresno V, Aklil N, Coutureau E, Girard B, Marchand AR, Khamassi M. Dopamine blockade impairs the exploration-exploitation trade-off in rats. Sci Rep. 2019;9(1):1–14. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41598-019-43245-z.

    Article  CAS  Google Scholar 

  14. Ciranka S, Linde-Domingo J, Padezhki I, Wicharz C, Wu CM, Spitzer B. Asymmetric reinforcement learning facilitates human inference of transitive relations. Nat Hum Behav. 2022;6(4):555–64. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41562-021-01263-w.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Collins AGE, Frank MJ. Opponent actor learning (OpAL): Modeling interactive effects of striatal dopamine on reinforcement learning and choice incentive. Psychol Rev. 2014;121(3):337–66. https://doiorg.publicaciones.saludcastillayleon.es/10.1037/a0037015.

    Article  PubMed  Google Scholar 

  16. Contractor A. Broadening roles for FMRP: big news for big potassium (BK) channels. Neuron. 2013;77(4):601–3. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neuron.2013.02.001.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Crawley D, Zhang L, Jones EJH, Ahmad J, Oakley B, Sanjosé Cáceres A, Charman T, Buitelaar JK, Murphy DGM, Chatham C, Den Ouden H, Loth E, The EU-AIMS LEAP group. Modeling flexible behavior in childhood to adulthood shows age-dependent learning mechanisms and less optimal learning in autism in each age group. PLOS Biol. 2020;18(10):e3000908. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pbio.3000908.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Dalton GL, Phillips AG, Floresco SB. Preferential involvement by nucleus accumbens shell in mediating probabilistic learning and reversal shifts. J Neurosci. 2014;34(13):4618–26. https://doiorg.publicaciones.saludcastillayleon.es/10.1523/JNEUROSCI.5058-13.2014.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Dalton GL, Wang NY, Phillips AG, Floresco SB. Multifaceted contributions by different regions of the orbitofrontal and medial prefrontal cortex to probabilistic reversal learning. J Neurosci. 2016;36(6):1996–2006. https://doiorg.publicaciones.saludcastillayleon.es/10.1523/JNEUROSCI.3366-15.2016.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Daw ND, O’Doherty JP, Dayan P, Seymour B, Dolan RJ. Cortical substrates for exploratory decisions in humans. Nature. 2006;441(7095):876–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/nature04766.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. D’Cruz A-M, Ragozzino ME, Mosconi MW, Shrestha S, Cook EH, Sweeney JA. Reduced behavioral flexibility in autism spectrum disorders. Neuropsychology. 2013;27(2):152–60. https://doiorg.publicaciones.saludcastillayleon.es/10.1037/a0031721.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Deng P-Y, Klyachko VA. Genetic upregulation of BK channel activity normalizes multiple synaptic and circuit defects in a mouse model of fragile X syndrome. J Physiol. 2016;594(1):83–97. https://doiorg.publicaciones.saludcastillayleon.es/10.1113/JP271031.

    Article  CAS  PubMed  Google Scholar 

  23. Deng P-Y, Rotman Z, Blundon JA, Cho Y, Cui J, Cavalli V, Zakharenko SS, Klyachko VA. FMRP regulates neurotransmitter release and synaptic information transmission by modulating action potential duration via BK channels. Neuron. 2013;77(4):696–711. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neuron.2012.12.018.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Echeverría F, Gonzalez-Sanabria N, Alvarado-Sanchez R, Fernández M, Castillo K, Latorre R. Large conductance voltage-and calcium-activated K+ (BK) channel in health and disease. Front Pharmacol. 2024;15:1373507. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fphar.2024.1373507.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Estes WK. Toward a statistical theory of learning. Psychol Rev. 1994;101(2):282–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1037/0033-295X.101.2.282.

    Article  Google Scholar 

  26. Ferraguto C, Bouleau Y, Peineau T, Dulon D, Pietropaolo S. Hyperacusis in the adult Fmr1-KO mouse model of fragile X syndrome: the therapeutic relevance of cochlear alterations and BKCa channels. Int J Mol Sci. 2023;24(14):11863. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/ijms241411863.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Ferraguto C, Piquemal-Lagoueillat M, Lemaire V, Moreau MM, Trazzi S, Uguagliati B, Ciani E, Bertrand SS, Louette E, Bontempi B, Pietropaolo S. Therapeutic efficacy of the BKCa channel opener chlorzoxazone in a mouse model of Fragile X syndrome. Neuropsychopharmacology. 2024;49(13):2032–41. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41386-024-01956-6.

    Article  CAS  PubMed  Google Scholar 

  28. Gallistel CR, Balci F, Freestone D, Kheifets A, King A. Automated, quantitative cognitive/behavioral screening of mice: for genetics, pharmacology, animal cognition and undergraduate instruction. J Vis Exp. 2014;84:51047. https://doiorg.publicaciones.saludcastillayleon.es/10.3791/51047.

    Article  Google Scholar 

  29. Gershman SJ. Do learning rates adapt to the distribution of rewards? Psychon Bull Rev. 2015;22(5):1320–7. https://doiorg.publicaciones.saludcastillayleon.es/10.3758/s13423-014-0790-3.

    Article  PubMed  Google Scholar 

  30. Gruber AJ, Thapa R. The memory trace supporting lose-shift responding decays rapidly after reward omission and is distinct from other learning mechanisms in rats. eNeuro. 2016;3(6):1–14. https://doiorg.publicaciones.saludcastillayleon.es/10.1523/ENEURO.0167-16.2016.

    Article  Google Scholar 

  31. Guglielmi L. Update on the implication of potassium channels in autism: K+ channelautism spectrum disorder. Front Cell Neurosci. 2015;9:34. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fncel.2015.00034.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Halm ST, Bottomley MA, Almutairi MM, Di Fulvio M, Halm DR. Survival and growth of C57BL/6J mice lacking the BK channel, Kcnma1: lower adult body weight occurs together with higher body fat. Physiol Rep. 2017;5(4):e13137.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Hébert B, Pietropaolo S, Même S, Laudier B, Laugeray A, Doisne N, Quartier A, Lefeuvre S, Got L, Cahard D, Laumonnier F, Crusio WE, Pichon J, Menuet A, Perche O, Briault S. Rescue of fragile X syndrome phenotypes in Fmr1KO mice by a BKCa channel opener molecule. Orphanet J Rare Dis. 2014;9(1):124. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13023-014-0124-6.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Ito M, Doya K. Validation of decision-making models and analysis of decision variables in the rat basal ganglia. J Neurosci. 2009;29(31):9861–74. https://doiorg.publicaciones.saludcastillayleon.es/10.1523/JNEUROSCI.6157-08.2009.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Jang AI, Costa VD, Rudebeck PH, Chudasama Y, Murray EA, Averbeck BB. The role of frontal cortical and medial-temporal Lobe brain areas in learning a bayesian prior belief on reversals. J Neurosci. 2015;35(33):11751–60. https://doiorg.publicaciones.saludcastillayleon.es/10.1523/JNEUROSCI.1594-15.2015.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Khattak S, Brimble E, Zhang W, Zaslavsky K, Strong E, Ross PJ, Hendry J, Mital S, Salter MW, Osborne LR, Ellis J. Human induced pluripotent stem cell derived neurons as a model for Williams-Beuren syndrome. Mol Brain. 2015;8(1):77. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13041-015-0168-0.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Liang L, Li X, Moutton S, Schrier Vergano SA, Cogné B, Saint-Martin A, Hurst ACE, Hu Y, Bodamer O, Thevenon J, Hung CY, Isidor B, Gerard B, Rega A, Nambot S, Lehalle D, Duffourd Y, Thauvin-Robinet C, Faivre L, Wang QK. De novo loss-of-function KCNMA1 variants are associated with a new multiple malformation syndrome and a broad spectrum of developmental and neurological phenotypes. Human Mol Genet. 2019;28(17):2937–51. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/hmg/ddz117.

    Article  CAS  Google Scholar 

  38. Meredith AL. BK channelopathies and KCNMA1 -linked disease models. Annu Rev Physiol. 2024;86(1):277–300. https://doiorg.publicaciones.saludcastillayleon.es/10.1146/annurev-physiol-030323-042845.

    Article  CAS  PubMed  Google Scholar 

  39. Meredith AL, Thorneloe KS, Werner ME, Nelson MT, Aldrich RW. Overactive bladder and incontinence in the absence of the BK large conductance Ca2+-activated K+ channel. J Biol Chem. 2004;279(35):36746–52. https://doiorg.publicaciones.saludcastillayleon.es/10.1074/jbc.M405621200.

    Article  CAS  PubMed  Google Scholar 

  40. Meredith AL, Wiler SW, Miller BH, Takahashi JS, Fodor AA, Ruby NF, Aldrich RW. BK calcium-activated potassium channels regulate circadian behavioral rhythms and pacemaker output. Nat Neurosci. 2006;9(8):1041–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/nn1740.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Miller JP, Moldenhauer HJ, Keros S, Meredith AL. An emerging spectrum of variants and clinical features in KCNMA1-linked channelopathy. Channels. 2021;15(1):447–64. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/19336950.2021.1938852.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Mizoguchi H, Katahira K, Inutsuka A, Fukumoto K, Nakamura A, Wang T, Nagai T, Sato J, Sawada M, Ohira H, Yamanaka A, Yamada K. Insular neural system controls decision-making in healthy and methamphetamine-treated rats. Proc Natl Acad Sci USA. 2015;112(29):E3930-9. https://doiorg.publicaciones.saludcastillayleon.es/10.1073/pnas.1418014112.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Moldenhauer HJ, Dinsdale RL, Alvarez S, Fernández-Jaén A, Meredith AL. Effect of an autism-associated KCNMB2 variant, G124R, on BK channel properties. Curr Res Physiol. 2022;5:404–13. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.crphys.2022.09.001.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Moldenhauer HJ, Mi Park S, Meredith AL. Characterization of new human KCNMA1 loss-of-function mutations. Biophys J. 2020;118(3):114a. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.bpj.2019.11.767.

    Article  Google Scholar 

  45. Moldenhauer HJ, Tammen K, Meredith AL. Structural mapping of patient-associated KCNMA1 gene variants. Biophys J. 2024;123(14):1984–2000. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.bpj.2023.11.3404.

    Article  CAS  PubMed  Google Scholar 

  46. Mussey JL, Travers BG, Klinger LG, Klinger MR. Decision-making skills in ASD: performance on the Iowa Gambling Task. Autism Res. 2015;8(1):105–14. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/aur.1429.

    Article  PubMed  Google Scholar 

  47. Ohta H, Nozawa T, Nakano T, Morimoto Y, Ishizuka T. Nonlinear age-related differences in probabilistic learning in mice: a 5-armed bandit task study. Neurobiol Aging. 2024;142:8–16. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neurobiolaging.2024.06.004.

    Article  PubMed  Google Scholar 

  48. Ohta H, Satori K, Takarada Y, Arake M, Ishizuka T, Morimoto Y, Takahashi T. The asymmetric learning rates of murine exploratory behavior in sparse reward environments. Neural Netw. 2021;143:218–29. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neunet.2021.05.030.

    Article  PubMed  Google Scholar 

  49. Palminteri S, Lebreton M. The computational roots of positivity and confirmation biases in reinforcement learning. Trends Cogn Sci. 2022;26(7):607–21. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.tics.2022.04.005.

    Article  PubMed  Google Scholar 

  50. Park SM, Roache CE, Iffland PH, Moldenhauer HJ, Matychak KK, Plante AE, Lieberman AG, Crino PB, Meredith A. BK channel properties correlate with neurobehavioral severity in three KCNMA1-linked channelopathy mouse models. eLife. 2022;11:e77953. https://doiorg.publicaciones.saludcastillayleon.es/10.7554/eLife.77953.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Peterson DA, Elliott C, Song DD, Makeig S, Sejnowski TJ, Poizner H. Probabilistic reversal learning is impaired in Parkinson’s disease. Neuroscience. 2009;163(4):1092–101. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neuroscience.2009.07.033.

    Article  CAS  PubMed  Google Scholar 

  52. Reddy LF, Waltz JA, Green MF, Wynn JK, Horan WP. Probabilistic reversal learning in schizophrenia: stability of deficits and potential causal mechanisms. Schizophr Bull. 2016;42(4):942–51. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/schbul/sbv226.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Remijnse PL, Nielen MMA, Van Balkom AJLM, Cath DC, Van Oppen P, Uylings HBM, Veltman DJ. Reduced orbitofrontal-striatal activity on a reversal learning task in obsessive-compulsive disorder. Arch Gen Psychiatry. 2006;63(11):1225. https://doiorg.publicaciones.saludcastillayleon.es/10.1001/archpsyc.63.11.1225.

    Article  PubMed  Google Scholar 

  54. Remmelink E, Chau U, Smit AB, Verhage M, Loos M. A one-week 5-choice serial reaction time task to measure impulsivity and attention in adult and adolescent mice. Sci Rep. 2017;7:1–13. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/srep42519.

    Article  CAS  Google Scholar 

  55. Sanders J, Johnson KA, Garavan H, Gill M, Gallagher L. A review of neuropsychological and neuroimaging research in autistic spectrum disorders: attention, inhibition and cognitive flexibility. Res Autism Spectr Disorders. 2008;2(1):1–16. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.rasd.2007.03.005.

    Article  Google Scholar 

  56. Schmitt LM, Arzuaga AL, Dapore A, Duncan J, Patel M, Larson JR, Erickson CA, Sweeney JA, Ragozzino ME. Parallel learning and cognitive flexibility impairments between Fmr1 knockout mice and individuals with fragile X syndrome. Front Behav Neurosci. 2023;16:1074682. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fnbeh.2022.1074682.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Schmitt LM, Bojanek E, White SP, Ragozzino ME, Cook EH, Sweeney JA, Mosconi MW. Familiality of behavioral flexibility and response inhibition deficits in autism spectrum disorder (ASD). Molecular Autism. 2019;10(1):47. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13229-019-0296-y.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Schmunk G, Gargus JJ. Channelopathy pathogenesis in autism spectrum disorders. Front Genet. 2013. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fgene.2013.00222.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Schuetze M, Rohr CS, Dewey D, McCrimmon A, Bray S. Reinforcement learning in autism spectrum disorder. Front Psychol. 2017. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fpsyg.2017.02035.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Shannon CE. A mathematical theory of communication. Bell Syst Tech J. 1948;27(4):623–56. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/j.1538-7305.1948.tb00917.x.

    Article  Google Scholar 

  61. Skelin I, Hakstol R, Vanoyen J, Mudiayi D, Molina LA, Holec V, Hong NS, Euston DR, Mcdonald RJ, Gruber AJ. Lesions of dorsal striatum eliminate lose-switch responding but not mixed-response strategies in rats. Eur J Neurosci. 2014;39(10):1655–63. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/ejn.12518.

    Article  PubMed  Google Scholar 

  62. Solomon M, Smith AC, Frank MJ, Ly S, Carter CS. Probabilistic reinforcement learning in adults with autism spectrum disorders. Autism Res. 2011;4(2):109–20. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/aur.177.

    Article  PubMed  PubMed Central  Google Scholar 

  63. Sun AX, Yuan Q, Fukuda M, Yu W, Yan H, Lim GGY, Nai MH, D’Agostino GA, Tran H-D, Itahana Y, Wang D, Lokman H, Itahana K, Lim SWL, Tang J, Chang YY, Zhang M, Cook SA, Rackham OJL, Je HS. Potassium channel dysfunction in human neuronal models of Angelman syndrome. Science. 2019;366:1486–92. https://doiorg.publicaciones.saludcastillayleon.es/10.1126/science.aav5386.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Sutton, R. S., & Barto, A. G. (2018). Reinforcement Learning: An Introduction. MIT press.

  65. Swanson K, Averbeck BB, Laubach M. Noradrenergic regulation of two-armed bandit performance. Behav Neurosci. 2022;136(1):84–99. https://doiorg.publicaciones.saludcastillayleon.es/10.1037/bne0000495.

    Article  CAS  PubMed  Google Scholar 

  66. Tait DS, Bowman EM, Neuwirth LS, Brown VJ. Assessment of intradimensional/extradimensional attentional set-shifting in rats. Neurosci Biobehav Rev. 2018;89:72–84. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neubiorev.2018.02.013.

    Article  PubMed  Google Scholar 

  67. Tanimoto N, Sothilingam V, Euler T, Ruth P, Seeliger MW, Schubert T. BK channels mediate pathway-specific modulation of visual signals in the In Vivo mouse retina. J Neurosci. 2012;32(14):4861–6. https://doiorg.publicaciones.saludcastillayleon.es/10.1523/JNEUROSCI.4654-11.2012.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Typlt M, Mirkowski M, Azzopardi E, Ruettiger L, Ruth P, Schmid S. Mice with deficient BK channel function show impaired prepulse inhibition and spatial learning, but normal working and spatial reference memory. PLoS ONE. 2013;8(11):e81270. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pone.0081270.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Typlt M, Mirkowski M, Azzopardi E, Ruth P, Pilz PKD, Schmid S. Habituation of reflexive and motivated behavior in mice with deficient BK channel function. Front Integr Neurosci. 2013;7:79. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fnint.2013.00079.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Van Den Bos W. Better than expected or as bad as you thought? The neurocognitive development of probabilistic feedback processing. Front Human Neurosci. 2009. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/neuro.09.052.2009.

    Article  Google Scholar 

  71. Waltz JA, Kasanova Z, Ross TJ, Salmeron BJ, McMahon RP, Gold JM, Stein EA. The roles of reward, default, and executive control networks in set-shifting impairments in schizophrenia. PLoS ONE. 2013;8(2):e57257. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pone.0057257.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  72. Worthy DA, Todd Maddox W. A comparison model of reinforcement-learning and win-stay-lose-shift decision-making processes: a tribute to W.K. Estes J Math Psychol. 2014;59:41–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jmp.2013.10.001.

    Article  PubMed  Google Scholar 

  73. Zeif D, Yakobi O, Yechiam E. Choice behavior in autistic adults: What drives the extreme switching phenomenon? PLoS ONE. 2023;18(3):e0282296. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pone.0282296.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  74. Zhang G, Gibson RA, McDonald M, Liang P, Kang PW, Shi J, Yang H, Cui J, Mikati MA. A gain-of-function mutation in KCNMA1 causes dystonia spells controlled with stimulant therapy. Mov Disord. 2020;35(10):1868–73. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/mds.28138.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  75. Zhang Y, Bonnan A, Bony G, Ferezou I, Pietropaolo S, Ginger M, Sans N, Rossier J, Oostra B, LeMasson G, Frick A. Dendritic channelopathies contribute to neocortical and sensory hyperexcitability in Fmr1−/y mice. Nat Neurosci. 2014;17(12):1701–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/nn.3864.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We are grateful to T. Tobita and Y. Takarada for their excellent assistance.

Funding

This work was supported by JSPS KAKENHI Grant Numbers: JP 20K05933 and JP24K03038.

Author information

Authors and Affiliations

Authors

Contributions

H.O., T.N., and K.H. wrote the main manuscript text. H.O. acquired all data and prepared all figures. H.O. and K.H. analyzed the data. H.O., Y.S., Y.M., and T.I. designed the work. A.M. and Y.S. provided research tools and methodology and edited the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Hiroyuki Ohta.

Ethics declarations

Ethics approval and consent to participate

All animal procedures were performed in accordance with the institutional ethical guidelines for animal experiments of National Defense Medical College. The Animal Research Committee of National Defense Medical College approved the experimental procedures.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ohta, H., Nozawa, T., Higuchi, K. et al. Altered trial-to-trial responses to reward outcomes in KCNMA1 knockout mice during probabilistic learning tasks. Behav Brain Funct 20, 36 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12993-024-00262-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12993-024-00262-x

Keywords