Reinforcement is any consequence that increases the likelihood of a behavior happening again. It’s one of the most fundamental concepts in psychology, rooted in a simple idea: behaviors followed by favorable outcomes tend to be repeated, while behaviors that produce no meaningful outcome tend to fade. The concept was formalized by B.F. Skinner as part of operant conditioning, but the underlying principle shapes everything from how children learn to share to why you check your phone so often.
How Reinforcement Works
The basic mechanism is straightforward. A person or animal does something, something happens afterward, and that consequence either makes the behavior more or less likely in the future. Reinforcement specifically refers to consequences that make behavior more likely. Punishment is the opposite, aimed at decreasing behavior.
This idea has roots going back to 1905, when psychologist Edward Thorndike proposed his “law of effect”: the probability that a stimulus will repeatedly trigger a learned response depends on the perceived consequences of that response. Skinner built on this foundation and formalized it into operant conditioning, which he saw as the explanatory basis of much of human behavior. The key insight is that organisms don’t just respond to their environment passively. They operate on it, and the results of those operations shape what they do next.
Positive vs. Negative Reinforcement
The words “positive” and “negative” here don’t mean “good” and “bad.” They mean “adding” and “removing.” Both types increase the target behavior.
Positive reinforcement means adding something desirable after a behavior. A child gets a sticker for telling the truth. A dog gets a treat for coming when called. An employee receives a raise after strong performance reviews. In each case, something pleasant is introduced, making the behavior more likely to happen again.
Negative reinforcement means removing something unpleasant after a behavior. You buckle your seat belt to stop the car’s beeping sound. You put on sunscreen to avoid a sunburn. You leave for work early to dodge traffic. The behavior increases not because something good was added, but because something annoying or uncomfortable was taken away.
People often confuse negative reinforcement with punishment, but they produce opposite effects. Negative reinforcement still increases behavior. Punishment, whether it involves adding something unpleasant (a reprimand) or removing something desirable (taking away screen time), aims to decrease behavior.
Primary and Secondary Reinforcers
Not all reinforcers are created equal. Primary reinforcers have innate biological value. You don’t need to learn that food, water, sleep, shelter, sex, and physical touch are rewarding. They satisfy survival needs directly.
Secondary reinforcers, on the other hand, have no inherent value. They only become reinforcing through their association with primary reinforcers. Money is the clearest example: it’s just paper or numbers on a screen, but it works as a powerful reinforcer because you can exchange it for food, shelter, and other things you need. Praise functions as a secondary reinforcer because it’s linked to social connection and affection. Stickers on a behavior chart, tokens in a reward system, grades in school: all secondary reinforcers that gain their power through learned associations.
Schedules of Reinforcement
Reinforcement doesn’t have to happen every single time a behavior occurs, and the pattern of when it’s delivered dramatically affects how the behavior looks and how resistant it becomes to fading. Skinner identified four primary schedules.
- Fixed-ratio: Reinforcement comes after a set number of responses. A factory worker paid per 50 units produced, or a coffee shop punch card that earns a free drink after 10 purchases. These schedules generate high response rates, sometimes exceeding 10 responses per second in laboratory settings. However, as the required number of responses increases, people and animals tend to pause longer after each reinforcement before starting again.
- Variable-ratio: Reinforcement comes after a random number of responses, averaging around a set value. Slot machines are the classic example. Because you never know which pull will pay off, this schedule produces fast, steady responding and is the hardest to extinguish. It’s also why social media feeds are so compelling: the next scroll might deliver something interesting, or it might not.
- Fixed-interval: Reinforcement is available for the first response after a set time period. Checking the oven when you know the timer will go off in 20 minutes, or studying harder as an exam approaches. People tend to respond slowly right after reinforcement and accelerate as the next interval ends.
- Variable-interval: Reinforcement is available after random time periods averaging around a set value. Checking your email is a good example: new messages arrive at unpredictable times. This schedule produces slow, steady responding that can remain remarkably consistent for hundreds of hours.
The schedule matters enormously for real-world applications. Behaviors reinforced on variable schedules are far more persistent than those reinforced every time, which is why intermittent rewards (from gambling to inconsistent parenting) can be so powerfully habit-forming.
What Happens in the Brain
Reinforcement isn’t just a behavioral concept. It has a biological basis centered on dopamine, a chemical messenger in the brain. A pathway running from deep in the brainstem to a region called the nucleus accumbens plays a central role in motivation-related functions: activating behavior, sustaining effort, driving approach toward goals, and supporting learning from consequences.
Interestingly, this system is more nuanced than the popular “reward chemical” label suggests. Dopamine in this pathway doesn’t simply produce pleasure. It’s involved in behavioral activation and the willingness to exert effort, which means it helps determine not just whether something feels good, but whether you’ll work to get it. This distinction matters because it explains why reinforcement isn’t just about enjoyment. It’s about the motivational push that keeps behavior going.
Extinction: When Reinforcement Stops
If a behavior stops being reinforced, it will generally decrease over time and may eventually disappear. This process is called extinction, and it’s considered just as important as reinforcement in helping organisms adapt to changing conditions.
Extinction isn’t always clean or permanent, though. A behavior that has been reduced through extinction will often show spontaneous recovery, meaning it reappears at the start of a new session or day even without being reinforced again. Behaviors can also recover when the context changes: a habit extinguished in one environment may return in the original setting where it was first learned, or even in a completely novel one. These effects explain why breaking habits can feel like a process of repeated effort rather than a single decision.
Reinforcement in Everyday Life
Reinforcement principles are applied deliberately in several fields. Applied behavior analysis (ABA), widely used with children on the autism spectrum, is built on the idea that reinforcing specific behaviors increases them while withholding reinforcement for other behaviors causes them to fade. Effective ABA programs use natural forms of reinforcement tied to the activity itself. If a child is working with building blocks, the reward might be getting to knock the tower down, rather than receiving an unrelated treat.
In classrooms, research on behavior management suggests teachers should aim for at least four positive statements for every reprimand. This ratio keeps the learning environment focused on reinforcing desired behavior rather than simply punishing unwanted behavior. Token economies, where students earn tokens exchangeable for privileges or prizes, apply secondary reinforcement in a structured way.
When Reinforcement Can Backfire
External reinforcement doesn’t always produce the intended long-term effect. A well-established body of research shows that virtually every type of expected tangible reward made contingent on task performance undermines intrinsic motivation, the internal drive to do something because it’s inherently interesting or satisfying. When people receive external rewards for activities they already enjoy, they can shift from feeling self-directed to feeling controlled, and their interest in the activity drops once the rewards stop.
This effect extends beyond tangible rewards. Threats, deadlines, rigid directives, and competitive pressure can also diminish intrinsic motivation for the same reason: they make people feel their behavior is being controlled from outside rather than chosen freely. The key factor seems to be autonomy. Reinforcement that supports a person’s sense of competence and self-direction can actually enhance intrinsic motivation. But reinforcement that feels coercive or controlling tends to erode it. This is why the best applications of reinforcement in education and parenting tend to emphasize choice, natural consequences, and feedback that builds a sense of mastery rather than dependence on external rewards.