11/05/2025
More education at work, this time on how all 4 quadrants are used in the example of herding! Balanced training utilizes all 4 quadrants to help dogs shift their behavior in a way they easily understand, because it's how they (and all animals) naturally operate. Great read from a different perspective!
The Quadrants of Learning (and How They Show Up in Herding)
You can read more in the brand new second edition of Urban Sheepdog. Order here: https://amzn.to/3Kx3yK0
Few things in dog training get mixed up as much as the four quadrants of learning, but they aren't opinions or methods. They're just a way to describe what happens after a behaviour, and whether that behaviour becomes more or less likely next time.
The other day, there was a post in a herding group with so many comments trying to unpack how the quadrants apply in herding. Some suggested it was all positive reinforcement, some said it wasn't. Some thought herding is void of the quadrants.
Let's unpack what is actually taking place!
• Positive reinforcement: You add something the dog wants, and the behaviour increases. Example: You give your dog a treat for sitting, and they sit more often.
• Negative reinforcement: You remove something the dog doesn't want, and the behaviour increases. Example: You loosen leash pressure when your dog stops pulling, so they learn that staying close makes the discomfort go away.
• Positive punishment: You add something the dog doesn't want, and the behaviour decreases. Example: You say "hey!" sharply when they jump up, and they stop jumping as much.
• Negative punishment: You remove something the dog wants, and the behaviour decreases. Example: You stop the game when they bite too hard, so they learn that rough play makes the fun end.
That's all it is.
"Positive" and "negative" mean add or remove, like math. It's not "positive is good" and "negative is bad." "Reinforcement" means the behaviour goes up. "Punishment" means it goes down.
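For anyone who likes to see it laid out as a grid, the two questions (was something added or removed, and did the behaviour go up or down) can be written as a tiny lookup. This is only an illustrative sketch: the names Stimulus, Effect, and quadrant are made up for this example and are not standard terminology.

```python
from enum import Enum


class Stimulus(Enum):
    """What happened right after the behaviour (hypothetical naming)."""
    ADDED = "positive"    # something was added
    REMOVED = "negative"  # something was taken away


class Effect(Enum):
    """What happens to the behaviour afterwards (hypothetical naming)."""
    INCREASES = "reinforcement"  # behaviour becomes more likely
    DECREASES = "punishment"     # behaviour becomes less likely


def quadrant(stimulus: Stimulus, effect: Effect) -> str:
    """Name the quadrant from the two answers; no good/bad judgement involved."""
    return f"{stimulus.value} {effect.value}"


# The four examples from the list above.
print(quadrant(Stimulus.ADDED, Effect.INCREASES))    # treat for sitting
print(quadrant(Stimulus.REMOVED, Effect.INCREASES))  # leash pressure released
print(quadrant(Stimulus.ADDED, Effect.DECREASES))    # sharp "hey!"
print(quadrant(Stimulus.REMOVED, Effect.DECREASES))  # game stops
```

Notice that the label depends only on those two answers, not on whether the consequence feels nice or harsh.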
Now, picture a herding dog on stock. The learning theory is happening constantly:
When a handler steps in toward the dog, swings a stick, or uses a sharp tone, that's positive punishment: something unpleasant is added to make the current behaviour (like diving in too close or gripping) less likely.
When the dog changes their behaviour by backing off, giving space, or finding balance, and the "pressure" or correction stops, that's negative reinforcement: the removal of something the dog finds aversive makes that better behaviour more likely next time.
When a dog works well and the handler lets them keep working or praises quietly, that's positive reinforcement: the praise is added, and the sheep themselves act as the reinforcer. Something the dog wants is added, and the work continues because the dog's choices keep paying off.
When a dog loses the chance to work because they ignored cues or got too wound up, that's negative punishment: the thing they wanted most (the sheep) disappears, so that behaviour is less likely.
The Sheep Are Learning Too!
Learning theory doesn't just apply to the dog. The sheep are also responding to consequences in real time. Every movement from the dog or handler changes what they feel, want, or avoid, and that shapes their behaviour too.
Negative reinforcement: When they move away from the dog, and the dog eases up on its intensity, the "pressure" from the dog decreases. The removal of that discomfort (the dog's eye, movement, or proximity) makes them more likely to respond in the same way next time.
Positive punishment: If they challenge the dog or refuse to move, and the dog rushes in, grips, or blocks hard, something unpleasant is added. That makes the bold behaviour less likely.
Negative punishment: If a sheep drifts too far from the group and loses the safety of the flock, the loss itself is punishing, and they're more likely to stay closer next time.
So while the dog is learning how to influence the sheep, the sheep are learning how to respond to the dog.
The whole system is built on feedback loops of what they call "pressure and release", which is really just the quadrants!
It's happening to the humans, too. We buy a bunch of sheep, realize how expensive hay is, lose money, and stop buying sheep. That's negative punishment: the loss of something good ($) decreases our behaviour in the future!