R+ Dog Training
Home About Lexicon Examples Guide Myths Punishment Rescue dogs DAPS Contact

 

Examples -- applying operant training:

 


Training the sit

Training the recall

Positive reinforcement R+

Dog sits (either on cue or in free shaping with no cue)

Trainer then gives the dog a treat

Dog comes (cue or no cue)

Trainer then gives the dog a treat

R+ with event marker

The dog sits (cue or no cue).

The instant the dog's fanny hits the ground,

the trainer clicks. 

Trainer then gives the dog a treat.

Trainer cues the dog to come.

The instant the dog turns toward the trainer,

the trainer clicks.**

When the dog comes in, it gets a treat.

Luring (non-operant)

The trainer pulls out a treat and

holds it high above the dog's head.

In trying to reach the treat, the dog rocks back and sits.

The dog then gets to eat the treat

The dog is running away or chasing a cat.

The trainer gets out a treat and yells
"Lassie, Cookie!!"

Dog comes in and gets the treat (you hope).

Negative punishment P-

Trainer cues the dog to sit, but the dog stays standing.

Trainer leaves the room with treats in hand. 

Dog loses the opportunity to be rewarded.

Trainer cues the dog to come.  Dog still chases cat.

Trainer catches dog and removes the dog from the presence of the cat and removes its ability to run and play.

Negative reinforcement* R-/P+

Trainer commands the dog to sit.

If the dog does not quickly comply, the trainer jerks sharply and upward on the leash.  The choke or pinch collar tightens and the dog feels uncomfortable. The force of the jerk upward makes sitting an easy choice for the dog. 

The pressure is then relieved. 

Dog is running away or chasing a cat.

Trainer commands the dog to come. 

If the dog does not comply the trainer closes

the button on the flexilead or steps on the long line.

The dog experiences a sharp jerk and turns toward the handler, relieving the pressure from the collar and leash.

The handler then praises the dog as it comes in.**

Physical "modelling"

(non-operant)

Trainer pushes dog's behind downward.

Under this force, the dog sits.

Trainer rewards or praises dog.

Dog is out running after a cat.

Trainer reels the dog in with the long line,

saying "come" as she does so, then rewards or praises.

 

*In practice there is no distinction between negative reinforcement and positive punishment, which is not the case for R+ and P-.  In  another example, the dog is pulling on the leash.  As the dog pulls, the trainer pops the leash sharply and repeatedly for as long as the dog is pulling, causing the dog discomfort while it is pulling.  As soon as the dog stops pulling and thus starts to walk on a loose leash, the trainer stops the jerking.  Positive punishment happened when the discomfort of the jerking and tightening the collar suppressed the pulling.  Negative reinforcement happened when taking away the jerking strengthened the behavior of walking on a loose leash. 

 

** In each of these examples, there is a secondary or conditioned reinforcer.  In the R+ example, the clicker (or other event marker)  is a conditioned reinforcer meaning that food or play is coming.  In the R- example, the praise is a conditioned reinforcer meaning that no discomfort will be applied while the praise is occurring.

 

Other notes:

1.  The word "positive" is unfortunately used in two completely distinct senses in animal training.  The first use comes directly from plain English usage, that is, "positive" means something good or desirable.  The second use is a formal, strict definition in operant behavior analysis and means something "added to".  If that something added is meant to suppress a behavior, as in positive punishment, then it is anything but good or desirable

.

2.  "Positive" dog training can therefore mean a variety of things.  Just because someone says they are "positive" does not mean that aversives are not also used.  In fact, "positive" traditional training is a particular mix of the use of "rewards" or things that the dog desires as lures or mood changers (and not as operant conditioned reinforcers) and corrections.  Traditional dog training does use operant conditioning, but specifically that using aversives or corrections, such as leash pops, verbal reprimand, intimidation, or electronic stimuli.  In other words, traditional dog training is a mixture of operant and non-operant methods, and of positive and aversive elements.

 

If food or play is used in a non-operant fashion, i.e., as a lure, it is extremely difficult to fade out.  For this reason, the mix of reward and punishment is ensured, because the operant conditioning methods of R-/P+ are required to strengthen desired behaviors.

 

However, the price that this blend of methods exacts is high.  The dog may perform well and reliably, but the dog's attitude is often not the same as a dog trained with R+/P-.  In fact, many breeds of dogs and many individuals in any breed maybe passed off as untrainable with R-/P+ methods,  written off as "stubborn", "stupid", "willful", "has own agenda", "soft", and so on. 

 

Karen Pryor has written eloquently on this topic.  Her article, Poisoning the Cue [Teaching Dogs, 2002, vol.1(1)], is the only place I know of where the concepts of mixing positive and negative "discriminative stimuli" are explained. 

 

© Catherine Toft 2007