PSYCHOLOGY Unit 3: Learning “Operant Conditioning”
What is Learning? Most learning is... Associative Learning: Realization that certain events occur together. Learning itself refers to a relatively durable change in behavior or knowledge that is due to experience. • Classical Conditioning • Operant Conditioning • Observational Learning (Latent, Abstract, Insight)
Behaviorism Everything you know, everything you are is the result of human behavior. In other words, psychology is the study of behavior, not of the mind! Picked up steam in the late 1960s and during the 1970s. A reaction to the non-scientific work of Freud.
Classical vs. Operant Both involve acquisition, discrimination, spontaneous recovery (SR), generalization and extinction. Classical Conditioning: automatic (respondent behavior). Ex.) Your dog gets sick and requires several painful trips to the vet. Now he hides every time he hears you rattle your keys. Automatic. Operant Conditioning: behavior that acts on the environment and produces consequences (operant behavior). Ex.) A teacher's comments on a test.
Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment. In rats: • trial and error learning • allows acquisition of motor programs that are not instinctive • behavior shaped by rewards • develops as a result of the association of reinforcement with a particular response • on a proportion of occasions Trial & Error → Trial & Reward → Operant Conditioning. Operant Response → Reinforcement → Learned Behavior
Edward Thorndike Law of Effect: rewarded behavior is likely to be repeated. Studied cats inside a ‘puzzle box’ and found that a well-practiced cat will find the way out. If an action brings a reward, Thorndike believed that the action becomes stamped into the mind. Behavior changes because of its consequences. Previous theories had emphasized practice or repetition. Thorndike gave equal consideration to the effects of reward or punishment, success or failure, & satisfaction or annoyance on the learner.
B.F. Skinner Instead of the antecedents of behavior (what comes before), a new focus on the consequences of behavior. B.F. Skinner argued that classical conditioning (CC) did not explain complex behavior. 2 categories of consequences: Reinforcement & Punishment. Reinforcement is designed to increase the probability that a behavior will occur again. Punishment is designed to decrease the probability that a behavior will occur again.
Shaping A procedure in Operant Conditioning that reinforces & guides behavior closer and closer towards a goal. Reinforcers guide behavior, step-by-step, closer and closer to the target behavior through successive approximations. “Baby Steps” Reinforcers Any event that STRENGTHENS the behavior it follows. There are + and – reinforcers. + Positive Reinforcers: Strengthen a response by presenting a stimulus after the response. - Negative Reinforcers: Strengthen a response by reducing or removing an aversive stimulus.
Positive Reinforcement Strengthens a response by presenting a stimulus after a response. $$$ Getting Paid! We may continue to go to work each day because we receive a paycheck on a weekly or monthly basis. ***AWARDS*** If we receive awards for writing short stories, we may write short stories more often. "PRAISE!" Receiving praise for our karaoke performances can increase how often we sing.
Negative Reinforcement Strengthens a response by reducing or removing an aversive stimulus. Driving in heavy traffic is a negative condition for most of us. You leave home earlier than usual one morning, and don't run into heavy traffic. You leave home earlier again the next morning and again you avoid heavy traffic. Your behavior of leaving home earlier is strengthened by the consequence of the avoidance of heavy traffic. The concept of Negative Reinforcement is difficult to learn because of the word negative. Negative Reinforcement is often confused with Punishment. They are very different, however. Negative Reinforcement strengthens a behavior because a negative condition is stopped or avoided as a consequence of the behavior.
Punishment Punishment, on the other hand, weakens a behavior because a negative condition is introduced or experienced as a consequence of the behavior. Punishment is often confused with negative reinforcement. Remember, reinforcement always increases the chances that a behavior will occur and punishment always decreases the chances that a behavior will occur.
Punishment Positive Punishment: This type of punishment is also known as "punishment by application." Positive punishment involves presenting an aversive stimulus after a behavior has occurred. For example, when a student talks out of turn in the middle of class, the teacher might scold the child for interrupting her.
Punishment Negative Punishment: This type of punishment is also known as "punishment by removal." Negative punishment involves taking away a desirable stimulus after a behavior has occurred. For example, when the student from the previous example talks out of turn again, the teacher promptly tells the child that he will have to miss recess because of his behavior.
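The four consequences defined so far come down to two questions: was a stimulus added or removed, and does the behavior become more or less likely? The short Python sketch below is only an illustration of that 2x2 rule. The function name `classify_consequence` and the input labels are hypothetical and do not come from the slides.

```python
def classify_consequence(stimulus_change: str, behavior_effect: str) -> str:
    """Name an operant consequence from two observations.

    stimulus_change: "added" or "removed" (what happened after the behavior)
    behavior_effect: "increases" or "decreases" (what happens to the behavior)
    Any other label raises KeyError.
    """
    # "Positive" means a stimulus is presented; "negative" means one is taken away.
    sign = {"added": "positive", "removed": "negative"}[stimulus_change]
    # Reinforcement strengthens the behavior; punishment weakens it.
    kind = {"increases": "reinforcement", "decreases": "punishment"}[behavior_effect]
    return f"{sign} {kind}"


# Examples taken from the slides, labeled by hand.
examples = {
    "paycheck for going to work": ("added", "increases"),
    "leaving early avoids heavy traffic": ("removed", "increases"),
    "teacher scolds for talking out of turn": ("added", "decreases"),
    "student misses recess": ("removed", "decreases"),
}

for description, (change, effect) in examples.items():
    print(f"{description}: {classify_consequence(change, effect)}")
```

The word "negative" only describes the stimulus being removed. Whether the consequence counts as reinforcement or punishment depends entirely on the effect on the behavior.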
Punishment also has some notable drawbacks. First, any behavior changes that result from punishment are often temporary. "Punished behavior is likely to reappear after the punitive consequences are withdrawn," Skinner explained in his book About Behaviorism. Perhaps the greatest drawback is the fact that punishment does not actually offer any information about more appropriate or desired behaviors. While subjects might be learning to not perform certain actions, they are not really learning anything about what they should be doing. Another thing to consider about punishment is that it can have unintended and undesirable consequences. For example, while approximately 75 percent of parents in the United States report spanking their children on occasion, researchers have found that this type of physical punishment can lead to antisocial behavior, aggressiveness and delinquency among children. For this reason, Skinner and other psychologists suggest that any potential short-term gains from using punishment as a behavior modification tool need to be weighed against the potential long-term consequences.
YOUTUBE VIDEO: An inspirational fan video from the movie Office Space. How can we all seize the day? Can we use psychology to improve our lives?
Positive reinforcement - when something is given (present a desirable stimulus). Negative reinforcement - when something is removed (remove an aversive stimulus). Skinner - punishment should be judicious, immediate, consistent, & severe enough actually to be a punishment.
YouTube: Schallhorn Operant Conditioning - Reinforcement & Punishment
A lot of students are confused about negative reinforcement. What's the difference between that and punishment? Remember, it's "reinforcement," so the behavior increases, and because it's "negative," an aversive stimulus is removed after the response.
Positive or Negative Reinforcement? Cleaning the house to get rid of the disgusting mess and/or to stop your mother from nagging
Positive or Negative Reinforcement? Cleaning the house to get rid of the disgusting mess and/or to stop your mother from nagging NEGATIVE REINFORCEMENT Strengthens a response by reducing or removing an aversive stimulus. Nagging/Mess as negative reinforcer to cleaning.
Positive or Negative Reinforcement? Taking aspirin to relieve a headache
Positive or Negative Reinforcement? Taking aspirin to relieve a headache NEGATIVE REINFORCEMENT Strengthens a response by reducing or removing an aversive stimulus. Headache as negative reinforcer to taking medication.
Positive or Negative Reinforcement? Listening to your favorite music after studying for an hour
Positive or Negative Reinforcement? Listening to your favorite music after studying for an hour POSITIVE REINFORCEMENT: Strengthens a response by presenting a stimulus after a response.
Positive or Negative Reinforcement? Leaving the movie theater if the movie is bad
Positive or Negative Reinforcement? Leaving the movie theater if the movie is bad NEGATIVE REINFORCEMENT Strengthens a behavior because a negative condition is stopped or avoided as a consequence of the behavior.
Positive or Negative Reinforcement? Giving in to an argument or to a child or dog’s begging
Positive or Negative Reinforcement? Giving in to an argument or to a child or dog’s begging Negative Reinforcement strengthens a behavior because a negative condition is stopped or avoided as a consequence of the behavior. Negative reinforcement is NOT the same as punishment! Negative reinforcers, like all reinforcers, increase the frequency of the responses that they follow. (Punishment, in contrast, decreases the frequency of responses.)
YouTube: Psych 101 - Operant Conditioning 'Schedules of Reinforcement'
Fixed-ratio Schedules A schedule that reinforces a response only after a specified number of responses. Examples in natural environments: Jobs that pay based on units delivered. Employees often find this schedule undesirable because it produces a rate of response that leaves them nervous and exhausted at the end of the day. They may feel pressured not to slow down or take rest breaks, since they feel that doing so will cost them money. This is an example of how a schedule can produce a high rate of response even though the response rate is aversive to the subject. Examples in video games: Collecting tokens. Many games require the player to collect a fixed number of tokens to advance to the next level, obtain a new life point, or receive some other reinforcer. Attaining a new level in an RPG. Some RPGs clearly indicate how much experience is required to achieve the next level. A high degree of certainty as to the level of work that will be required to achieve the next level puts the player on a fixed-ratio schedule.
Variable-ratio Schedule A schedule of reinforcement that reinforces a response after an unpredictable number of responses. Slot machines: Slot machines are programmed on a variable-ratio (VR) schedule. The gambler has no way of predicting how many times he must put a coin in the slot and pull the lever to hit a payoff, but the more times a coin is inserted, the greater the chance of a payout. People who play slot machines are often reluctant to leave them, especially when they have had a large number of unreinforced responses. They are concerned that someone else will win the moment they leave. Playing golf: It only takes a few good shots to encourage the player to keep playing or play again. The player is uncertain how good each shot will be, but the more often they play, the more likely they are to get a good shot. Door-to-door salesmen: It is uncertain how many houses they will have to visit to make a sale, but the more houses they try, the more likely it is that they will succeed.
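The two ratio schedules differ only in how the required number of responses is chosen, which a small simulation can make concrete. The Python sketch below is a minimal illustration, not a model from the slides: the class names `FixedRatio` and `VariableRatio` are hypothetical, and drawing the required count uniformly around a mean is an assumption made for simplicity.

```python
import random


class FixedRatio:
    """Reinforce every n-th response (e.g. pay per n units delivered)."""

    def __init__(self, n: int):
        self.n = n
        self.count = 0

    def respond(self) -> bool:
        """Register one response; return True if it is reinforced."""
        self.count += 1
        if self.count >= self.n:
            self.count = 0
            return True
        return False


class VariableRatio:
    """Reinforce after an unpredictable number of responses (e.g. a slot machine)."""

    def __init__(self, mean_n: int, rng=None):
        self.mean_n = mean_n
        self.rng = rng or random.Random()
        self._reset()

    def _reset(self) -> None:
        self.count = 0
        # Assumption: required count is uniform on 1..2*mean_n-1, so it averages mean_n.
        self.required = self.rng.randint(1, 2 * self.mean_n - 1)

    def respond(self) -> bool:
        """Register one response; return True if it is reinforced."""
        self.count += 1
        if self.count >= self.required:
            self._reset()
            return True
        return False


# Fixed ratio of 5: exactly the 5th and 10th responses are reinforced.
fr = FixedRatio(5)
print([fr.respond() for _ in range(10)])

# Variable ratio averaging 5: the payoff positions change from run to run.
vr = VariableRatio(5)
print([vr.respond() for _ in range(10)])
```

On the fixed-ratio schedule the learner can count down to the next payoff. On the variable-ratio schedule every response might be the one that pays, which fits the slot-machine player's reluctance to walk away.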
Fixed-interval Schedule A schedule of reinforcement that reinforces a response only after a specified time has elapsed. An example might be getting a raise every year and not in between. A major issue with this schedule is that people tend to improve their performance right before the time period expires so as to "look good" when the review comes around. Example: I give Bart a Butterfinger every ten minutes after he moons someone. "HAHA!" In the Real World: A weekly paycheck is a good example of a fixed-interval schedule. The employee receives reinforcement every seven days, which may result in a higher response rate as payday approaches.
Variable-interval Schedule A schedule of reinforcement that reinforces a response at unpredictable time intervals. Reinforcing someone after a variable amount of time is the final schedule. If you have a boss who checks your work periodically, you understand the power of this schedule. Because you don’t know when the next ‘check-up’ might come, you have to be working hard at all times in order to be ready. In this sense, the variable schedules are more powerful and result in more consistent behaviors. This may not be as true for punishment, since consistency in its application is so important, but for all types of reinforcement variable schedules tend to result in stronger responses.
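Interval schedules depend on elapsed time rather than on the number of responses. The Python sketch below illustrates this under stated assumptions: time is passed in as a plain number (a simulated clock, in days), the class names `FixedInterval` and `VariableInterval` are hypothetical, and the variable waiting time is drawn uniformly around a mean purely for simplicity.

```python
import random


class FixedInterval:
    """Reinforce the first response after a fixed time has elapsed (e.g. weekly pay)."""

    def __init__(self, interval: float):
        self.interval = interval
        self.available_at = interval

    def respond(self, now: float) -> bool:
        """Register a response at time `now`; return True if it is reinforced."""
        if now >= self.available_at:
            self.available_at = now + self.interval
            return True
        return False


class VariableInterval:
    """Reinforce the first response after an unpredictable wait (e.g. a boss's spot check)."""

    def __init__(self, mean_interval: float, rng=None):
        self.mean_interval = mean_interval
        self.rng = rng or random.Random()
        self.available_at = self._draw()

    def _draw(self) -> float:
        # Assumption: waits are uniform on [0, 2*mean_interval], averaging mean_interval.
        return self.rng.uniform(0, 2 * self.mean_interval)

    def respond(self, now: float) -> bool:
        """Register a response at time `now`; return True if it is reinforced."""
        if now >= self.available_at:
            self.available_at = now + self._draw()
            return True
        return False


# One response per day for two weeks on a 7-day fixed interval:
# only the responses on day 7 and day 14 are reinforced.
fi = FixedInterval(7)
print([day for day in range(1, 15) if fi.respond(day)])

# Same routine on a variable interval averaging 7 days: the reinforced days vary per run.
vi = VariableInterval(7)
print([day for day in range(1, 15) if vi.respond(day)])
```

Under the fixed interval, responses made before the interval has elapsed earn nothing, which is consistent with effort clustering just before payday or a yearly review. Under the variable interval there is no safe time to slack off.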
Punishment An event that DECREASES the behavior that it follows. Does punishment work?
Tardies & D-HALLS The Breakfast Club was released in 1985. Saturday, March 24, 1984. Shermer High School, Shermer, Illinois. 60062. Dear Mr. Vernon, We accept the fact that we had to sacrifice a whole Saturday in detention for whatever it was that we did wrong… what we did was wrong, but we think you’re crazy to make us write this essay telling you who we think we are. What do you care? You see us as you want to see us… in the simplest terms & the most convenient definitions. You see us as a brain, an athlete, a basket case, a princess & a criminal. Correct? That’s the way we saw each other at seven o’clock this morning. We were brainwashed.