Related: operant conditioning

WordNet

equip with instruments for measuring, recording, or controlling
the means whereby some act is accomplished; "my greed was the instrument of my destruction"; "science has given us new tools to fight disease" (同)tool
a device that requires skill for proper use
a person used by another to gain an end (同)pawn, cat's-paw
write an instrumental score for (同)instrumentate
address a legal document to
relating to or designed for or performed on musical instruments; "instrumental compositions"; "an instrumental ensemble"
the cognitive process of acquiring skill or knowledge; "the child's acquisition of language" (同)acquisition

PrepTutorEJDIC

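(chiefly precision) instrument, apparatus, gauge; (also "musical instrument") musical instrument / means, expedient; (a person's) pawn / legal document (deed, will, agreement, promissory note, etc.)
helpful; instrumental (in doing something) (+ in doing) / of musical instruments, instrumental
the act of learning (something), the study (of something) (+ of + noun) / scholarship, learning, erudition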

Wikipedia preview

Source (authority): Wikipedia, the free encyclopedia, "2015/06/11 01:14:41" (JST)

Diagram of operant conditioning

Operant conditioning differs from classical conditioning in that it is more complex, integrating positive and negative conditioning into its practices, whereas classical conditioning focuses on either positive or negative conditioning but not both together. Operant conditioning is also called instrumental learning. Instrumental conditioning was first discovered and published by Jerzy Konorski and was also referred to as Type II reflexes. Mechanisms of instrumental conditioning suggest that behavior may change in form, frequency, or strength. The expressions "operant behavior" and "respondent behavior" were popularized by B.F. Skinner, who worked on reproducing Konorski's experiments. Operant behavior means that "a response is followed by a reinforcing stimulus".

Operant behavior operates on the environment and is maintained by its antecedents and consequences, while classical conditioning is maintained by conditioning of reflexive (reflex) behaviors, which are elicited by antecedent conditions. Behaviors conditioned through a classical conditioning procedure are not maintained by consequences.^[1] Both, however, form the core of behavior analysis and have grown into professional practices. Operant conditioning is simple to understand: learning is achieved through trial and error. A reward for overcoming an obstacle can provide the motivation needed to continue succeeding.

1 Historical notes
- 1.1 Thorndike's law of effect
- 1.2 Skinner
2 Tools and procedures
- 2.1 To shape behavior: antecedents and consequences
  - 2.1.1 Some other common terms and procedures
- 2.2 Operant conditioning to change human behavior
3 Factors that alter the effectiveness of consequences
4 Operant variability
5 Avoidance learning
- 5.1 Discriminated avoidance learning
- 5.2 Free-operant avoidance learning
- 5.3 Two-process theory of avoidance
- 5.4 One-factor Theory
6 Four term contingency
7 Operant hoarding
8 Biological correlates of operant conditioning
9 Operant conditioning in economics
10 Questions about the law of effect
11 See also
12 References
13 External links

Historical notes

Thorndike's law of effect

Operant conditioning, sometimes called instrumental learning, was first extensively studied by Edward L. Thorndike (1874–1949), who observed the behavior of cats trying to escape from home-made puzzle boxes.^[2] When first constrained in the boxes, the cats took a long time to escape. With experience, ineffective responses occurred less frequently and successful responses occurred more frequently, enabling the cats to escape in less time over successive trials. In his law of effect, Thorndike theorized that behaviors followed by satisfying consequences tend to be repeated and those that produce unpleasant consequences are less likely to be repeated. In short, some consequences strengthened behavior and some consequences weakened behavior. Thorndike produced the first known animal learning curves through this procedure.^[3]

Humans learn most of their activities through operant conditioning. Often the conditioning is not deliberately arranged to teach anyone; instead, it simply falls into place. Operant conditioning is a natural parenting technique that has been used for thousands of years;^[4] however, the psychological phenomenon of operant conditioning was not specifically noted until Thorndike's studies in the early 20th century.

Skinner

B.F. Skinner (1904–1990) is often referred to as the father of operant conditioning, and his work is the most often cited in connection with this topic. His book "The Behavior of Organisms",^[5] published in 1938, initiated his lifelong study of operant conditioning and its application to human and animal behavior. Following the ideas of Ernst Mach, Skinner rejected Thorndike's reference to unobservable mental states such as satisfaction, building his analysis on observable behavior and its equally observable consequences.^[6]

To implement his empirical approach, Skinner invented the operant conditioning chamber, in which subjects such as pigeons and rats were isolated from extraneous stimuli and free to make one or two simple, repeatable responses.^[7] This was similar to Thorndike's puzzle box and became known as the Skinner box. Another invention, the cumulative recorder, produced a graphical record of these responses from which response rates could be estimated. These records were the primary data that Skinner and his colleagues used to explore the effects of various reinforcement schedules on response rate.^[8] A reinforcement schedule may be defined as "any procedure that delivers reinforcement to an organism according to some well-defined rule".^[9] The principle of reinforcement has been summarized as: "behavior which is reinforced tends to be repeated (i.e. strengthened); behavior which is not reinforced tends to die out, or be extinguished (i.e. weakened)." The effects of schedules became, in turn, the basic experimental data from which Skinner developed his account of operant conditioning. He also drew on many less formal observations of human and animal behavior.^[10]

Many of Skinner's writings are devoted to the application of operant conditioning to human behavior.^[11] In 1957, Skinner published Verbal Behavior,^[12] which extended the principles of operant conditioning to language, a form of human behavior that had previously been analyzed quite differently by linguists and others. Skinner defined new functional relationships such as "mands" and "tacts" to capture the essentials of language, but he introduced no new principles, treating verbal behavior like any other behavior controlled by its consequences, which included the reactions of the speaker's audience.

Tools and procedures

To shape behavior: antecedents and consequences

Antecedents, together with the consequences that follow behavior (reinforcement and punishment), are the core tools of operant conditioning. Some terminology in operant conditioning is used differently from everyday usage.

"Antecedent stimuli" occur before a behavior happens.

"Reinforcement" and "punishment" refer to their effect on the target behavior.

Reinforcement increases the probability of a behavior being expressed.
Punishment reduces the probability of a behavior being expressed.

"Positive" and "negative" refer to the presence or absence of the stimulus.

Positive is the addition of a stimulus.
Negative is the removal or absence of a stimulus (often aversive).

There is an additional procedure:

Extinction is caused by the lack of any consequence following a behavior. When a behavior is inconsequential (i.e., producing neither favorable nor unfavorable consequences) it will occur less frequently. When a previously reinforced behavior is no longer reinforced with either positive or negative reinforcement, it leads to a decline (extinction) in that behavior.

This creates a total of five basic consequences:

Positive reinforcement (reinforcement): Occurs when a behavior (response) is followed by a stimulus that is appetitive or rewarding, increasing the frequency of that behavior. In the Skinner box experiment, a stimulus such as food or a sugar solution can be delivered when the rat engages in a target behavior, such as pressing a lever. This procedure is usually called simply reinforcement.
Negative reinforcement (escape): Occurs when a behavior (response) is followed by the removal of an aversive stimulus, thereby increasing that behavior's frequency. In the Skinner box experiment, negative reinforcement can be a loud noise continuously sounding inside the rat's cage until it engages in the target behavior, such as pressing a lever, upon which the loud noise is removed.
Positive punishment (punishment) (also called "Punishment by contingent stimulation"): Occurs when a behavior (response) is followed by a stimulus, such as introducing a shock or loud noise, resulting in a decrease in that behavior. Positive punishment is sometimes a confusing term, as it denotes the "addition" of a stimulus or increase in the intensity of a stimulus that is aversive (such as spanking or an electric shock). This procedure is usually called simply punishment.
Negative punishment (penalty) (also called "Punishment by contingent withdrawal"): Occurs when a behavior (response) is followed by the removal of a stimulus, such as taking away a child's toy following an undesired behavior, resulting in a decrease in that behavior.
Extinction: Occurs when a behavior (response) that had previously been reinforced is no longer effective. For example, a rat is first given food many times for lever presses. Then, in "extinction", no food is given. Typically the rat continues to press more and more slowly and eventually stops, at which time lever pressing is said to be "extinguished."
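The five consequences above can be read as a small lookup on two questions: what happens to the stimulus, and what happens to the behavior. The following is a minimal Python sketch of that reading; the function name and the string labels ("added", "removed", "none", "increases", "decreases") are illustrative assumptions, not standard terminology.

```python
def classify_consequence(stimulus_change: str, behavior_change: str) -> str:
    """Name the basic operant consequence.

    stimulus_change: "added", "removed", or "none" (hypothetical labels)
    behavior_change: "increases" or "decreases" (hypothetical labels)
    """
    table = {
        ("added", "increases"): "positive reinforcement",
        ("removed", "increases"): "negative reinforcement",
        ("added", "decreases"): "positive punishment",
        ("removed", "decreases"): "negative punishment",
        # Extinction: a previously reinforced response no longer has any consequence.
        ("none", "decreases"): "extinction",
    }
    key = (stimulus_change, behavior_change)
    if key not in table:
        raise ValueError(f"no basic consequence matches {key}")
    return table[key]


# Food delivered after a lever press, and pressing becomes more frequent.
print(classify_consequence("added", "increases"))
# A toy taken away after an undesired behavior, and the behavior becomes less frequent.
print(classify_consequence("removed", "decreases"))
```

Note that the classification depends on the observed effect on behavior, not on whether the stimulus seems pleasant or unpleasant to an observer.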

It is important to note that actors are not spoken of as being reinforced, punished, or extinguished; it is the actions that are reinforced, punished, or extinguished. Additionally, reinforcement, punishment, and extinction are not terms whose use is restricted to the laboratory. Naturally occurring consequences can also be said to reinforce, punish, or extinguish behavior and are not always delivered by people.

Some other common terms and procedures

Escape and avoidance: In escape learning, a behavior terminates an (aversive) stimulus. For example, shielding one's eyes from sunlight terminates the (aversive) stimulation of bright light in one's eyes. In avoidance learning, the behavior precedes and prevents an (aversive) stimulus, for example putting on sunglasses before going outdoors. Because, in avoidance, the stimulation does not occur, avoidance behavior seems to have no means of reinforcement. Indeed, this non-occurrence of the stimulus has been a problem for reinforcement theory, which has been dealt with in various ways. See the section on avoidance learning below.
Noncontingent reinforcement refers to delivery of reinforcing stimuli regardless of the organism's behavior. Noncontingent reinforcement may be used in an attempt to reduce an undesired target behavior by reinforcing multiple alternative responses while extinguishing the target response.^[13] As no measured behavior is identified as being strengthened, there is controversy surrounding the use of the term noncontingent "reinforcement".^[14]

Schedules of reinforcement: Schedules of reinforcement are rules that control the delivery of reinforcement. The rules specify either the time at which reinforcement is to be made available, or the number of responses to be made, or both.
- Fixed interval schedule: Reinforcement occurs following the first response after a fixed time has elapsed after the previous reinforcement.
- Variable interval schedule: Reinforcement occurs following the first response after a variable time has elapsed from the previous reinforcement.
- Fixed ratio schedule: Reinforcement occurs after a fixed number of responses have been emitted since the previous reinforcement.
- Variable ratio schedule: Reinforcement occurs after a variable number of responses have been emitted since the previous reinforcement.
- Continuous reinforcement: Reinforcement occurs after each response.^[15]
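The schedule rules listed above can be sketched as small Python classes, each deciding whether a given response is reinforced. This is an illustrative sketch only: the class names, the `respond(t)` method, the choice to start interval timing at session time 0, and the uniform distributions used for the variable schedules are all assumptions made for the example, not part of any standard apparatus.

```python
import random


class FixedRatio:
    """Reinforce every n-th response since the previous reinforcement."""

    def __init__(self, n: int):
        if n < 1:
            raise ValueError("n must be at least 1")
        self.n = n
        self.count = 0

    def respond(self, t: float) -> bool:
        self.count += 1
        if self.count >= self.n:
            self.count = 0
            return True
        return False


class VariableRatio(FixedRatio):
    """Like FixedRatio, but the requirement is redrawn after each reinforcement.

    Assumption: requirements are uniform on 1..(2*mean_n - 1), so their mean is mean_n.
    """

    def __init__(self, mean_n: int, rng: random.Random):
        self.mean_n = mean_n
        self.rng = rng
        super().__init__(self._draw())

    def _draw(self) -> int:
        return self.rng.randint(1, 2 * self.mean_n - 1)

    def respond(self, t: float) -> bool:
        reinforced = super().respond(t)
        if reinforced:
            self.n = self._draw()
        return reinforced


class FixedInterval:
    """Reinforce the first response after `interval` time units have elapsed."""

    def __init__(self, interval: float):
        self.interval = interval
        self.last_reinforcement = 0.0  # assumption: timing starts at session start

    def respond(self, t: float) -> bool:
        if t - self.last_reinforcement >= self.interval:
            self.last_reinforcement = t
            return True
        return False


class VariableInterval(FixedInterval):
    """Like FixedInterval, but the interval is redrawn after each reinforcement.

    Assumption: intervals are uniform on [0, 2*mean_interval].
    """

    def __init__(self, mean_interval: float, rng: random.Random):
        self.mean_interval = mean_interval
        self.rng = rng
        super().__init__(self._draw())

    def _draw(self) -> float:
        return self.rng.uniform(0.0, 2.0 * self.mean_interval)

    def respond(self, t: float) -> bool:
        reinforced = super().respond(t)
        if reinforced:
            self.interval = self._draw()
        return reinforced


# Usage: six responses on a fixed ratio 3 schedule.
fr3 = FixedRatio(3)
print([fr3.respond(t) for t in range(1, 7)])
# -> [False, False, True, False, False, True]
```

In this sketch, continuous reinforcement is simply `FixedRatio(1)`, since every response then meets the requirement.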

Discrimination, generalization & context. Most behavior is under stimulus control. Several aspects of this may be distinguished:
- "Discrimination" typically occurs when a response is reinforced only in the presence of a specific stimulus. For example, a pigeon might be fed for pecking at a red light and not at a green light; in consequence, it pecks at red and stops pecking at green. Many complex combinations of stimuli and other conditions have been studied; for example an organism might be reinforced on an interval schedule in the presence of one stimulus and on a ratio schedule in the presence of another.
- "Generalization" is the tendency to respond to stimuli that are similar to a previously trained discriminative stimulus. For example, having been trained to peck at "red" a pigeon might also peck at "pink", though usually less strongly.
- "Context" refers to stimuli that are continuously present in a situation, like the walls, tables, chairs, etc. in a room, or the interior of an operant conditioning chamber. Context stimuli may come to control behavior as do discriminative stimuli, though usually more weakly. Behaviors learned in one context may be absent, or altered, in another. This may cause difficulties for behavioral therapy, because behaviors learned in the therapeutic setting may fail to occur elsewhere.

Operant conditioning to change human behavior

Researchers have found the following protocol to be effective when they use the tools of operant conditioning to modify human behavior:^[citation needed]

State goal (aims for the study): Clarify exactly what changes are to be brought about. For example, "reduce weight by 30 pounds."
Monitor behavior (log conditions): Keep track of behavior so that one can see whether the desired effects are occurring. For example, keep a chart of daily weights.
Reinforce desired behavior (give reward for proper behavior): For example, congratulate the individual on weight losses. With humans, a record of behavior may serve as a reinforcement. For example, when a participant sees a pattern of weight loss, this may reinforce continuance in a behavioral weight-loss program. A more general plan is the token economy, an exchange system in which tokens are given as rewards for desired behaviors. Tokens may later be exchanged for a desired prize or rewards such as power, prestige, goods or services.
Reduce incentives to perform undesirable behavior: For example, remove candy and fatty snacks from kitchen shelves.

Factors that alter the effectiveness of consequences

When using consequences to modify a response, the effectiveness of a consequence can be increased or decreased by various factors. These factors can apply to either reinforcing or punishing consequences.

Satiation/Deprivation: The effectiveness of a consequence will be reduced if the individual's "appetite" for that source of stimulation has been satisfied. The opposite effect will occur if the individual becomes deprived of that stimulus: the effectiveness of a consequence will then increase. If someone is not hungry, food will not be an effective reinforcer for behavior. Satiation is generally only a potential problem with primary reinforcers, those that do not need to be learned such as food and water.^[16]
Immediacy: After a response, how immediately a consequence is then felt determines the effectiveness of the consequence. More immediate feedback will be more effective than less immediate feedback. If someone's license plate is caught by a traffic camera for speeding and they receive a speeding ticket in the mail a week later, this consequence will not be very effective against speeding. But if someone is speeding and is caught in the act by an officer who pulls them over, then their speeding behavior is more likely to be affected.^[17]
Contingency: If a consequence does not contingently (reliably, or consistently) follow the target response, its effectiveness upon the response is reduced. But if a consequence follows the response consistently after successive instances, its ability to modify the response is increased. A consistent schedule of reinforcement leads to faster learning; a variable schedule leads to slower learning. Behavior learned under intermittent reinforcement is more difficult to extinguish, while behavior learned under a highly consistent schedule is extinguished more easily.^[16]
Size: This is a "cost-benefit" determinant of whether a consequence will be effective. If the size, or amount, of the consequence is large enough to be worth the effort, the consequence will be more effective upon the behavior. An unusually large lottery jackpot, for example, might be enough to get someone to buy a one-dollar lottery ticket (or even multiple tickets). But if a lottery jackpot is small, the same person might not feel it to be worth the effort of driving out and finding a place to buy a ticket. In this example, it is also useful to note that "effort" is a punishing consequence. How these opposing expected consequences (reinforcing and punishing) balance out will determine whether the behavior is performed or not.

Most of these factors have biological explanations. The biological purpose of the Principle of Satiation is to maintain the organism's homeostasis (an organism's ability to maintain a stable internal environment). When an organism has been deprived of sugar, for example, the effectiveness of the taste of sugar as a reinforcer is high. However, as the organism reaches or exceeds its optimum blood-sugar level, the taste of sugar becomes less effective, perhaps even aversive.

The Principles of Immediacy and Contingency exist for neurochemical reasons. When an organism experiences a reinforcing stimulus, dopamine pathways in the brain are activated. This network of pathways "releases a short pulse of dopamine onto many dendrites, thus broadcasting a rather global reinforcement signal to postsynaptic neurons."^[18] This allows recently activated synapses to increase their sensitivity to efferent (conducted or conducting outward or away from something) signals, thus increasing the probability of occurrence for the recent responses that preceded the reinforcement. These responses are, statistically, the most likely to have been the behavior responsible for successfully achieving reinforcement. But when the application of reinforcement is either less immediate or less contingent (less consistent), the ability of dopamine to act upon the appropriate synapses is reduced.

Operant variability

Operant variability is what allows a response to adapt to new situations. Operant behavior is distinguished from reflexes in that its response topography (the form of the response) is subject to slight variations from one performance to another. These slight variations can include small differences in the specific motions involved, differences in the amount of force applied, and small changes in the timing of the response. If a subject's history of reinforcement is consistent, such variations will remain stable because the same successful variations are more likely to be reinforced than less successful variations. However, behavioral variability can also be altered when subjected to certain controlling variables.^[19]

Avoidance learning

In avoidance learning an organism's behavior is reinforced by the termination or prevention of an (assumed aversive) stimulus. There are two kinds of commonly used experimental settings: discriminated and free-operant avoidance learning.

Avoidance learning is a type of negative reinforcement in which performing a response prevents an aversive stimulus from occurring in the first place.^[20]

Avoidance prevents an aversive stimulus, while escape terminates one.

The avoidance paradox is the question of how avoiding an aversive stimulus can act as reinforcement when the stimulus never occurs.

Discriminated avoidance learning

In discriminated avoidance learning, a novel stimulus such as a light or a tone is followed by an aversive stimulus such as a shock (CS-US, similar to classical conditioning). During the first trials (called escape-trials) the animal usually experiences both the CS (Conditioned Stimulus) and the US (Unconditioned Stimulus), showing the operant response to terminate the aversive US. During later trials, the animal will learn to perform the response during the presentation of the CS thus preventing the aversive US from occurring. Such trials are called "avoidance trials."

Free-operant avoidance learning

In this experimental procedure, no discrete stimulus is used to signal the occurrence of the aversive stimulus. Rather, the aversive stimuli (mostly shocks) are presented without explicit warning stimuli. Two crucial time intervals determine the rate of avoidance learning. The first is called the S-S interval (shock-shock interval). This is the amount of time that passes between successive presentations of the shock (unless the operant response is performed). The other is called the R-S interval (response-shock interval), which specifies the length of the time interval following an operant response during which no shocks will be delivered. Note that each time the organism performs the operant response, the shock-free R-S interval begins anew.
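The interplay of the S-S and R-S intervals can be made concrete with a short Python sketch that computes when shocks are delivered for a given list of response times. The function name, its parameters, and the assumption that the first shock is due one S-S interval after the session starts are illustrative choices for this example, not details given in the text.

```python
def shock_times(response_times, ss_interval, rs_interval, session_length):
    """Return the times at which shocks are delivered in a free-operant session.

    Without responses, shocks recur every `ss_interval`. Each response
    restarts a shock-free period of `rs_interval`.
    Assumption: the first shock is due one S-S interval after time 0.
    """
    if ss_interval <= 0 or rs_interval <= 0:
        raise ValueError("intervals must be positive")
    responses = sorted(response_times)
    shocks = []
    next_shock = ss_interval
    i = 0
    while True:
        # Every response made before the pending shock restarts the R-S timer.
        while i < len(responses) and responses[i] < next_shock:
            next_shock = responses[i] + rs_interval
            i += 1
        if next_shock > session_length:
            break
        shocks.append(next_shock)
        # After a shock, the S-S timer governs until the next response.
        next_shock += ss_interval
    return shocks


# No responding: a shock every 5 time units.
print(shock_times([], ss_interval=5, rs_interval=20, session_length=30))
# -> [5, 10, 15, 20, 25, 30]

# Regular responding postpones almost every shock.
print(shock_times([3, 20, 38], ss_interval=5, rs_interval=20, session_length=60))
# -> [58]
```

The second call illustrates why steady responding is effective under this procedure: as long as each response arrives within the R-S interval of the previous one, no shock is delivered.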

Two-process theory of avoidance

This theory was originally proposed to explain discriminated avoidance learning, in which an organism learns to avoid an aversive stimulus by escaping from a signal for that stimulus. The theory invokes both operant and classical conditioning to account for avoidance responses (escape).

The theory is illustrated by experiments in which subjects were given a conditioned stimulus followed by an unconditioned stimulus, together with an avoidance task. In the avoidance procedure, an avoidance response is reinforced by the termination of the CS. The CS predicts the arrival of an aversive event.

Criticisms of this theory concern the signs of fear observed in avoidance tasks and the slow extinction of avoidance behavior.

The theory assumes that two processes take place:

a) Classical conditioning of fear.: During the first trials of the training, the organism experiences the pairing of a CS with an aversive US. The theory assumes that during these trials an association develops between the CS and the US through classical conditioning and, because of the aversive nature of the US, the CS comes to elicit a conditioned emotional reaction (CER) – "fear."
b) Reinforcement of the operant response by fear-reduction.: As a result of the first process, the CS now signals fear; this unpleasant emotional reaction serves to motivate operant responses, and those responses that terminate the CS are reinforced by fear termination. Although, after this training, the organism no longer experiences the aversive US, the term "avoidance" may be something of a misnomer, because the theory does not say that the organism "avoids" the US in the sense of anticipating it, but rather that the organism "escapes" an aversive internal state that is caused by the CS.

One-factor Theory

One-factor theory states that only operant conditioning is necessary: avoidance itself can be a reinforcer, so a single factor (operant behavior alone) is sufficient for avoidance.^[21]

Criticism: there is a reduction in response frequency and in the subject's sensitivity due to extinction.

This theory examines how avoidance behavior can still occur when there is no signal for the aversive event. (This also explains the slow extinction of avoidance responses.)

An experiment by Herrnstein and Hineline (1966) is evidence for one-factor theory. When a rat pressed a lever, it switched between schedules that delivered shocks at a rapid rate and at a slower rate. They found that the rats eventually acquired the avoidance response.

This shows that no external stimulus or passage of time needs to serve as a signal for shock.

Four term contingency

Applied behavior analysis, the discipline directly descended from Skinner's work, holds that behavior is explained in four terms: a conditioned stimulus (S^C), a discriminative stimulus (S^d), a response (R), and a reinforcing stimulus (S^rein or S^r for reinforcers, sometimes S^ave for aversive stimuli).^[22]

Operant hoarding

Operant hoarding refers to the choice made by a rat, on a compound schedule called a multiple schedule, that maximizes its rate of reinforcement in an operant conditioning context. More specifically, rats were shown to allow food pellets to accumulate in a food tray by continuing to press a lever on a continuous reinforcement schedule instead of retrieving those pellets. Retrieval of the pellets always instituted a one-minute period of extinction during which no additional food pellets were available but those that had been accumulated earlier could be consumed. This finding appears to contradict the usual finding that rats behave impulsively in situations in which there is a choice between a smaller food object right away and a larger food object after some delay. See schedules of reinforcement.^[23]

Biological correlates of operant conditioning

The first scientific studies identifying neurons that responded in ways suggesting that they encode conditioned stimuli came from work by Mahlon DeLong^[24]^[25] and by R.T. Richardson.^[25] They showed that nucleus basalis neurons, which release acetylcholine broadly throughout the cerebral cortex, are activated shortly after a conditioned stimulus, or after a primary reward if no conditioned stimulus exists. These neurons are equally active for positive and negative reinforcers, and have been demonstrated to cause plasticity in many cortical regions.^[26] Evidence also exists that dopamine is activated at similar times. There is considerable evidence that dopamine participates in both reinforcement and aversive learning.^[27] Dopamine pathways project much more densely onto frontal cortex regions. Cholinergic projections, in contrast, are dense even in posterior cortical regions such as the primary visual cortex. A study of patients with Parkinson's disease, a condition attributed to the insufficient action of dopamine, further illustrates the role of dopamine in positive reinforcement.^[28] It showed that while off their medication, patients learned more readily with aversive consequences than with positive reinforcement. Patients who were on their medication showed the opposite pattern: positive reinforcement proved to be the more effective form of learning when the action of dopamine was high.

Operant conditioning in economics

Both psychologists and economists have become interested in applying operant conditioning concepts and findings to the behavior of humans in the marketplace. One concept shared by economics and instrumental conditioning is consumer demand. With consumer demand, the focus is on the price of the commodity and the amount purchased. The degree to which price influences consumption is defined as the elasticity of demand. Certain commodities are more elastic than others. A price change in certain foods can affect the amount bought, while gasoline and essentials seem to be less affected by price changes. In these examples, gasoline and essentials are less elastic than foods such as cake and candy. On a graph, the demand curve for a less elastic commodity is not stretched out as far as that of a commodity whose consumption fluctuates greatly with price.^[29]
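As a worked illustration of elasticity, the following Python sketch computes the price elasticity of demand as the percentage change in quantity divided by the percentage change in price. It uses the midpoint (arc) formula, which is one common convention chosen here as an assumption; the function name and all prices and quantities are made up for the example and are not data from the cited source.

```python
def arc_elasticity(p1: float, q1: float, p2: float, q2: float) -> float:
    """Price elasticity of demand between two (price, quantity) points.

    Uses the midpoint (arc) convention: each percentage change is taken
    relative to the average of the two values.
    """
    pct_quantity = (q2 - q1) / ((q1 + q2) / 2)
    pct_price = (p2 - p1) / ((p1 + p2) / 2)
    if pct_price == 0:
        raise ValueError("price did not change; elasticity is undefined")
    return pct_quantity / pct_price


# Hypothetical numbers: the same proportional price rise for two commodities.
candy = arc_elasticity(p1=1.00, q1=100, p2=1.50, q2=50)     # purchases drop sharply
gasoline = arc_elasticity(p1=3.00, q1=100, p2=4.50, q2=90)  # purchases barely change

print(f"candy: {candy:.2f}, gasoline: {gasoline:.2f}")
# A magnitude above 1 is conventionally called elastic, below 1 inelastic.
print(abs(candy) > 1, abs(gasoline) < 1)
```

With these invented figures, candy comes out elastic and gasoline inelastic, matching the qualitative contrast drawn in the paragraph above.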

Questions about the law of effect

A number of observations seem to show that operant behavior can be established without reinforcement in the sense defined above. Most cited is the phenomenon of autoshaping (sometimes called "sign tracking"), in which a stimulus is repeatedly followed by reinforcement, and in consequence the animal begins to respond to the stimulus. For example, a response key is lighted and then food is presented. When this is repeated a few times a pigeon subject begins to peck the key even though food comes whether the bird pecks or not. Similarly, rats begin to handle small objects, such as a lever, when food is presented nearby.^[30]^[31] Strikingly, pigeons and rats persist in this behavior even when pecking the key or pressing the lever leads to less food (omission training).^[32]^[33]

These observations and others appear to contradict the law of effect, and they have prompted some researchers to propose new conceptualizations of operant reinforcement (e.g.^[34]^[35]^[36]). A more general view is that autoshaping is an instance of classical conditioning; the autoshaping procedure has, in fact, become one of the most common ways to measure classical conditioning. In this view, many behaviors can be influenced by both classical contingencies (stimulus-reinforcement) and operant contingencies (response-reinforcement), and the experimenter's task is to work out how these interact.^[37]

References

^ Domjan, Michael, Ed., The Principles of Learning and Behavior, Fifth Edition, Belmont, CA: Thomson/Wadsworth, 2003
^ Thorndike, E.L. (1901). "Animal intelligence: An experimental study of the associative processes in animals". Psychological Review Monograph Supplement 2: 1–109.
^ Miltenberger, R. G. "Behavioral Modification: Principles and Procedures". Thomson/Wadsworth, 2008. p. 9.
^ Miltenberger, R. G., & Crosland, K. A. (2014). Parenting. The Wiley Blackwell Handbook of Operant and Classical Conditioning. (pp. 509-531) Wiley-Blackwell. doi:10.1002/9781118468135.ch20
^ Skinner, B. F. "The Behavior of Organisms:An Experimental Analysis", 1938 New York: Appleton-Century-Crofts
^ Skinner, B. F. (1950). "Are theories of learning necessary?". Psychological Review 57: 193–216. doi:10.1037/h0054367.
^ Schacter, Daniel L., Daniel T. Gilbert, and Daniel M. Wegner. "B. F. Skinner: The role of reinforcement and Punishment", subsection in: Psychology; Second Edition. New York: Worth, Incorporated, 2011, 278-288.
^ Ferster, C. B. & Skinner, B. F. "Schedules of Reinforcement", 1957 New York: Appleton-Century-Crofts
^ Staddon, J. E. R; D. T Cerutti (February 2003). "Operant Conditioning". Annual Review of Psychology 54 (1): 115–144. doi:10.1146/annurev.psych.54.101601.145124. Retrieved 23 March 2013.
^ Mecca Chiesa (2004) Radical Behaviorism: The philosophy and the science
^ Skinner, B. F. "Science and Human Behavior", 1953. New York: MacMillan
^ Skinner, B. F. "Verbal Behavior", 1957. New York: Appleton-Century-Crofts
^ Tucker, M.; Sigafoos, J.; Bushell, H. (1998). "Use of noncontingent reinforcement in the treatment of challenging behavior". Behavior Modification 22: 529–547. doi:10.1177/01454455980224005.
^ Poling, A.; Normand, M. (1999). "Noncontingent reinforcement: an inappropriate description of time-based schedules that reduce behavior". Journal of Applied Behavior Analysis 32: 237–238. doi:10.1901/jaba.1999.32-237.
^ Schacter et al. (2011). Psychology, 2nd ed., pp. 280-284.
^ ^a ^b Miltenberger, R. G. "Behavioral Modification: Principles and Procedures". Thomson/Wadsworth, 2008. p. 84.
^ Miltenberger, R. G. "Behavioral Modification: Principles and Procedures". Thomson/Wadsworth, 2008. p. 86.
^ Schultz, Wolfram (1998). "Predictive Reward Signal of Dopamine Neurons". The Journal of Neurophysiology 80 (1): 1–27.
^ Neuringer, A (2002). "Operant variability: Evidence, functions, and theory". Psychonomic Bulletin & Review 9 (4): 672–705. doi:10.3758/bf03196324.
^ Mazur, J. E. (2013). Learning and behavior. (7th ed). Pearson Education Inc.
^ Herrnstein, R.J.; Hineline, P. N. (1966). "Negative reinforcement as shock-frequency reduction". Journal of the Experimental Analysis of Behavior 9: 421–430. doi:10.1901/jeab.1966.9-421.
^ Pierce & Cheney (2004) Behavior Analysis and Learning
^ Cole, M.R. (1990). "Operant hoarding: A new paradigm for the study of self-control". Journal of the Experimental Analysis of Behavior 53: 247–262. doi:10.1901/jeab.1990.53-247.
^ "Activity of pallidal neurons during movement", M.R. DeLong, J. Neurophysiol., 34:414–27, 1971
^ Richardson RT, DeLong MR (1991): Electrophysiological studies of the function of the nucleus basalis in primates. In Napier TC, Kalivas P, Hanin I (eds), The Basal Forebrain: Anatomy to Function (Advances in Experimental Medicine and Biology, vol. 295). New York, Plenum, pp. 232–252
^ PNAS 93:11219-24 1996, Science 279:1714–8 1998
^ Neuron 63:244–253, 2009, Frontiers in Behavioral Neuroscience, 3: Article 13, 2009
^ Michael J. Frank, Lauren C. Seeberger, and Randall C. O'Reilly (2004) "By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism", Science, 4 November 2004
^ Domjan, M. (2009). The Principles of Learning and Behavior. Wadsworth Publishing Company. 6th Edition. pages 244-249.
^ Timberlake, W (1983). "Rats' responses to a moving object related to food or water: A behavior-systems analysis". Animal Learning & Behavior 11 (3): 309–320. doi:10.3758/bf03199781.
^ Neuringer, A.J. (1969). "Animals respond for food in the presence of free food". Science 166: 399–401. doi:10.1126/science.166.3903.399.
^ Williams, D.R.; Williams, H. (1969). "Auto-maintenance in the pigeon: sustained pecking despite contingent non-reinforcement". J. Exper. Analys. of Behav 12: 511–520. doi:10.1901/jeab.1969.12-511.
^ Peden, B.F.; Brown, M.P.; Hearst, E. (1977). "Persistent approaches to a signal for food despite food omission for approaching". Journal of Experimental Psychology: Animal Behavior Processes 3 (4): 377–399. doi:10.1037/0097-7403.3.4.377.
^ Gardner, R.A.; Gardner, B.T. (1988). "Feedforward vs feedbackward: An ethological alternative to the law of effect". Behavioral and Brain Sciences 11: 429–447. doi:10.1017/s0140525x00058258.
^ Gardner, R. A. & Gardner B.T. (1998) The structure of learning from sign stimuli to sign language. Mahwah NJ: Lawrence Erlbaum Associates.
^ Baum, W. M. (2012). "Rethinking reinforcement: Allocation, induction and contingency". Journal of the Experimental Analysis of Behavior 97: 101–124. doi:10.1901/jeab.2012.97-101.
^ Locurto, C. M., Terrace, H. S., & Gibbon, J. (1981) Autoshaping and conditioning theory. New York: Academic Press.

1. Staddon, J. E. R. & Cerutti, D. T. (2003) Operant behavior. Annual Review of Psychology, 54:115-14
2. Kalat, J. (2013). Introduction to Psychology (10th ed.). Cengage Learning.
3. Elmes, D. (2011). Research Methods in Psychology (9th ed.). Cengage Learning.
4. Boyd, D. (2014). Lifespan Development (7th ed.). Cengage Learning.
5. Myers, D. (2011). Psychology (10th ed.). Cengage Learning.
6. Ormrod, J. (2011). Human Learning (6th ed.). Pearson.
7. Skinner, B.F. (1953). Science and Human Behavior. New York: Macmillan.

[www.usu.edu/teachall/text/behavior/LRBIpdfs/Positive.pdf]
[www.cehd.umn.edu/ceed/publications/tipsheets/preschoolbehaviortipsheets/posrein.pdf]

UpToDate Contents

1. Specific learning disabilities in children: Clinical features
2. Specific learning disabilities in children: Role of the primary care provider
3. Specific learning disabilities in children: Educational management
4. Specific learning disabilities in children: Evaluation
5. Definitions of specific learning disability and laws pertaining to learning disabilities

English Journal

Video feedforward for rapid learning of a picture-based communication system.

Smith J, Hand L, Dowrick PW. Department of Psychology, University of Auckland, PB 92019, Auckland, New Zealand, jems.smith@gmail.com.
Abstract: This study examined the efficacy of video self modeling (VSM) using feedforward, to teach various goals of a picture exchange communication system (PECS). The participants were two boys with autism and one man with Down syndrome. All three participants were non-verbal with no current functional system of communication; the two children had long histories of PECS failure. A series of replications, with different length baselines, was used to examine whether video self modeling could replace the PECS method of teaching to achieve the same goals. All three participants showed rapid learning of their target behavior when introduced to their self modeling videos, and effects generalized without the need for further intervention. We conclude that VSM, using feedforward, can provide a fast, simple way of teaching the use of a picture-based communication system without the need for prompts or intensive operant conditioning. VSM may provide an accessible, easy-to-use alternative to common methods of teaching augmentative and alternative communication systems.
Journal of Autism and Developmental Disorders (J Autism Dev Disord). 2014 Apr;44(4):926-36. doi: 10.1007/s10803-013-1946-0.
PMID 24068486

Operant conditioning of autobiographical memory retrieval.

Debeer E, Raes F, Williams JM, Craeynest M, Hermans D. Department of Psychology, University of Leuven, Leuven, Belgium.
Abstract: Functional avoidance is considered one of the key mechanisms underlying overgeneral autobiographical memory (OGM). According to this view, OGM is regarded as a learned cognitive avoidance strategy, based on principles of operant conditioning; i.e., individuals learn to avoid the emotionally painful consequences associated with the retrieval of specific negative memories. The aim of the present study was to test one of the basic assumptions of the functional avoidance account, namely that autobiographical memory retrieval can be brought under operant control. Here 41 students were instructed to retrieve personal memories in response to 60 emotional cue words. Depending on the condition, they were punished with an aversive sound for the retrieval of specific or nonspecific memories in an operant conditioning procedure. Analyses showed that the course of memory specificity significantly differed between conditions. After the procedure, participants punished for nonspecific memories retrieved significantly more specific memories compared to participants punished for specific memories. However, whereas memory specificity significantly increased in participants punished for specific memories, it did not significantly decrease in participants punished for nonspecific memories. Thus, while our findings indicate that autobiographical memory retrieval can be brought under operant control, they do not support a functional avoidance view on OGM.
Memory (Hove, England). 2014 Apr;22(3):171-83. doi: 10.1080/09658211.2013.774419. Epub 2013 Feb 28.
PMID 23445114

Effects of environmental enrichment on the behavior of shelter dogs.

Herron ME, Kirby-Madden TM, Lord LK. Department of Veterinary Clinical Sciences, College of Veterinary Medicine, The Ohio State University, Columbus, OH 43210.
Abstract: Objective: To determine the effect of food-toy enrichment combined with cage-behavior training on desirable behaviors in shelter dogs and adoption rates. Design: Randomized controlled clinical trial. Animals: 107 dogs. Procedures: Dogs placed up for adoption in a municipal shelter were randomly assigned to either an experimental group (n = 48) or control group (n = 59). Experimental group subjects were exposed to an environmental enrichment and training protocol consisting of twice-daily cage-behavior training and daily provision of a food-filled toy. Cage-behavior training included operant conditioning via positive reinforcement of desirable behaviors, including approaching the front of the cage, sitting or lying, and remaining quiet when approached. Behavioral observations were performed by a blinded observer in a scan-sampling technique on day 0 (first day on adoption floor) and again on day 3 for experimental (n = 26) and control (n = 32) dogs. Body posture, location in cage, and other behavioral parameters were recorded. Adoption information and behavioral observation data were compared between groups. Results: Compared with the control group, the experimental group had a significantly greater percentage of dogs with an increase in desirable behaviors of sitting or lying down (17/26 [65%] vs 7/32 [22%]) and being quiet (9/26 [35%] vs 4/32 [13%]) and a significantly greater percentage of dogs with a decrease in the undesirable behavior of jumping (15/26 [57%] vs 3/32 [9%]). Location in cage, fearfulness, and eye contact were not significantly different between groups. Survival analysis revealed no significant difference in adoption rates between groups.
Conclusions and Clinical Relevance: Results suggested that enrichment programs improve desirable behaviors and decrease undesirable behavior in shelter dogs, which may enhance welfare.
Journal of the American Veterinary Medical Association (J Am Vet Med Assoc). 2014 Mar 15;244(6):687-92. doi: 10.2460/javma.244.6.687.
PMID 24568110

Japanese Journal

Role of incentive learning in reduction of instrumental performance in rats (Rattus norvegicus) by irrelevant drive state

IGUCHI YOSHIO
動物心理学研究 64(1), 11-18, 2014
NAID 40020154243

Role of incentive learning in reduction of instrumental performance in rats (Rattus norvegicus) by irrelevant drive state

IGUCHI YOSHIO
動物心理学研究, 2014
… Previous studies have demonstrated that the thirst drive interferes with the hunger drive in an animal's performance of motivated behavior: water deprivation causes a reduction in food-reinforced instrumental performance of food-deprived rats. … Here, we examined whether this type of reinforcer devaluation effect would require the animal's consummatory contact with the reinforcer under the irrelevant drive (i.e., incentive learning). …
NAID 130003393348

〈Articles〉Assessment of Self Efficacy in an EFL Context

Borrows Lance
生駒経済論叢 11(2), 155-180, 2013-11
… This investigation will seek to determine if significant changes occur in five affective factors, EFL self-efficacy, language anxiety, integrative motivation, willingness to communicate (WTC), and instrumental motivation. … These constructs were selected as it is hypothesized they will support continued learning and an elevated trajectory of proficiency and motivation in the future for SA participants in comparison to non-SA participants. …
NAID 120005368777

「道具的学習」 (instrumental learning)

English: instrumental learning
Related: オペラント条件づけ (operant conditioning), 道具的学習法 (instrumental learning method)

「道具的学習法」 (instrumental learning method)

English: instrumental learning
Related: 道具的学習 (instrumental learning)

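「instrument」

n.

tool, implement, apparatus, gauge; musical instrument; means; legal document; security (as a legal document), (promissory) note

v.

to fit (with an instrument)
Related: apparatus, device, equipment, fit, instrumental, machine, tool, utensil, wear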

「instrumental」

adj.

of instruments; serving as a means; helpful

Related: avail, available, avenue, equipment, gateway, help, helpful, instrument, mean, means, policy, serve, serviceable, subserve, tool, useful, valuable

[1] Domjan, Michael, Ed., The Principles of Learning and Behavior, Fifth Edition, Belmont, CA: Thomson/Wadsworth, 2003

[2] Thorndike, E.L. (1901). "Animal intelligence: An experimental study of the associative processes in animals". Psychological Review Monograph Supplement 2: 1–109.

[3] Miltenberger, R. G. "Behavioral Modification: Principles and Procedures". Thomson/Wadsworth, 2008. p. 9.

[parenting-4] Miltenberger, R. G., & Crosland, K. A. (2014). Parenting. The wiley blackwell handbook of operant and classical conditioning. (pp. 509-531) Wiley-Blackwell. doi:10.1002/9781118468135.ch20

[5] Skinner, B. F. "The Behavior of Organisms:An Experimental Analysis", 1938 New York: Appleton-Century-Crofts

[6] Skinner, B. F. (1950). "Are theories of learning necessary?". Psychological Review 57: 193–216. doi:10.1037/h0054367.

[7] Schacter, Daniel L., Daniel T. Gilbert, and Daniel M. Wegner. "B. F. Skinner: The role of reinforcement and Punishment", subsection in: Psychology; Second Edition. New York: Worth, Incorporated, 2011, 278-288.

[8] Ferster, C. B. & Skinner, B. F. "Schedules of Reinforcement", 1957 New York: Appleton-Century-Crofts

[9] Staddon, J. E. R; D. T Cerutti (February 2003). "Operant Conditioning". Annual Review of Psychology 54 (1): 115–144. doi:10.1146/annurev.psych.54.101601.145124. Retrieved 23 March 2013.

[10] Mecca Chiesa (2004) Radical Behaviorism: The philosophy and the science

[11] Skinner, B. F. "Science and Human Behavior", 1953. New York: MacMillan

[12] Skinner, B. F. "Verbal Behavior", 1957. New York: Appleton-Century-Crofts

[13] Tucker, M.; Sigafoos, J.; Bushell, H. (1998). "Use of noncontingent reinforcement in the treatment of challenging behavior". Behavior Modification 22: 529–547. doi:10.1177/01454455980224005.

[14] Poling, A.; Normand, M. (1999). "Noncontingent reinforcement: an inappropriate description of time-based schedules that reduce behavior". Journal of Applied Behavior Analysis 32: 237–238. doi:10.1901/jaba.1999.32-237.

[15] Schacter et al.2011 Psychology 2nd ed. pg.280-284 Reference for entire section Principles version 130317

[Miltenberger84-16] Miltenberger, R. G. "Behavioral Modification: Principles and Procedures". Thomson/Wadsworth, 2008. p. 84.

[17] Miltenberger, R. G. "Behavioral Modification: Principles and Procedures". Thomson/Wadsworth, 2008. p. 86.

[18] Schultz, Wolfram (1998). "Predictive Reward Signal of Dopamine Neurons". The Journal of Neurophysiology 80 (1): 1–27.

[19] Neuringer, A (2002). "Operant variability: Evidence, functions, and theory". Psychonometric Bulletin & Review 9 (4): 672–705. doi:10.3758/bf03196324.

[20] Mazur, J. E. (2013). Learning and behavior. (7th ed). Pearson Education Inc.

[21] Herrnstein, R.J.; Hineline, P. N. (1966). "Negative reinforcement as shock-frequency reduction". Journal of the Experimental Analysis of Behavior 9: 421–430. doi:10.1901/jeab.1966.9-421.

[22] Pierce & Cheney (2004) Behavior Analysis and Learning

[23] Cole, M.R. (1990). "Operant hoarding: A new paradigm for the study of self-control". Journal of the Experimental Analysis of Behavior 53: 247–262. doi:10.1901/jeab.1990.53-247.

[24] "Activity of pallidal neurons during movement", M.R. DeLong, J. Neurophysiol., 34:414–27, 1971

[RTR.26MRD-25] Richardson RT, DeLong MR (1991): Electrophysiological studies of the function of the nucleus basalis in primates. In Napier TC, Kalivas P, Hamin I (eds), The Basal Forebrain: Anatomy to Function (Advances in Experimental Medicine and Biology, vol. 295. New York, Plenum, pp. 232–252

[26] PNAS 93:11219-24 1996, Science 279:1714–8 1998

[27] Neuron 63:244–253, 2009, Frontiers in Behavioral Neuroscience, 3: Article 13, 2009

[28] Michael J. Frank, Lauren C. Seeberger, and Randall C. O'Reilly (2004) "By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism," Science 4, November 2004

[29] Domjan, M. (2009). The Principles of Learning and Behavior. Wadsworth Publishing Company. 6th Edition. pages 244-249.

[30] Timberlake, W (1983). "Rats' responses to a moving object related to food or water: A behavior-systems analysis". Animal Learning & Behavior 11 (3): 309–320. doi:10.3758/bf03199781.

[31] Neuringer, A.J. (1969). "Animals respond for food in the presence of free food". Science 166: 399–401. doi:10.1126/science.166.3903.399.

[32] Williams, D.R.; Williams, H. (1969). "Auto-maintenance in the pigeon: sustained pecking despite contingent non-reinforcement". J. Exper. Analys. of Behav 12: 511–520. doi:10.1901/jeab.1969.12-511.

[33] Peden, B.F.; Brown, M.P.; Hearst, E. (1977). "Persistent approaches to a signal for food despite food omission for approaching.". Journal of Experimental Psychology: Animal Behavior Processes 3 (4): 377–399. doi:10.1037/0097-7403.3.4.377.

[34] Gardner, R.A.; Gardner, B.T. (1988). "Feedforward vs feedbackward: An ethological alternative to the law of effect". Behavioral and Brain Sciences 11: 429–447. doi:10.1017/s0140525x00058258.

[35] Gardner, R. A. & Gardner B.T. (1998) The structure of learning from sign stimuli to sign language. Mahwah NJ: Lawrence Erlbaum Associates.

[36] Baum, W. M. (2012). "Rethinking reinforcement: Allocation, induction and contingency". Journal of the Experimental Analysis of Behavior 97: 101–124. doi:10.1901/jeab.2012.97-101.

[37] Locurto, C. M., Terrace, H. S., & Gibbon, J. (1981) Autoshaping and conditioning theory. New York: Academic Press.

Linked from: 「道具的学習」 (instrumental learning), 「道具的学習法」 (instrumental learning method)
Related articles: 「instrument」, 「instrumental」
