번역가 김홍식의 블로그입니다: reinforcement, behavior trap

자료: Wilipedia, http://en.wikipedia.org/wiki/Reinforcement

※ 중간 발췌:

Natural and artificial reinforcement

In his 1967 paper, Arbitrary and Natural Reinforcement, Charles Ferster proposed that reinforcement can be classified into events which increase the frequency of an operant as a natural consequence of the behavior itself, and those which are presumed to affect frequency by their requirement of human mediation, such as in a token economy where subjects are "rewarded" for certain behavior with an arbitrary token of a negotiable value. In 1970, Baer and Wolf created a name for the use of natural reinforcers called behavior traps.^[6] A behavior trap is one in which only a simple response is necessary to enter the trap, yet once entered, the trap cannot be resisted in creating general behavior change. It is the use of a behavioral trap that will increase one's repertoire by exposing a person to the naturally occurring reinforcement of that behavior. Behavior traps have four characteristics:

They are "baited" with virtually irresistible reinforcers that "lure" the student to the trap
Only a low-effort response already in the repertoire is necessary to enter the trap
Interrelated contingencies of reinforcement inside the trap motivate the person to acquire, extend, and maintain targeted academic/social skills^[7]
they can remain effective for long time because the person shows few, if any, satiation effects.

As can be seen from the above, artificial reinforcement is created to build or develop skills, and to generalize, it is important that either a behavior trap is introduced to 'capture' the skill and utilize naturally occurring reinforcement to maintain or increase it. This behavior trap may simply be a social situation that will generally result from a specific behavior once it has met a certain criterion (ex: if you use edible reinforcers to train a person to say hello and smile at people when they meet them, after that skill has been built up, the natural reinforcer of other people smiling, and having more friendly interactions will naturally reinforce the skill and the edibles can be faded).^[8]

.................................

※ 첫 부분:

For the construction materials reinforcement, see Rebar.

For reinforcement learning in computer science, see Reinforcement learning.

In operant conditioning, reinforcement occurs when an event following a response causes an increase in the probability of that response occurring in the future. Response strength can be assessed by measures such as the frequency with which the response is made (for example, a pigeon may peck a key more times in the session), or the speed with which it is made (for example, a rat may run a maze faster). The environment change contingent upon the response is called a reinforcer.

[edit]Types of reinforcement

B.F. Skinner, the researcher who articulated the major theoretical constructs of reinforcement and behaviorism, refused to specify causal origins of reinforcers. Skinner argued that reinforcers are defined by a change in response strength (that is, functionally rather than causally), and that which is a reinforcer to one person may not be to another. Accordingly, activities, foods or items which are generally considered pleasant or enjoyable may not necessarily be reinforcing; they can only be considered so if the behavior that immediately precedes the potential reinforcer increases in similar future situations. If a child receives a cookie when he or she asks for one, and the frequency of 'cookie-requesting behavior' increases, the cookie can be seen as reinforcing 'cookie-requesting behavior'. If however, cookie-requesting behavior does not increase, the cookie cannot be considered reinforcing. The sole criterion which can determine if an item, activity or food is reinforcing is the change in the probability of a behavior after the administration of a potential reinforcer. Other theories may focus on additional factors such as whether the person expected the strategy to work at some point, but a behavioral theory of reinforcement would focus specifically upon the probability of the behavior.

The study of reinforcement has produced an enormous body of reproducible experimental results. Reinforcement is the central concept and procedure in the experimental analysis of behavior and much of quantitative analysis of behavior.

Positive reinforcement is an increase in the future frequency of a behavior due to the addition of a stimulus immediately following a response. Giving (or adding) food to a dog contingent on its sitting is an example of positive reinforcement (if this results in an increase in the future behavior of the dog sitting).
Negative reinforcement is an increase in the future frequency of a behavior when the consequence is the removal of anaversive stimulus. Turning off (or removing) an annoying song when a child asks their parent is an example of negative reinforcement (if this results in an increase in asking behavior of the child in the future).
- Avoidance conditioning is a form of negative reinforcement that occurs when a behavior prevents an aversive stimulus from starting or being applied.

Skinner discusses that while it may appear so, Punishment is not the opposite of reinforcement. Rather, it has some other effects as well as decreasing undesired behavior.

	decreases likelihood of behavior	increases likelihood of behavior
presented	positive punishment	positive reinforcement
taken away	negative punishment	negative reinforcement

Distinguishing "positive" from "negative" can be difficult, and the necessity of the distinction is often debated^[1]. For example, in a very warm room, a current of external air serves as positive reinforcement because it is pleasantly cool or negative reinforcement because it removes uncomfortably hot air^[2]. Some reinforcement can be simultaneously positive and negative, such as a drug addict taking drugs for the added euphoria and eliminating withdrawal symptoms. Many behavioral psychologists simply refer to reinforcement or punishment—without polarity—to cover all consequent environmental changes.

[edit]Primary reinforcers

A primary reinforcer, sometimes called an unconditioned reinforcer, is a stimulus that does not require pairing to function as a reinforcer and most likely has obtained this function through the evolution and its role in species' survival^[3]. Examples of primary reinforcers include sleep, food, air, water, and sex. Other primary reinforcers, such as certain drugs, may mimic the effects of other primary reinforcers. While these primary reinforcers are fairly stable through life and across individuals, the reinforcing value of different primary reinforcers varies due to multiple factors (e.g., genetics, experience). Thus, one person may prefer one type of food while another abhors it. Or one person may eat lots of food while another eats very little. So even though food is a primary reinforcer for both individuals, the value of food as a reinforcer differs between them.

Often primary reinforcers shift their reinforcing value temporarily through satiation and deprivation. Food, for example, may cease to be effective as a reinforcer after a certain amount of it has been consumed (satiation). After a period during which it does not receive any of the primary reinforcer (deprivation), however, the primary reinforcer may once again regain its effectiveness in increasing response strength.

[edit]Secondary reinforcers

A secondary reinforcer, sometimes called a conditioned reinforcer, is a stimulus or situation that has acquired its function as a reinforcer after pairing with a stimulus which functions as a reinforcer. This stimulus may be a primary reinforcer or another conditioned reinforcer (such as money). An example of a secondary reinforcer would be the sound from a clicker, as used inclicker training. The sound of the clicker has been associated with praise or treats, and subsequently, the sound of the clicker may function as a reinforcer. As with primary reinforcers, an organism can experience satiation and deprivation with secondary reinforcers. (중략)

Criticisms

The standard definition of behavioral reinforcement has been criticized as circular, since it appears to argue that response strength is increased by reinforcement while defining reinforcement as something which increases response strength; that is, the standard definition says only that response strength is increased by things which increase response strength. However, the correct usage^[10] of reinforcement is that something is a reinforcer because of its effect on behavior, and not the other way around. It becomes circular if one says that a particular stimulus strengthens behavior because it is a reinforcer, and should not be used to explain why a stimulus is producing that effect on the behavior. Other definitions have been proposed, such as F. D. Sheffield's "consummatory behavior contingent on a response," but these are not broadly used in psychology.^[11]

[edit]History of the terms

In the 1920s Russian physiologist Ivan Pavlov may have been the first to use the word reinforcement with respect to behavior, but (according to Dinsmoor) he used its approximate Russian cognate sparingly, and even then it referred to strengthening an already-learned but weakening response. He did not use it, as it is today, for selecting and strengthening new behavior. Pavlov's introduction of the word extinction (in Russian) approximates today's psychological use.

In popular use, positive reinforcement is often used as a synonym for reward, with people (not behavior) thus being "reinforced," but this is contrary to the term's consistent technical usage, as it is a dimension of behavior, and not the person, which is strengthened. Negative reinforcement is often used by laypeople and even social scientists outside psychology as a synonym forpunishment. This is contrary to modern technical use, but it was B. F. Skinner who first used it this way in his 1938 book. By 1953, however, he followed others in thus employing the word punishment, and he re-cast negative reinforcement for the removal of aversive stimuli.

There are some within the field of behavior analysis^[12] who have suggested that the terms "positive" and "negative" constitute an unnecessary distinction in discussing reinforcement as it is often unclear whether stimuli are being removed or presented. For example, Iwata^[13] poses the question: “…is a change in temperature more accurately characterized by the presentation of cold (heat) or the removal of heat (cold)?” (p. 363). Thus, it may be best to conceptualize reinforcement simply as a pre-change condition being replaced by a post-change condition which reinforces the behavior which was followed by the change in stimulus conditions.

번역가 김홍식의 블로그입니다

페이지

2009년 5월 6일 수요일

reinforcement, behavior trap

Natural and artificial reinforcement

Contents

[edit]Types of reinforcement

[edit]Primary reinforcers

[edit]Secondary reinforcers

Criticisms

[edit]History of the terms

[edit]See also

[edit]

댓글 없음:

댓글 쓰기

개인정보 Profile

사는동네 Categories

지난주 페이지뷰

가장 많이 본 글

지식 검색 Knowledge Search

즐겨찾는 글동네 Favorite Ideas

동네일지 Archives