7/27/2019 Psicologia Traduzir
1/26
On-line Readings in Testing and Assessment
Persona l i ty
Nico Smid
January 2010
Summary
Personality is described in this reading from a psychometric perspective. It is defined
here as referring to the publicly observable regularities in behaviours that are both
lasting and typical for individuals and by which we tell individuals apart. The Five
Factor Model of personality (FFM) is put forward as the integrative framework for
personality assessment. Personality questionnaires are described as the primary
assessment tools. Methodical problems like differences between self appraisals andappraisals by others as well as ways to counter dishonest responding are discussed.
The growing importance of modern test theory in the form of Item Response Theory
(IRT) is stressed. Finally, the predictive power of personality for real life decisions is
discussed as well as the limits put on this by culture, situations and self-monitoring
ability.
7/27/2019 Psicologia Traduzir
2/26
Introduction
Sometimes, when giving a lecture or a workshop on personality, I start by asking the
audience to give a definition of Personality. After a lot of hesitations and uh, uh.,
quite a number of different descriptions follow. Often these are vague descriptions
like character or what you are like, which are quite different from each other. This
generally amounts to a dramatic demonstration of the ease with which people use an
everyday concept without ever checking whether they have a clear idea of it or
whether they agree with others regarding its content. This of course then creates a
fruitful basis for a willingness on the part of the audience to discuss the concept in
depth.
With this anecdotal experience in mind I will start by defining what will be covered in
this reading on personality and what will not.
First and foremost, I will restrict myself to normal personality. Of course, a lot of
personality research and application is done in so-called clinical contexts. That is to
say, working with individuals who have serious problems in adapting to their
environment or who are seen by others as being a serious problem in order to
understand their characteristic feelings and behaviours and to change them for the
better.
Normal personality, however, concerns itself with the normal variations in behaviour
we see between individuals who have no serious adaptive problems. The word
variations in the last sentence adds a second restriction to what will be covered. I
will focus on individual differences. That is to say, I will restrict myself to a so-called
psychometric perspective on personality. After all, this is a contribution to a website
of the International Test Commission (ITC). So, personality testing is the primary
focus. Testing should be understood here, however, in a broad sense. It refers to
any procedure that tries to assess individual differences in observable behaviour in a
reliable way in order to predict a relevant external criterion.
The following aspects will be covered:
Definition of personality
Structure of personality
Assessing personality: models and specific issues
Practical use in prediction
7/27/2019 Psicologia Traduzir
3/26
Current issues in measurement and application.
Personality defined
In discussions following the above mentioned question to define personality,
consensus is quickly reached that the prime focus needs to be on observable
behaviour. Of course, people have feelings; they have intentions and are more or
less motivated to do things. But you cannot observe such feelings, intentions or
motivations directly. You have to infer them either from believing direct self-reports
like I feel, I want to , or from observing things people do , their behaviours.
Therefore, the domain of personality-relevant information is restricted here to publicly
observable behaviour.
A second restriction concerns what aspects of behaviour are attended to. An
important aspect of the personality concept is that it refers to regularities in
behaviour which are typical and lasting for an individual. That is to say, behaviours
that set one individual apart as different from others in the eyes of external
observers, not only now but also tomorrow and the day after. And not only that, but
also behaviours which are important to take into account in our daily interactions with
each other. So-called socially-relevant behaviours. Behaviours other people find
worthwhile to attend to and to react upon.
Already more than half a century ago, the American psychologist Raymond Cattell
referred to the foregoing as the sedimentation hypothesis: each typical behavioural
aspect of an individual that is socially-relevant has its sediment in everyday
language in the form of a personality descriptive adjective like talkative, orderly,
and so on. Thus, the domain of observable personality data should coincide with the
total set of personality descriptive adjectives in everyday language.
The question then arises if it is possible to reliably define this set of adjectives. And
the answer is affirmative: yes, it is. It is straightforward to do this, and has therefore
been extensively done, not only in the American language and culture but also in
other languages and cultures all around the globe. It works like this. From a standard
dictionary, identify all the adjectives that can possibly be applied to an individual, and
which refer to observable behaviour. Then the next critical step follows. Ask a
relevant sample of individuals from the language group from which the adjectives
7/27/2019 Psicologia Traduzir
4/26
were identified to evaluate each adjective to determine whether it fits in the following
sentence: (s)he is by nature or (s)he is a person. Though
the latter sentence often lends itself to more adjectives being included than when the
former sentence is used, what is most remarkable is that different people show high
agreement among themselves whether an adjective fits or not. Consequently, this
methodology makes it possible to end up with a clearly defined set of relevant
personality differences represented in a neat list of 1000 to 2000 personality
descriptive adjectives, depending on the language you have used. Subsequent
research has shown that such a list, when carefully compiled, comprehensively
defines the personality domain. And what is more, the basic structure of this domain
is far more common than differences between languages or cultures. This is what
will be discussed next.
Structure of Personality: the Five Factor Model (FFM)
F i v e b as i c p e r s o n a l i t y f a c t o r s
Having compiled a list of say some 1200 personality descriptive adjectives (which is
about the number of such adjectives in the Dutch language), structuring the overlap
is the next step. Not all adjectives are unique. Agreeable and pleasant, for
instance, convey almost the same information about a person. Factor analysis or
meaning similarity analyses are among the techniques that can be used to reduce
the total set of adjectives into a number of independent basic factors.
To this end, large samples of participants can be requested to rate themselves on
the total list. Furthermore, other participants can be asked to rate people that they
know well. The resulting data both the self ratings and the ratings of others can
be reduced via factor analysis to a stable set of main factors. A further technique is
to ask participants to directly compare pairs of adjectives in terms of the extent to
which they are similar in meaning. The data from these comparisons are then
analyzed and summarized into underlying common meaning scales.
The results of both kinds of analyses factor analyses and meaning similarity
have turned out to be so consistently similar worldwide that the resultant model has
gained the status of a commonly accepted overall personality model in both the
scientific and the applied practical community. It is called the Five Factor Model, FFM
for short (McCrae & Costa, 2003).
7/27/2019 Psicologia Traduzir
5/26
These five factors might sometimes have somewhat different names but they all
convey similar meanings. Here, the following labels (including a short explanation)
will be used:
Need for Stability (N)
Differences in the extent to which people react emotionally to setbacks
Extraversion (E)
Differences in the extent to which people actively maintain contact with others
Openness (O)
Differences in the extent to which people look for new experiences and new
ideas
Accommodation (A)
Differences in the extent to which people place other peoples interests above
their own
Conscientiousness (C)
Differences in the extent to which people behave in an organized and purposeful
manner.
A h u m an u n i v er s al ?
The above is of course a very general and highly aggregated level of description. But
it is at the same time the core that has generally been found for the human species.
This should not come as a surprise. Well founded evolutionary theory asserts that
humans are quite a homogeneous species not only in genetic makeup but also in theenvironment in which they until very recently have evolved: hunters and gatherers in
small groups in a deprived and often hostile environment. Differences in the FFM
might be expected then to account for differences in survival value, and might thus
lead to the evolution of stable individual differences that matter both socially and
environmentally. To paraphrase a little (as well as giving a different rank order from
the list above, distinguishing between social and environmental control),
someones standing on each of the five factors of the FFM is informative to the
extent to which one may solve an important survival problem, such as:
Social control :
Accommodation : To what extent may I trust others or should I be guarded
regarding their intentions?
Extraversion : How well can I continue to take the initiative in my contacts with
other people, or would it be better to wait for them to take the first step?
7/27/2019 Psicologia Traduzir
6/26
Environmental control :
Conscientiousness : How well do I control my external environment, or does the
environment control me?
Need for stability : How well do I stay in control when unexpected setbacks
happen?
Openness : To what extent should I gather new information to stay in control?
Indeed, genetic and evolutionary psychologists (e.g., Buss, 2005) have presented
considerable evidence regarding the genetic as well as the evolutionary base of the
FFM. Other scholars have shown that it is highly common to derive a FFM structure
from answers to personality questionnaires based on it as, for instance, the NEO
(Costa & McCrae, 2003).
O n e f a c t o r , s e v e r al u n d e r l y i n g f a c et s
As far as the above mentioned high level of aggregation is concerned, in practical
applications, each of the FFM factors often split up into homogeneous sub-factors,
mostly called facets. This is the for instance the case with the above-mentioned
NEO questionnaire. As a concrete example from another questionnaire, the
Reflector Big Five Personality (RBFP) (Schakel, Smid, & Jaganjac, 2007), the factor
Need for stability is subdivided into the following homogeneous facets:
Sensitiveness : the extent to which people worry about themselves.
Intensity : the ease with which people get angry Interpretation : the extent to which people emphasize problems above solutions
Rebound time : how much time people need to rebound from setbacks
Reticence : the extent to which people feel uneasy in a group.
And so on for each of the five factors within the FFM.
The facets within a single factor hang together quite well individuals who score
high one facet have a more than average chance to also score high on another from
the set but far from perfectly so. Therefore, a more fine-grained description may be
provided of differences between people who have approximately the same position
on the overall factor of, say, Need for stability , but still show a different pattern on the
underlying facets. Taking such differences in facets into account should enable a
better prediction of, for instance, performance in jobs. And empirical evidence
suggests that this in fact is the case (Schakel & Smid, 2005).
7/27/2019 Psicologia Traduzir
7/26
7/27/2019 Psicologia Traduzir
8/26
such adjectives are included in the analysis, a sixth factor generally emerges. This
factor is commonly referred to by terms like integrity or sincerity (Ashton et al.,
2004). This finding does not mean, however, that the FFM is shown to be incorrect
but only that apart from what people objectively observe, they separately also
ascribe good or bad moral intentions to others. This is fruitful knowledge in its own
right but it refers primarily to the way the observer arrives at a moral judgment and
not to a direct description of observable behaviour.
The third line of research is akin to the latter. Here, the structure of personality in
non-western cultures is investigated. Also here, especially in Asian cultures, a sixth
factor integrity or other-directedness has been established (Cheung et al., 2003).
Again, this factor might be derived on a similar basis to the second line of research
discussed above. Within the boundaries of a culture, all statements are sampled that
might be attributed to people by members of the culture, without however explicitlydistinguishing by design between observable behaviours and ones that can only be
inferred, as was done when constructing the FFM. What is clear from this research is
that in the Asian and other, for instance, African (Meiring et al., 2005) cultures
moral attributions are important aspects within social relations. But these are also
partially grounded on observations of behaviour that form a separate domain in their
own right.
D o n t m i s t a k e an i n f e r e n c e f o r a n o b s e r v a t i o n
It may be concluded then from the above that it is a reasonable strategy to restrict
the data for investigating personality to the set of observable and stable personality
adjectives. Under such a restriction, the FFM stands out as the worldwide commonly
applicable core of general personality factors. At the same time, it is true that people
ascribe all sorts of moral intentions to other people, partly based on observations
and partly on their own moral ideas, which may be empirically summarized by terms
like honest or sincere. This is not a personality description, though, but an
observer inference. The FFM, however, is about summarizing observations of
behaviour and not about interpreting them.
7/27/2019 Psicologia Traduzir
9/26
How to assess personality
P er s o n a l i t y a s t h e c o m m o n p e r s p e c t i v e o f o t h e r s
Having defined the domain of data that is used to study personality, the next
question that needs to be addressed is how to assess the personality of individuals.Personality testing is the core business of the psychometric perspective on
personality.
Remember that the basic domain refers to observations of behavioural regularities.
Therefore, personality enacts itself in the public domain. The most straightforward
way to assess someones personality is then of course the average appraisal made
by all knowledgeable persons. How should one interpret knowledgeable?
When the objective is to summarize as accurately as possible the typical and lasting
aspects of an individuals behaviour also often called: the gist of behavior who
then is in the best position to do so? In principle the individual him- or her-self, since
(s)he observes most of it. At the same time we know that our evaluation of ourselves
is subject to a number of biases, not the least of which is the tendency to see
ourselves in a more favourable light than others see us.
Moreover, there is only one of me. This makes my self-appraisal quite unreliable.
The reliability of a single self-appraisal on a single scale will generally not transcendthe .30 barrier, which heavily reduces its usefulness as a predictor of any external
criterion.
Now the same is true of a single external observer. So, when using external
observers to assess someones personality, one should use the common appraisal
of at least five to eight observers in order to arrive at an acceptable level of reliability
of the average appraisal. There is of course a second reason to do so. Each
observer generally sees only a specific aspect of anothers typical behaviour (e.g.,
what a boss sees could be different from what a colleague sees). Therefore
differences in the perspectives of observers should be averaged out to arrive at a
common appraisal across all observers.
The use of averaged appraisals by other people to assess the personality of a
specific individual is seldom advocated, however. Within the perspective put forward
7/27/2019 Psicologia Traduzir
10/26
here, cost limitations and practical difficulties to gather such appraisals are among
the reasons for this. However, some people claim that you should not use appraisals
by others at all to assess the personality of an individual. The individual him-/her-self
is said to have specific knowledge about his/her own behaviour which cannot be
appraised by other people. Within the present perspective this can only refer to
internal states, not to observable behaviour. Thus, intentions, emotions or feelings
can of course only be fully appraised by the individual him- or her-self, but these are
explicitly excluded in the FFM as a direct database for assessing personality as
argued above.
W h y u s e s e l f a p p r a i s al t h e n ?
Most personality assessment is done by way of using personality questionnaires.
These are generally self appraisal instruments instead of common appraisals by
others. How can we derive valid personality inferences that is, the common
perspective of others from instruments that are merely self appraisals?
To answer this question, a closer look at the differences between self appraisal of
ones own behaviour and the appraisal by others of the same behaviour is useful.
There has been done a lot of research on this topic. The conclusions from such
research may be summarized as follows:
When a person responds honestly on a questionnaire that asks what (s)he generallydoes, his/her answers will in general be somewhat more favourable than those of
people who know him/her well and who also answer the questionnaire in a honest
manner. Whats more, for each personality factor the extent of surplus favourability in
a self appraisal is quite well known, and a next important finding is that most self
appraisals show about the same amount of favourability bias for different individuals.
This bias is therefore a quasi-constant so to speak.
Then, a methodical trick is used. Subtract the scores representing the self appraisal
of an individual on a personality questionnaire scale from the average self appraisal
scores of a well defined sample this is commonly called norming and you will
end up with a fairly good approximation of the averaged appraisal of the above
mentioned knowledgeable others.
7/27/2019 Psicologia Traduzir
11/26
According to this line of reasoning, an adequately normed score on a self appraisal
personality questionnaire which is the ubiquitous method to assess personality is
not merely a self appraisal. It is, when responded to honestly, a good
approximation of what is meant by personality within the FFM framework: the
common appraisal of the gist of someones behaviour by knowledgeable others. Not
perfectly so, of course, but as good as it gets, and as practical as generally
affordable.
H o n e s t r e s p o n d i n g
The above argument assumes honest responding. It is a challenge for personality
assessment to ensure honest responding. The most straightforward way to do so is
to take care that it is in the interest of the individual taking the questionnaire to
respond honestly. This could be achieved when the questionnaire is used as
information for the individual him/herself who wants to know his/her possibilities and
limitations for personal development. When, however, the questionnaire is used by a
prospective employer to estimate the suitability for a job, the individual may be
tempted not to answer honestly but to present him/herself in a manner that fits best
with the demands of the job.
A lot of research has been done to show that people can easily do this. In particular,
when the answer format is a so-called Likert scale which might look as follows:
1. Does not apply to me at all2. Does generally not apply to me
3. Sometimes applies to me and sometimes does not apply to me
4. Generally applies to me
5. Applies to me.
Apart from trying to convince the individual that it is in his/her best interest to answer
honestly which will not always work, of course a technical solution is often put
forward. Instead of presenting a Likert scale as given above, the person is asked to
make a forced choice between statements that do not differ much in terms of
favourability. For instance:
1. I am an orderly person
2. I am a friendly person.
7/27/2019 Psicologia Traduzir
12/26
To analyze the responses on such forced choices in order to assess someones
standing on a specific personality factor, as in this example, Conscientiousness or
Accommodation, has always posed quite a methodological problem. But recent
analysis models have overcome these problems. Thus, forced choice personality
questionnaires will be more common in the future.
O t h e r m e t h o d s o f a p p r ai s a l
Other than the use of personality questionnaires other methods have also been
proposed. Common to all these alternatives are that they are much more laborious,
while the value they add is not always clear-cut.
Most alternatives have in common that they avoid self appraisals but use
observations by others. Two frequently used alternatives are the following:
First, the use of extensive interviewing. Research shows that when personality
descriptions based on such interviews are used to predict external criteria like
performance in jobs, they are generally no better predictors than less
complicated instruments like tests for mental capacities or relevant for the
present context well-constructed and honestly taken personality
questionnaires. Of course, interviews can be made more reliable which could
probably result in better predictions, but this will be very laborious and too
expensive. Interviews must cover many different subjects to give a total picture,and more than one interviewer should be used, independent from one another.
Broadening an interview database by extending subjects to be covered and using
a number of different interviewers is, however, much more difficult to realize than
making a personality questionnaire longer to make it more reliable in order to
enhance its predictive power.
A second alternative is the direct observation of behaviour by a number of
external observers. Either in real life, or in controlled environments like role
playing according to a standardized script. Here, the same argument applies as
for interviewing. Though the observations might be much more standardized than
with interviewing, a lot of different observation situations or, for that matter, role
playing situations are needed to get a reliable estimate of the regularities in
behaviour. For that is what personality is all about: assessing the regularities
the gist of a persons behaviour, averaged across relevant time frames and
relevant situations.
7/27/2019 Psicologia Traduzir
13/26
To this end, applied professionals mostly use personality questionnaires instead of
more complicated observational procedures. Such questionnaires are by no means
perfect but their utility is at least as good as that of the alternatives, when taking time
and processing limitations into consideration.
Personality testing: some psychometric issues
Obse rvab l e da t a , l a t en t t r a i t s
Defining only observables as the basis for assessing personality is first and foremost
based upon a methodical argument. It does not imply that the observable data
coincides with the personality of an individual. These observables, however, should
be conceived as objective indicators from which an individuals position on one or
more personality dimensions may be derived. Such dimensions themselves areconceived to be latent . Thus, the more behaviours are observed which indicate
Need for stability, for example, the higher an individual is supposed to be standing
on the latent trait Need for stability.
The problem of course is how to arrive at an accurate assessment on a latent trait
when one only has fallible observables. This refers to the domain of psychometrics.
In analyzing personality questionnaires as argued above, the most commonly used
method of collecting personality data classical test theory (CTT) is generally used
for that purpose. The problem with CTT in this context is, however, that it is not a
formal model which enables one to test whether a assumed link between the scores
on the questionnaire itself and a persons standing on a latent trait is in fact correct
or not. It simply stays with the observables. CTT defines someones true score on
such a trait as the average across repeatedly completing the questionnaire.
Moreover it assumes that the unreliability of a score on a specific questionnaire is
always the same, independent from someones position on the latent trait, be it low
or high.
Both assumptions the way a true score is defined and equal reliability for each trait
position are demonstrably false but cannot be empirically checked within CTT
itself.
This is not a problem specific to personality assessment, but it is relevant in this
context because it not only hinders theoretical progress but also leads to
7/27/2019 Psicologia Traduzir
14/26
interpretational problems. More specifically, distinguishing between real personality
information and distortion by so-called response biases as, for instance, dishonest
responding cannot be done without ambiguity.
Modern test models have been developed to overcome such problems, among other
things.
Mode rn t e s t m ode l s : I t em R es p o n s e Theo r y ( IRT)
Recent research and practice in analyzing and using personality questionnaires is
based more and more on the so-called Item Response Theory (IRT). For the present
purposes, IRT may be described as a set of models by which one can test formally
how an individuals observable responses on a personality questionnaire can be
translated into a specific position on a latent personality trait. When the test is not
falsified by a given set of data, someones position on the latent trait can then be
directly measured.
Most importantly, at least two advantages of using IRT may be mentioned where
personality assessment is concerned. First, one may actually test the correctness of
different alternative models of personality against each other, whereas within CTT
the questionnaire responses at hand are by definition assumed to be correct
indicators. Referring to the FFM discussion above, IRT might fruitfully be used to test
models that formally specify in what way more than five factors might be needed toaccount for all variation in personality questionnaires.
A second advantage is the possibility to properly analyze forced response
questionnaires as described above in order to control for dishonest responding.
Within CTT, this is hindered by formal statistical problems.
IRT will not psychometrically be described here. The reader is referred to
psychometric articles elsewhere or on the present website.
7/27/2019 Psicologia Traduzir
15/26
Predictive power and utility
C h ai n o f i n f e r e n c e an d p r e d i c t i v e u t i l i t y
From a psychometric perspective someones personality is not assessed merely in
its own right, but as a predictor of some relevant external criterion. In contexts of personal development this might be a measure of life satisfaction or career
development. In organizational contexts the criterion might be suitability for a job or
the quality or quantity of performance in a job.
Common to these criteria is that they depend on a host of other things besides
someones personality. For example, often things that are beyond the control of the
individual such as economic circumstances impact on the job performance of a sales
manager. The latter state of affairs inevitably results in a decrease of predictive
power of personality as a predictor.
In fact we have a so-called chain of inference here. At the end of the chain is the
criterion we want to predict. Lets stay with the example: performance as a sales
manager. The personal part an individual can contribute to his/her quality as a sales
manager is the last but one part of the chain: his/her own behaviour. All behaviour
that contributes to being a good sales manager is summarized under the label:
competencies . In the case of a sales manager, a competency like persuasiveness
might be a relevant behaviour.
The first part of the chain is someones personality. In line with the foregoing this
might be a specific profile of scores on the factors and facets of the FFM. A relevant
facet here might be Rebound time from the FFM factor Need for stability as
defined earlier.
Now, when directly predicting the third part of the chain performance from the
first part personality two bridges have to be crossed. The first bridge is the extent
to which rebound time predicts the quality of the competency persuasiveness. This
prediction will not be perfect of course, since persuasiveness as a competency is a
skill to be learned , for which a short rebound time as a personality trait might be of
more or less help.
7/27/2019 Psicologia Traduzir
16/26
The second bridge is the extent to which the competency persuasiveness is an
important contributor to performance as a sales manager. This partly depends on
economic circumstances, tough or easy, for instance.
Thus the predictive power of personality is always to some extent decreased by the
two bridges to be crossed. In order to investigate the utility of using personality as a
predictor it is a wise strategy to attend to the bridges themselves instead of looking
at the direct predictive power by spanning both bridges at once.
First, one should empirically investigate which competencies are relevant
contributors to the criterion. In our example: are people who are really skilled in
persuasiveness relatively more often the better sales managers?
Then the second research step follows. Which personality facets from the FFMpredict the extent of effort and time needed to learn a specific competency as well as
the easiness by which this competency may be skilfully executed when it is needed
in practice? Again in our example the following turns out to be the case: the less
rebound time one needs to recover from setbacks, the more easily and faster one
learns to master the competency persuasiveness (and a number of other
competencies of course).
When only the relevant competencies are carefully selected on the one hand, and
only the relevant personality facets to predict those competencies are selected on
the other hand, the predictive power of a personality questionnaire commonly
called: its validity may approach a value between .30 and .40, depending on what
criterion is to be predicted. This may strike you as being a low figure after all, but the
utility in practical contexts is nevertheless considerable. For one, the best predictor in
organizational contexts, general intelligence, has a validity coefficient between .50
an .60 (Schmidt & Hunter, 1998). More important, however, is to realize what this
means. A validity coefficient of .30 means that one avoids about 30 percent of the
false decisions one would have made if one would have selected candidates for the job of sales manager at random, without taking the targeted personality information
into account. Quite a substantive gain in utility indeed.
7/27/2019 Psicologia Traduzir
17/26
E n h a n c i n g v a l i d i t y : s u b j e c t m a t t e r ex p e r t s
The way in which validity should be investigated as exemplified above is of course
more easily written down then actually done. As far as the second bridge from
competencies to actual performance is concerned, directly studying the validity is a
feasible research task. Let us elaborate on our example a little bit further. First, one
samples evaluations of the actual skill of sales managers or future sales managers in
terms of a number of competencies. Then one collects present or future sales
figures. Evaluating the predictive power of the skill in those competencies might then
actually show that persuasiveness has the highest validity. So, persuasiveness
should be the competency to be predicted with the first bridge from personality to
competency. We will not dwell here on the methodical aspects of doing a
professionally sound predictive validity study. Readers are referred to other readings
in this regard.
Studying the first validity bridge is the bigger problem. Remember what is to be
predicted here: effort and time needed to learn a specific competency. This demands
a research design in which at regular intervals over many years data on actual
competency skill should be collected. Then, after enough years there should be
enough data to establish a valid trend. One would then correlate the speed of
learning in our example: the speed of learning persuasiveness with the initial
scores obtained for a number of personality facets. In our example this might result
in the assertion that rebound time is quite a good predictor of the speed with which
one may learn persuasiveness.
Such a study is not often done and will not often be done in the future either. There
is a good alternative, however: use well-informed judges, so-called subject matter
experts (SME). Select for such SMEs people who have a lot of experience in
observing both skill in competencies and standing on personality traits for different
individuals. Applied psychologists will generally be a good choice. Now the following
not unreasonable assumption is that each of these SMEs has a non-zero validity of,
say, a modest .10 or .20 in predicting speed of learning a specific competency likepersuasiveness for a person who has a high score on a personality facet like
rebound time. Ask each SME to directly estimate the validity of rebound time in
predicting speed of learning persuasiveness Then, elementary psychometrics will
show that the average estimate of some five to ten SMEs is as good an estimate as
may be obtained from any complicated validity study conducted over many years, as
described above. This approach is probably even better and more consistent,
7/27/2019 Psicologia Traduzir
18/26
because a study conducted over many years is confronted with many
methodological problems. Of course, this is the main reason why not many such
studies are undertaken in the first place.
And the SME-strategy has been shown to work. In studying the validity of a specific
FFM questionnaire, the RBFP (Schakel & Smid, 2005), SMEs were used to estimate
the validity of FFM facets in predicting speed of learning competencies. Then, these
predictions were checked in a large database of evaluations of actual skill in a
number of competencies gathered over a number of years. The resultant validity
coefficients were between .20 and .50. These are the figures one would have
expected when the relevant facets to use in prediction were originally selected
through a complicated research project spanning many years..
Also for personality research it is therefore a wise thing to remember that theaverage appraisal of well-informed SMEs often is a good alternative to laboriously
collecting fallible data in a complicated research design. This is not tp say that SME
input should completely replace empirical data, but this approach can provide a
sufficiently accurate estimate of the validity coefficient.
Some current issues
To draw this reading to a close, some current issues will be briefly touched on.
Recent personality research literature will cover these issues in depth. The reader is
referred to this literature for more specific further information on concepts and
empirical results.
I n t e r c u l t u r a l e x c h a n g e a b i l i t y
Above we already mentioned the research into additional personality factors up and
above the FFM. In particular, research has been done in Asian (especially Chinese)
and African (especially indigenous South African) cultures. As mentioned,interpersonal commonness and respect as well as integrity are additional factors
which might be added to account more fully for what the personality construct
encompasses within these cultures. At present, targeted personality questionnaires
taking these additional factors into account are being developed. In these research
approaches the original FFM kernel is safeguarded while adding some extra factors.
7/27/2019 Psicologia Traduzir
19/26
Another line of research concerns itself with the question of whether the FFM has
intercultural applicability. This research has mostly been done with personality
questionnaires. For example, the intercultural applicability of the NEO (McCrae &
Teracciano, 2005) has been extensively investigated. In general one might conclude
from this research that the FFM structure of the NEO (and other FFM based
questionnaires) has a broad intercultural exchangeability, though on specific facets
some reliable small average differences between cultures have been found.
How these differences should be interpreted is still open to debate. This hinges on
the definition of the concept of culture. This is often loosely paraphrased as the way
we do things around here. Remember that in a previous section we drew a
distinction between personality questionnaire items as behavioural indicators of
latent personality traits. It might well be conceivable, for example, that Japanese and
European people do not differ on average on a personality facet like deference, butthat they simply express it in a culturally different way. Japanese people might
express it by the steepness of their bows, European people by the politeness of how
they address a stranger. Personality questionnaires with such items as indicators will
show differences in responses between both cultures. But this need not indicate
differences on the latent trait of deference.
As also mentioned above, disentangling cultural differences in indicators and
underlying identity in latent traits can only be well researched by using modern test
theory, namely, Item Response Theory (IRT). This is an additional reason to expect
that the use of IRT models in personality research will increase.
I m p o r t a n c e o f s i t u a t i o n s : b r o a d v e r s u s s m a l l p r e d i c t o r s
In presenting the structure of the FFM we mentioned the broad level of aggregation
of the five factors. On the simple question are you conscientious? most people will
respond it depends. Depends on what? The situation.
Two aspects should be taken into account as regards this issue. First, a factor such
as Conscientiousness consists of a broad set of behaviours which comprise
different facets that are only imperfectly correlated. These facets might be broadly
categorized into two classes. In particular, facets which describe orderly behaviour
like I regularly clean my desk, as well as facets describing dependable behaviours
like people can count on me. Not all people who show the latter behaviour also
7/27/2019 Psicologia Traduzir
20/26
show the former, though, more often than no, they are found together in one person
than not. This is the reason they are subsumed under the general factor of
Conscientiousness in the first place. So, even where someones general behavioural
tendencies are concerned, it depends on the person who you observe whether (s)he
will show both classes of behaviour in general.
A second aspect to attend to in this context, however, is on the individual level. Even
if a person is in general both orderly and dependable, then it still depends on the
situation whether one or both of these behaviours will in reality be manifested. For
this to happen the latent trait orderly or dependable should first be activated
through demands within the situation. Without proper activation nothing is
manifested. This refers to an important present research approach, the so-called
trait-activation theory (Tett & Burnett, 2003).
This line of arguing surely calls for not using the broad big five factors of the FFM in
predicting external criteria, but instead using much smaller and more homogeneous
predictors at the level of facets. In fact, most research on FFM-based questionnaires
like the NEO as well as the RBFP are based on this strategy. And this research
results in higher validities than using the much broader big five factors themselves as
predictors.
P er s o n a l i t y a n d m e n t a l a b i l i t y
There is a big difference between tests for mental ability and personality
questionnaires. The former are so-called maximum performance instruments, and
the latter are commonly referred to as typical performance instruments.
In mental ability tests it is known beforehand which is the correct answer, and you
certainly cannot fake the correct answer if you do not actually know it. Someones
standing on a personality trait on the other hand is generally defined as the actual
relative frequency (s)he shows on a set of behaviours which are presumably
indicators of that trait. This is assessed by simply asking a person whether (s)he
does so. Such a question refers to typical behaviour. Of course, here one may easily
fake what one does, even if this is not the case. And this occurs more frequently
when a 5-point so-called Likert scale is used, as explained above.
7/27/2019 Psicologia Traduzir
21/26
Now, more intelligent persons those scoring higher on mental ability tests might
more easily fake on the one hand and at the same time understand that it is not in
their interest to do so. This is a really important state of affairs to take into account
when trying to prevent faking in personality questionnaires. Though, as also
explained above, most of the time a technical solution like the use of forced choice
items is preferred. Anyhow, background and prevention of faking are presently
important issues and are likely to remain so for the foreseeable future. Especially in
relation to the more frequent use of IRT models.
Research shows that there is virtually no correlation between mental ability and
profiles on a personality questionnaire when the latter has been honestly responded
to. There is only a small correlation with the factor Openness but the interpretation
of this correlation is a matter of debate.
Yet, at the same time more intelligent persons may learn and master competencies
easier, which does not fit well with their personality relative to the personalities of
less intelligent ones. To take our earlier example: even persons who in general have
a long rebound time might well learn to adequately master a competency that does
not fit them like persuasiveness, simply because it might easier for them to know
how to do this even when it is not their most natural behaviour. This is one of the
reasons why general intelligence is often shown to be the best general predictor of
performance in organizations. The smarter you are, the more you may not only
capitalize on your personality but also compensate for those facets which fit less well
with the demands that are made of you.
Therefore, in using personality as a predictor of external criteria the interplay with the
level of general intelligence will remain a very important and fascinating research
subject for time to come.
Conclusion
The concept of personality has been studied from a multitude of perspectives. In the
present article the psychometric perspective was elaborated. The role of well-
constructed personality questionnaires as the primary source to study personality
was described. The Five Factor Model was put forward as the integrative framework
for constructing and using such questionnaires.
7/27/2019 Psicologia Traduzir
22/26
It has been stressed that personality should be conceived as enacting itself in the
public domain. The average appraisal of the well informed other should therefore
be approximated by any assessment instrument. Under conditions of honest
responding it can be shown that a personality questionnaire does a reasonably good
job in that respect. Of course, such conditions are not easily ensured, but both paired
comparison response formats and modern test theory in the form of Item Response
Theory models are promising tools in this respect.
As a final comment it is worthwhile to realize that there is much more to predicting
behaviour than only including personality traits. Culturally specific ways of behaving,
the powerful influences of situations as well as the ability to monitor your own
behaviour should also be taken into account. In the same way that we can tell
different persons apart from each other, we can tell situations and cultures apart as
well.
7/27/2019 Psicologia Traduzir
23/26
References
Ashton, M.C., Lee, K., Perugini, M., Szarota, P., De Vries, R.E., & Di Blas, L. (2004)
A six-factor structure of personality-descriptive adjectives: Solutions from
psycholexical studies in seven languages. Journal of Personality and Social
Psychology , 86 (2), 356-366.
Buss, D.M. (Ed.)(2005). The handbook of evolutionary psychology. New York: Wiley.
Cheung, F. M., Leong, F. T. L., & Ben-Porath, Y. (Guest Editors) (2003). Special
Section: Psychological Assessment in Asia. Psychological Assessment, 15, 243-
310.
De Raad, B., & Barelds, D.P.H. (2008). A new taxonomy of Dutch personality traits
based on a comprehensive and unrestricted list of descriptors. Journal of personality
and Social Psychology, 94 , 347-364.
McCrae, R.R., & Costa, P.T., Jr. (2003). Personality in adulthood: A five-factor theory
perspective . New York: Guilford.
McCrae, R.R., & Terraciano, A. (2005). Personality profiles of cultures: aggregate
personality traits. Journal of personality and Social Psychology, 89, 407-425.
Meiring, D., Vijver, F.J.R. van de, Rothmann, S., & Barrick, M.R. (2005). Construct,
Item, and Method Bias of Cognitive and Personality Tests in South Africa. South
African Journal of Industrial Psychology , 31 (1), 1-8.
Schakel, L., Smid, N. & Jaganjac, A. (2007). Workplace Big Five: Professional
Manual . Utrecht: PiCompany.
Schakel. L. & Smid, N.G. (2005). Predicting career decisions through combining
personality and competencies. HRM Network Conference , TU Twente.
7/27/2019 Psicologia Traduzir
24/26
Schmidt, Frank L., & Hunter, John E. (1998). The Validity and Utility of Selection
Methods in Personnel Psychology. Practical and Theoretical Implications of 85 Years
of Research Findings. Psychological Bulletin, 124 , 262-274.
Tett, R. P., & Burnett, D. D. (2003). A personality trait-based interactionist model of
job performance. Journal of Applied Psychology, 88, 500517.
7/27/2019 Psicologia Traduzir
25/26
Questions for discussion
How would you explain the psychometric perspective of personality?
What are the implications of restricting the domain of personality data to publicly
observable behaviours?
What are the main arguments why the Five Factor Model has a global
intercultural applicability, and which should be the limiting conditions in this
respect?
What are reasons to expect that smaller and more homogeneous behaviouralfacets of the main big five factors of the FFM are better predictors of real life
criteria than those big five factors themselves?
What are the conceptual differences between behaviours, intentions and
motivations in relation to the concept of personality as defined in this reading?
What are the consequences for research methodology?
In what way may well-constructed self appraisal personality questionnaires be
perceived as good approximations of the average personality appraisal by well-
informed others?
Which methods may be used to counter dishonest responding in self appraisal
personality questionnaires?
What important contributions to personality research can be made by modern
test theory in the form of Item Response Theory (IRT) in contrast to deficiencies
in Classical Test Theory (CTT)?
How would you describe the two bridges to be crossed in the chain of inference
from personality assessment to a real life criterion?
In what way is the predictive power of personality influenced by differences in
mental ability?
7/27/2019 Psicologia Traduzir
26/26
About the author
Nico Smid
Nico Smid has been a principal consultant within PiCompany since 1999. Before that
time he has had a broad and varied experience in both scientific research and
teaching as well as HR consulting. As a university lecturer he taught research
methodology as well as personality theory and assessment, while at the same time
fulfilling central university management roles.
He became a PhD in Psychology with a dissertation on Determinants of Personality
Judgements. He was a co-founder and board member of the European Associationof Personality Psychology. From 1986 onwards he was a central concept
development consultant for management development within Philips Electronics and
he has among other things redesigned and implemented selection and potential
appraisal systems within that company. Following that he has been a HR strategy
and assessment centre development consultant within two consultancy firms, Beteor
and Towers Perrin. At present within PiCompany he is responsible for quality
management and concept development as well as maintaining external professional
networks.