Comparison
HEXACO vs Big Five vs MBTI: which personality test is best?
There are dozens of personality tests. Three models dominate: MBTI, Big Five, and HEXACO. They don't measure the same things, they aren't equally reliable, and they paint a very different picture of who you are.
In short
HEXACO is scientifically the most complete model: it measures the five Big Five dimensions plus a sixth, Honesty-Humility (the H-factor). Big Five is the validated standard. MBTI and Enneagram are popular but not scientifically reliable — low test-retest reliability and no robust validity. Innerscape uses HEXACO, combined with 7 other validated questionnaires (365 questions, ~45 min).
Comparison table
In short: HEXACO and Big Five are scientifically reliable, MBTI and Enneagram are not.
| MBTI | Enneagram | Big Five | HEXACO | |
|---|---|---|---|---|
| Dimensions | 4 dichotomies → 16 types | 9 types + instinctual variants | 5 continuous scales | 6 continuous scales |
| Developed by | Carl Jung / Myers & Briggs | Ichazo, Naranjo (1970s) | Costa & McCrae (1985) | Ashton & Lee (2000) |
| Scientific basis | Weak — not reproducible | Very weak — no empirical basis | Strong — the gold standard | Strong — successor to Big Five |
| Measures the H-factor? | No | No | No | Yes |
| Type vs dimension | Boxes (introvert OR extravert) | Boxes (1 of 9 types) | Scales (more or less) | Scales (more or less) |
| Cross-cultural validation | Limited | Minimal | Good | Excellent (12+ languages) |
| Used at Innerscape | No | No | No (we use HEXACO) | Yes |
What makes a personality test scientifically reliable?
A reliable personality test meets three scientific criteria: test-retest reliability, internal consistency, and validity. Test-retest reliability means you get roughly the same result if you retake the test a few weeks later. Internal consistency — often measured with Cronbach's alpha — shows that items measuring the same trait genuinely hang together. Validity means the test measures what it claims to measure, and that the results actually predict something about behavior.
Big Five and HEXACO score excellently on all of these points. Test-retest reliability is above 0.70, Cronbach's alpha above 0.80, and both models predict behavior in work, relationships, and health. HEXACO also has a unique strength: cross-cultural validation across more than 12 languages through lexical studies. That means the dimensions don't just show up in English — they are universal.
MBTI and Enneagram fail these criteria. MBTI has a test-retest reliability of around 0.50 — when retested, roughly half of people are assigned a different type. The Enneagram has no scientific basis in peer-reviewed research and was developed from spiritual traditions rather than from empirical personality research. Both are popular because they offer a recognizable language for self-reflection — but recognition is not proof of reliability.
Put practically: if a test tells you something different every time, or if the result predicts nothing about your actual behavior, it is not reliable — however fun or recognizable the categories may sound. For a reliable personality test, the science behind the method should matter at least as much as the report you get back.
Is the MBTI scientifically reliable?
No — the MBTI is popular but not scientifically reliable: its test-retest reliability is low and there is no peer-reviewed consensus on its validity. The MBTI is based on Carl Jung's type theory (1921) and was popularized by Katharine Briggs and Isabel Myers. The model sorts people into 16 personality types based on four dichotomies: introversion/extraversion, sensing/intuition, thinking/feeling, and judging/perceiving. The test is enormously popular with the general public and widely used in corporate training and team building.
The scientific criticism, however, is substantial. Test-retest reliability is low: when retested, a considerable share of participants are assigned a different type. There is no peer-reviewed consensus on the validity of the model. The dichotomies are artificial — most people score in the middle of each scale, not at the extremes. You are rarely purely introverted or purely extraverted; the MBTI forces you into a box that does not reflect reality.
Is the Enneagram scientifically grounded?
No — the Enneagram is a spiritual typology without scientific grounding: there is no peer-reviewed empirical basis and its test-retest reliability is low. The Enneagram sorts people into 9 personality types, each with a core motivation and associated pitfalls. The model was popularized in the 1970s by Oscar Ichazo and Claudio Naranjo and is widely used in coaching, spiritual growth, and self-help literature. For many people it is recognizable and offers a language to reflect on themselves.
Scientifically, however, the Enneagram is weak. There is no peer-reviewed empirical research supporting the 9 types as independent, measurable constructs. Its origins lie in mystical and spiritual traditions, not in personality research. Test-retest reliability is low and the types overlap substantially with Big Five dimensions — suggesting that the Enneagram essentially lays a narrative layer over known personality traits without adding new information.
What does the Big Five measure — and what does it miss?
The Big Five measures five personality factors — Openness to experience, Conscientiousness, Extraversion, Agreeableness, and Neuroticism (together OCEAN) — but misses the H-factor (Honesty-Humility). The model, developed by Costa and McCrae, was the dominant model in academic personality research for decades, with strong validation and good test-retest reliability across cultures.
The model does have a blind spot: it misses the H-factor. Big Five cannot predict ethical behavior, integrity, and manipulative behavior well. Someone who scores high on Agreeableness in the Big Five model can still be manipulative or dishonest — a distinction the Big Five does not make. For a complete picture of personality, more is needed.
Why is HEXACO the most complete model?
HEXACO is the most complete model because it measures everything the Big Five measures plus a sixth dimension: Honesty-Humility (the H-factor). Ashton and Lee discovered through lexical studies in more than 12 languages that this sixth dimension exists and that the Big Five misses it. This dimension measures the tendency toward sincerity, fairness, and modesty versus manipulation, greed, and self-aggrandizement. The H-factor recurs in every language studied and predicts narcissism, Machiavellianism, and integrity.
HEXACO also redefines existing dimensions. Emotionality is broader than Neuroticism in the Big Five and includes sentimentality and attachment. Agreeableness is defined differently than in the Big Five, with a sharper focus on patience, forgiveness, and flexibility. The result is a model that is at least as scientifically strong as the Big Five, but gives a more complete and nuanced picture of personality.
Which test does Innerscape use, and why?
HEXACO is the most complete scientific personality model available. It measures everything the Big Five measures, plus the H-factor that reveals honesty, integrity, and humility. Innerscape combines the HEXACO test with 7 other validated questionnaires — from attachment style to values and coping strategies — for the most complete personality profile you can get online.
Frequently asked questions
Which personality test is best?+
Scientifically, HEXACO is the most complete model. It measures everything the Big Five measures, plus honesty and humility. MBTI is popular but not scientifically reliable.
Is the MBTI scientific?+
The scientific consensus is that the MBTI does not meet the standards for reliability and validity. You can be assigned a different type when you retake it.
What is the difference between Big Five and HEXACO?+
HEXACO adds a sixth dimension: Honesty-Humility. It also defines Emotionality and Agreeableness differently from the Big Five.
Can I translate my MBTI type into HEXACO?+
Not directly. MBTI measures types (boxes), HEXACO measures continuous dimensions (scales). There is some overlap on Extraversion and Openness, but the models are fundamentally different.
And the Enneagram — is it scientific?+
No. The Enneagram sorts people into 9 types based on spiritual traditions from the 1970s. There is no peer-reviewed empirical basis for this typology and its test-retest reliability is low. It can be useful for self-reflection, but not for reliable personality measurement.
What makes a personality test scientifically reliable?+
Three criteria: test-retest reliability (you get the same result when retested), internal consistency (items that measure the same thing hang together), and validity (the test actually predicts behavior). Big Five and HEXACO meet all three; MBTI and Enneagram do not.
Explore more