Paul Ekman's FACS at 46: Facial Action Coding Then and Now

Paul Ekman's Facial Action Coding System turns 46. The anatomical observation framework remains the gold standard — but what practitioners can infer from it has been refined, challenged, and occasionally exploited beyond what the science supports.

The System That Started It All

In 1978, Paul Ekman and Wallace Friesen published the Facial Action Coding System — a taxonomy of every anatomically possible facial movement, catalogued into 44 "Action Units" (AUs) that map directly to underlying muscle contractions. FACS was not the first attempt to classify facial expression, but it was the first to be anatomically grounded rather than impression-based. It did not ask "does this face look angry?" It asked "which muscles contracted, in what combination, at what intensity, for how long?"

That distinction — observation over interpretation — made FACS the foundation of modern facial expression research, clinical psychology assessment, animation (Pixar licenced FACS for character animation in the 1990s), security training, and the now-ubiquitous "emotion AI" industry. Forty-six years later, it remains the reference standard against which all other facial measurement methods are validated.

But FACS is not unchanged, and the field it spawned has evolved — sometimes in directions Ekman himself has challenged. Here is where things stand.

What FACS Got Right

Anatomical precision

The core insight — that facial expression can be decomposed into discrete, observable muscle movements rather than holistic emotional labels — remains unchallenged. AU 6 (cheek raiser, orbicularis oculi contraction) plus AU 12 (lip corner puller, zygomaticus major contraction) produces what we call a "Duchenne smile" — one that involves the eyes. AU 12 alone produces a social or posed smile. This distinction, invisible to most untrained observers, is reliably detectable once learned and has been replicated across hundreds of studies.

Cross-cultural universality of basic expressions

Ekman's original cross-cultural studies (including work with pre-literate New Guinea populations) demonstrated that certain facial configurations are produced and recognised across cultures: anger, fear, disgust, surprise, sadness, happiness, and contempt. This "basic emotions" framework has been debated (see below) but the core finding — that these specific AU combinations occur universally — has been replicated more than any other finding in affective science.

The distinction between spontaneous and deliberate expression

FACS enabled researchers to distinguish between genuine emotional expressions (involuntary, typically symmetric, with appropriate timing) and performed expressions (deliberate, often asymmetric, with timing that is too fast or too slow). This distinction underpins modern deception detection research, clinical assessment of flat affect, and understanding of emotional regulation.

What Has Changed Since 1978

1. Automated FACS coding (AU detection via computer vision)

The most significant practical change is automation. Manual FACS coding is slow — a trained coder analyses approximately one minute of footage per hour. This made large-scale research expensive and clinical application impractical.

Since the early 2010s, machine learning systems have been trained to detect Action Units from video in real time. OpenFace (Baltrušaitis et al., 2018), AFAR (Ertugrul et al., 2020), and commercial systems (Affectiva/Smart Eye, Noldus FaceReader) can now detect most AUs with accuracy approaching trained human coders for posed expressions — though performance degrades for spontaneous, subtle, or partially occluded expressions.

This automation has enabled research at scales Ekman could not have imagined: millions of faces coded across cultures, longitudinal tracking of facial behaviour in clinical populations, and real-time feedback systems for training applications.

2. The constructionist challenge

Lisa Feldman Barrett and the constructionist school have mounted the most significant theoretical challenge to Ekman's framework since its publication. Their argument: there is no one-to-one mapping between facial configurations and emotional states. A furrowed brow (AU 4) does not "mean" anger — it occurs in anger, concentration, confusion, pain, and bright sunlight. The same emotion can be expressed through different facial configurations in different contexts.

This debate matters operationally because it challenges the inference step: FACS tells you which muscles moved, but it does not — cannot — tell you what the person is feeling. That inference requires context, baseline, cultural knowledge, and verbal information. Practitioners who skip the observation layer and jump directly to "they are lying" or "they are angry" based on a single AU are making exactly the error Barrett identifies.

The practical resolution: FACS remains valid as an observation tool (it describes what the face did). The dispute is about what you can infer from what the face did. In professional application, this means treating AU observations as data points that inform hypotheses, not as diagnostic conclusions.

3. Individual and cultural variation

Ekman's original work emphasised universality — what is the same across cultures. Subsequent research has increasingly documented what differs: display rules (when and how much emotion to show), decoder expectations (what expressions you expect to see and therefore perceive), intensity norms, and individual variation in facial anatomy that affects which AUs are producible and visible.

For security and hospitality applications, this means that a person whose face does not display expected emotional markers is not necessarily concealing something — they may simply have different display norms. East Asian populations, on average, produce lower-intensity facial expressions in public settings compared to Western European populations. This is a display rule difference, not a deception indicator.

4. FACS 2.0 and beyond

The system itself was updated in 2002 (Ekman, Friesen & Hager), refining scoring criteria and adding AU combinations. Independent extensions have added head movement coding, gaze direction coding, and vocal prosody integration. The current state of the art treats facial expression not as an isolated channel but as one component of a multimodal communication system that includes voice, gesture, posture, and context.

5. The "emotion AI" industry — and its problems

FACS spawned a commercial industry that claims to detect emotions from faces at scale — in hiring, in education, in advertising testing, in surveillance. This industry has drawn criticism from Ekman himself, from Barrett, and from AI ethics researchers (Crawford, 2021) for several reasons:

Inferring internal states from external expressions violates the very caution FACS was designed to enable (observe the action unit; do not assume the emotion)
Training data for these systems is predominantly Western, white, and posed — producing systematic bias against non-Western facial morphology and expression norms
The accuracy claims made by commercial vendors rarely survive independent validation
The ethical implications of automated emotion surveillance have not been adequately addressed

What Remains True for Practitioners

If you work in a field that requires reading facial expression — hospitality, security, healthcare, leadership, sales — these principles have survived 46 years of research:

Observation is separate from interpretation. Train yourself to see what the face does (which muscles move) before deciding what it means. FACS's lasting gift is this discipline.
Context determines meaning. The same expression means different things in different situations. A furrowed brow at a funeral is not the same as a furrowed brow at a desk.
Clusters outperform single cues. One AU tells you very little. Multiple AUs appearing simultaneously, congruent with body language and voice, tell you something useful.
Baselines are required. You cannot identify deviation without knowing what is normal for this person, in this context, at this time.
Cultural humility is non-negotiable. Your own cultural norms for facial expression are not universal. Train against the assumption that your decoder ring works everywhere.
The face is one channel of many. It is a valuable channel — the most information-dense region of the body — but professionals who fixate on the face while ignoring hands, posture, voice, and context are working with partial data.

The Bottom Line

FACS at 46 is what most foundational scientific tools become with age: still structurally sound, partially superseded by automation, surrounded by commercial exploitation that exceeds what the science supports, and embedded in a theoretical debate that has refined rather than replaced its core contribution. For practitioners, the takeaway is simple: learn to see the face with anatomical precision. Do not assume what it means. Let context, clusters, and baselines inform your interpretation. Ekman gave us the observation tool. What we do with the observation is still, and always was, a matter of skill, context, and intellectual honesty.

Paul Ekman's FACS at 46: What Has Changed in Facial Action Coding — and What Has Not

The System That Started It All

What FACS Got Right

Anatomical precision

Cross-cultural universality of basic expressions

The distinction between spontaneous and deliberate expression

What Has Changed Since 1978

1. Automated FACS coding (AU detection via computer vision)

2. The constructionist challenge

3. Individual and cultural variation

4. FACS 2.0 and beyond

5. The "emotion AI" industry — and its problems

What Remains True for Practitioners

The Bottom Line

Related reading

See the ‘no’ coming before they say it

The Baseline & Cluster Method

More Articles

Behavioural Profiling at the Gate: What Airport Security Can Learn from FBI Interview Techniques

Proxemics in Practice: How Personal Space Drives Commercial Outcomes in Hospitality and Retail

ROI of Communication Training: A Measurement Framework for L&D