← Back to Blog

Language‑Fair WPM in 2026: Normalizing Typing Tests for Emoji, CJK, and New Unicode Scripts

Language‑Fair WPM in 2026: Normalizing Typing Tests for Emoji, CJK, and New Unicode Scripts

Why “words per minute” needs an update in 2026

If you’ve ever compared an English WPM score against a Chinese or Japanese one, you’ve probably sensed something was off. The classic WPM convention—counting a “word” as five characters, including spaces and punctuation—was designed around Latin‑script English, not CJK scripts or emoji‑heavy chats. By design, it bakes language bias into leaderboards. (en.wikipedia.org)

Meanwhile, the writing system itself keeps expanding. Unicode 16.0 (released September 10, 2024) added 5,185 characters and seven new scripts, bringing the total to 154,998 encoded characters—and even more CJK‑relevant metadata via 36,000+ new Japanese source references for ideographs. Emoji 16.0 added eight new emoji in total (seven encoded in Unicode 16.0 plus the Flag of Sark in Emoji 16.0). (blog.unicode.org)

Looking ahead, Unicode 17.0 shipped September 9, 2025 with 4,803 additional characters and four new scripts, and Emoji 17.0 is expected to land broadly on major platforms during the first half of 2026 (with some early support in late 2025). That means your test takers will soon encounter brand‑new emoji and scripts in everyday typing. (blog.unicode.org)

Bottom line: it’s time to make typing tests language‑fair.

What makes raw WPM incomparable across languages

A better scoreboard: CPM, graphemes, and bits

Here are practical, build‑today scoring upgrades that make leaderboards fairer across scripts and emoji‑heavy prompts.

1) Make CPM the primary speed metric

2) Count what users perceive: grapheme clusters

Implementation tip: In the browser or app, segment the reference text into grapheme clusters up front (e.g., using Intl.Segmenter or a UAX #29 library). Score accuracy and speed by clusters, not code points.

3) Offer information‑theoretic normalization (advanced)

Pragmatically, start by normalizing CPM with a fixed per‑language bits/character estimate from a representative corpus, and refine over time.

4) Keep “classic WPM,” but make it language‑aware

Emoji‑inclusive prompts that reflect 2026 reality

Practical tip: Validate that your editor input and rendering treat emoji sequences as single grapheme clusters—cursoring, deletion, and backspace should operate atomically. UAX #29 and UTS #51 set the ground rules. (unicode.org)

Multilingual corpora and transliteration fairness

Implementation checklist (for test builders)

Quick wins for typists (actionable tips)

The takeaway

Unicode’s rapid growth—5,185 characters and seven scripts in 16.0, plus thousands more in 17.0—along with the 2026 wave of Emoji 17.0, makes old‑school WPM comparisons feel increasingly out of date. Shift to CPM and grapheme‑aware counting, consider an information‑theoretic score, and design emoji‑inclusive, multilingual prompts. Your users will get a fairer, more modern leaderboard—no matter what they type. (blog.unicode.org)

Article illustration

Ready to improve your typing speed?

Start a Free Typing Test