Unicode, Tolkien, and Privacy

15 · John Cook · March 9, 2025, 8:37 p.m.
Summary
This blog post discusses the author's thought process linking discussions on LLM tokenization, Unicode character tokenization, and its implications regarding the Private Use Area, drawing connections to themes from Tolkien and privacy.