refactor: support emojis and high unicode characters #38

avdoseferovic · 2026-01-16T15:44:21Z

⚠️ DISCLAIMER: This PR was mostly produced by AI. I tested it manually and verified that emoji usage does not cause panics.

Details

This PR implements full support for emojis and Unicode characters outside the BMP.
It includes:

Data structure updates (map-based widths).
CMAP Format 12 parsing.
CID remapping strategy to handle characters > U+FFFF using Identity-H encoding.
Updates to Text, CellFormat, write, and generateCIDFontMap to support this remapping.
Backward compatibility for existing fonts and tests.
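
To illustrate the CID remapping idea: Identity-H uses 2-byte character codes, so a rune above U+FFFF has to be assigned a substitute 16-bit CID before it can be written to a content stream. The sketch below is not the code from this PR. The `cidMapper` type, the `newCIDMapper` constructor, and the choice of the Private Use Area (starting at 0xE000) as the CID pool are assumptions made for illustration; only the names `runeToCid`, `getOrAssignCID`, and `stringToCIDs` come from the PR description, and their real signatures may differ.

```go
package main

import "fmt"

// cidMapper is a hypothetical stand-in for the runeToCid map that the PR
// adds to fontDefType; the real field layout may differ.
type cidMapper struct {
	runeToCid map[rune]int
	nextCid   int // next free 16-bit CID for runes above U+FFFF
}

// newCIDMapper is a hypothetical constructor.
// Assumption: remapped CIDs are handed out from the BMP Private Use Area.
func newCIDMapper() *cidMapper {
	return &cidMapper{runeToCid: map[rune]int{}, nextCid: 0xE000}
}

// getOrAssignCID returns r itself for BMP runes and a stable remapped CID
// for anything above U+FFFF, so that every CID fits in two bytes.
func (m *cidMapper) getOrAssignCID(r rune) int {
	if r <= 0xFFFF {
		return int(r)
	}
	if cid, ok := m.runeToCid[r]; ok {
		return cid
	}
	cid := m.nextCid
	m.nextCid++
	m.runeToCid[r] = cid
	return cid
}

// stringToCIDs encodes s as big-endian 2-byte CIDs, the layout used with
// Identity-H.
func (m *cidMapper) stringToCIDs(s string) []byte {
	out := make([]byte, 0, 2*len(s))
	for _, r := range s {
		cid := m.getOrAssignCID(r)
		out = append(out, byte(cid>>8), byte(cid))
	}
	return out
}

func main() {
	m := newCIDMapper()
	// Both emoji occurrences share one remapped CID.
	fmt.Printf("% X\n", m.stringToCIDs("A😀B😀"))
}
```

A real implementation would also need to avoid collisions with BMP code points that the document actually uses and to handle exhaustion of the CID pool; this sketch does neither.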

emoji.pdf

Changes:
- Change fontDefType.Cw and utf8FontFile.CharWidths from slices to map[int]int to support sparse and high Unicode characters (fixes a crash).
- Update utf8toutf16 to correctly handle 4-byte UTF-8 sequences using surrogate pairs.
- Add UnmarshalJSON to fontDefType for backward compatibility with array-based font definitions.
- Remove hardcoded limit checks for character widths.

Changes:
- Implement CMAP Format 12 parsing in utf8fontfile.go.
- Implement CID remapping in fpdf.go to support characters outside BMP (e.g. Emojis).
- Add runeToCid map to fontDefType.
- Add helper methods stringToCIDs and getOrAssignCID.
- Update Text, CellFormat, and generateCIDFontMap to use CID remapping and correct width lookup.
- Update parseSymbols to use CIDs as keys for GID lookup.
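
For readers unfamiliar with cmap format 12: the subtable has a 16-byte header (format, reserved, length, language, numGroups) followed by 12-byte groups of (startCharCode, endCharCode, startGlyphID), all big-endian. The following is a minimal standalone sketch of decoding it, assuming the subtable bytes are already isolated. `parseCmapFormat12` and `sampleSubtable` are hypothetical names and do not reflect how utf8fontfile.go structures its parser.

```go
package main

import (
	"encoding/binary"
	"errors"
	"fmt"
)

// parseCmapFormat12 decodes a cmap format 12 subtable into a map from
// rune to glyph ID. Hypothetical helper, not the PR's function.
func parseCmapFormat12(b []byte) (map[rune]uint32, error) {
	if len(b) < 16 || binary.BigEndian.Uint16(b) != 12 {
		return nil, errors.New("not a cmap format 12 subtable")
	}
	numGroups := binary.BigEndian.Uint32(b[12:])
	if uint64(len(b)-16)/12 < uint64(numGroups) {
		return nil, errors.New("cmap format 12 subtable is truncated")
	}
	glyphs := make(map[rune]uint32)
	for i := 0; i < int(numGroups); i++ {
		g := b[16+12*i:]
		start := binary.BigEndian.Uint32(g)
		end := binary.BigEndian.Uint32(g[4:])
		gid := binary.BigEndian.Uint32(g[8:])
		if end < start || end > 0x10FFFF {
			return nil, errors.New("invalid character range in group")
		}
		// Glyph IDs are sequential within a group.
		for c := start; c <= end; c++ {
			glyphs[rune(c)] = gid + (c - start)
		}
	}
	return glyphs, nil
}

// sampleSubtable builds a one-group subtable mapping U+1F600..U+1F602
// to glyph IDs 10..12, for demonstration only.
func sampleSubtable() []byte {
	sub := make([]byte, 28)
	binary.BigEndian.PutUint16(sub[0:], 12)       // format
	binary.BigEndian.PutUint32(sub[4:], 28)       // length
	binary.BigEndian.PutUint32(sub[12:], 1)       // numGroups
	binary.BigEndian.PutUint32(sub[16:], 0x1F600) // startCharCode
	binary.BigEndian.PutUint32(sub[20:], 0x1F602) // endCharCode
	binary.BigEndian.PutUint32(sub[24:], 10)      // startGlyphID
	return sub
}

func main() {
	glyphs, err := parseCmapFormat12(sampleSubtable())
	if err != nil {
		panic(err)
	}
	fmt.Println(len(glyphs), glyphs[0x1F601])
}
```

Expanding every group into a map is simple but can be memory-hungry for fonts with large ranges; keeping the groups and binary-searching them is a common alternative.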

avdoseferovic added 2 commits January 16, 2026 16:24

avdoseferovic changed the title ~~Refactor: Support emojis and high unicode characters~~ refactor: support emojis and high unicode characters Jan 16, 2026
