Question 1

What causes mojibake?

Accepted Answer

Mojibake appears when text saved as UTF-8 is later read using a different, single-byte encoding, most often Windows-1252 or ISO-8859-1. UTF-8 stores each non-ASCII character as two or more bytes, and a single-byte decoder turns every one of those bytes into its own character: é (bytes C3 A9 in UTF-8) shows up as the two characters Ã©. CSV imports, database migrations, and copy-paste between mismatched systems are common culprits.
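The mechanism can be reproduced in a few lines. This is a minimal sketch in Python, not the tool's own code; the variable names are illustrative, and `cp1252` is Python's codec name for Windows-1252.

```python
# Reproduce mojibake: encode as UTF-8, then decode with the wrong codec.
original = "café"
utf8_bytes = original.encode("utf-8")    # 'é' becomes the two bytes C3 A9
garbled = utf8_bytes.decode("cp1252")    # each byte is read as one character

print(len(original), len(utf8_bytes))    # 4 5
print(garbled)                           # cafÃ©
```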

Question 2

Will it damage text that's already correct?

Accepted Answer

No. The repair only succeeds when the reversed bytes form valid UTF-8, which genuine mojibake does but correctly encoded text normally does not. Text such as 'café', 'Köln', '한국어', or '日本語' that is already right does not reverse to valid UTF-8, so it is left exactly as it is and the tool reports that no fix was needed.
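A minimal sketch of that check, assuming the reversal is "encode as Windows-1252, decode as UTF-8" (the function name `repair_once` is hypothetical, and the tool's actual implementation may differ). Python's strict `cp1252` codec leaves five byte values undefined, so a real tool may need a more lenient mapping than this sketch uses.

```python
def repair_once(text: str) -> str:
    """Undo one layer of UTF-8-read-as-Windows-1252, or return text unchanged."""
    try:
        # Recover the original bytes, then decode them the way they were meant.
        return text.encode("cp1252").decode("utf-8")
    except UnicodeError:
        # Either the text has characters outside Windows-1252, or the
        # recovered bytes are not valid UTF-8: treat it as already correct.
        return text

print(repair_once("cafÃ©"))    # café
for good in ["café", "Köln", "한국어", "日本語"]:
    print(repair_once(good) == good)    # True each time
```

For 'café' and 'Köln' the recovered bytes are not valid UTF-8, and for '한국어' and '日本語' the characters cannot be encoded as Windows-1252 at all, so both paths end in the `except` branch.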

Question 3

Why does it sometimes apply more than one pass?

Accepted Answer

If text was mis-decoded twice (for example UTF-8 read as Windows-1252, saved, then read as Windows-1252 again), the garbling is layered and one reversal removes only the outer layer. The tool repeats the repair until the text stops changing or no longer reverses to valid UTF-8, and tells you how many passes it used.
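The loop described above can be sketched as follows. This is an illustration under the same encode-as-Windows-1252, decode-as-UTF-8 assumption; the `repair` function and its `max_passes` safety cap are hypothetical, not documented behavior of the tool.

```python
def repair(text: str, max_passes: int = 5) -> tuple[str, int]:
    """Repeat the reversal until it fails or stops changing the text."""
    passes = 0
    while passes < max_passes:
        try:
            candidate = text.encode("cp1252").decode("utf-8")
        except UnicodeError:
            break                  # no longer reverses to valid UTF-8
        if candidate == text:
            break                  # text stopped changing (e.g. plain ASCII)
        text = candidate
        passes += 1
    return text, passes

# Build doubly garbled text by mis-decoding twice.
once = "é".encode("utf-8").decode("cp1252")      # Ã©
twice = once.encode("utf-8").decode("cp1252")    # ÃƒÂ©

print(repair(twice))    # ('é', 2)
print(repair("plain"))  # ('plain', 0)
```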

Question 4

It didn't fix my text — why?

Accepted Answer

Either the text is already correct, or the corruption isn't the common UTF-8-as-Windows-1252 kind (for example it was mis-decoded as Shift_JIS or EUC-KR, or bytes were actually lost). This tool targets the most frequent case; for opening a file in a specific legacy encoding, use a text-encoding converter instead.
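The "bytes were actually lost" case can be illustrated with a short sketch. It assumes the same reversal as above; `try_reverse` is a hypothetical helper that returns `None` when the reversal is impossible.

```python
from typing import Optional

def try_reverse(text: str) -> Optional[str]:
    """Return the repaired text, or None if it cannot be reversed."""
    try:
        return text.encode("cp1252").decode("utf-8")
    except UnicodeError:
        return None

# A lossy decode replaces each undecodable byte with U+FFFD,
# so the original bytes C3 A9 are gone for good.
lossy = "é".encode("utf-8").decode("ascii", errors="replace")
print(try_reverse(lossy))    # None

# Same problem when a byte was swapped for "?" somewhere along the way.
print(try_reverse("Ã?"))     # None
```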

Mojibake Fixer (Repair Garbled UTF-8)

