Instantly convert text between multiple encoding formats: Base64, URL encoding, Hexadecimal (UTF‑8 bytes), HTML entities, JavaScript escape sequences, and more. Fully client‑side, no data leaves your device.
Text encoding is the cornerstone of digital communication. In computing, characters must be represented as bytes. The Unicode standard (UTF‑8, UTF‑16) now dominates, but legacy encodings (ASCII, ISO‑8859‑1) and transfer encodings (Base64, URL encoding, HTML entities) remain essential for web development, APIs, and data storage. This tool provides exact conversions using industry‑standard algorithms.
? Core principle: All conversions rely on Unicode code points → UTF‑8 byte sequences → encoded representation (Base64, hex, percent‑encoding, etc.).
Base64 encodes binary data (or UTF-8 text) into an ASCII string using 64 characters. Used extensively in MIME, JWT, and data URIs. Our implementation converts text → UTF‑8 bytes → Base64, and reverse with robust error handling for non‑ASCII inputs.
Replaces unsafe ASCII characters with %xx hexadecimal codes. Essential for query strings and URL paths. Compliant with RFC 3986: encodes spaces as %20 (not +). Use decode for recovering original text.
Displays each UTF‑8 byte as two hexadecimal digits (0–9, A–F). Useful for inspecting raw byte representation, debugging character encoding issues, or low‑level protocols.
HTML entities (&, <, >, ", ') prevent XSS and ensure correct rendering. JavaScript escape converts non‑ASCII characters to \\uXXXX form, essential for string literals in source code or JSON without Unicode support.
| Character | Unicode (code point) | UTF‑8 bytes (hex) | Base64 encoded | URL encoded | HTML Entity |
|---|---|---|---|---|---|
| A | U+0041 | 41 | QQ== | A | A |
| © | U+00A9 | C2 A9 | wqk= | %C2%A9 | © or © |
| € | U+20AC | E2 82 AC | 4oKs | %E2%82%AC | € |
| ? | U+1F30D | F0 9F 8C 8D | 8J+M | %F0%9F%8C%8D | 🌍 |
Imagine receiving a JSON Web Token (JWT) containing Base64Url segments. Using our tool, paste the token's payload and decode to plain JSON. Similarly, URL‑encoded form data from legacy clients can be instantly decoded for inspection. Professionals from cybersecurity to full‑stack development rely on robust encoding converters to eliminate hidden character corruption.
Before Unicode, dozens of incompatible character encodings (Windows‑1252, Shift‑JIS, KOI8‑R) caused mojibake. UTF‑8, designed by Ken Thompson and Rob Pike, uses variable‑width encoding (1–4 bytes) and backward compatibility with ASCII. Today, over 98% of websites use UTF‑8. Understanding UTF‑8 byte representation is critical when dealing with hexadecimal dumps, database storage, and low‑level networking. Our Hex encoder shows the exact bytes generated by the TextEncoder API, aligning with WHATWG standards.
btoa/atob with UTF‑8 bridging; URL encoding uses encodeURIComponent; Hex uses TextEncoder; HTML entity encoding uses a robust mapping. Every operation is tested against standard vectors and is entirely deterministic.