Twelve individually toggleable cleaning operations, six one-click presets, case transformation, and real-time stats, all running instantly in your browser.
Every operation has its own toggle so you control exactly what gets cleaned. Remove double spaces, line breaks, blank lines, tabs, HTML tags, smart quotes, Unicode symbols, invisible characters, non-ASCII, and more. Each toggle works independently; mix and match to build the exact cleaning pipeline you need.
Don't want to configure each toggle? Choose from six curated presets: General Clean for everyday use, PDF/Word Paste for fixing document copy-paste issues, HTML Strip for removing markup, Single Line for collapsing into one paragraph, ASCII Only for stripping all non-ASCII, and Minimal for a light touch.
As you type or paste, see exactly what was removed: extra spaces, line breaks, tabs, HTML tags, smart quotes, invisible characters, and total characters eliminated. A percentage reduction shows how much smaller the cleaned text is. Every stat updates live as you toggle options on and off.
Detect and strip characters you can't even see: zero-width spaces, zero-width joiners, byte order marks (BOM), soft hyphens, word joiners, and directional formatting characters. These invisible characters commonly sneak into text copied from websites, PDFs, and rich text editors, causing subtle bugs and display issues.
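A detector for these characters can be sketched as a small lookup table scanned per character. The table below covers the characters named above; the names and the exact list are illustrative, not the tool's internals.

```javascript
// Map of invisible code points to human-readable names (illustrative list).
const INVISIBLES = {
  '\u200B': 'zero-width space',
  '\u200D': 'zero-width joiner',
  '\uFEFF': 'byte order mark',
  '\u00AD': 'soft hyphen',
  '\u2060': 'word joiner',
  '\u200E': 'left-to-right mark',
  '\u200F': 'right-to-left mark',
};

// Scan the text and report which invisible characters appear, with counts.
function detectInvisibles(text) {
  const found = {};
  for (const ch of text) {
    if (INVISIBLES[ch]) {
      found[INVISIBLES[ch]] = (found[INVISIBLES[ch]] || 0) + 1;
    }
  }
  return found;
}
```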
Convert text case as part of the cleaning pipeline: lowercase, UPPERCASE, Title Case, or Sentence case. Case transforms are applied after all other cleaning operations, so you get properly formatted output in a single pass. Useful for normalizing headings, names, or bulk-converting pasted content.
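Two of the four transforms are plain string methods; the other two can be sketched with regexes. Assuming "Sentence case" means capitalizing the first letter after each sentence-ending mark, a minimal version looks like this:

```javascript
// Title Case: capitalize the first letter of each word.
function toTitleCase(s) {
  return s.toLowerCase().replace(/\b\w/g, (c) => c.toUpperCase());
}

// Sentence case: capitalize the start of the text and the first letter
// after each ., !, or ? (a simplifying assumption about sentence bounds).
function toSentenceCase(s) {
  return s.toLowerCase().replace(/(^\s*\w|[.!?]\s+\w)/g, (m) => m.toUpperCase());
}
```

lowercase and UPPERCASE are just `s.toLowerCase()` and `s.toUpperCase()`.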
Remove all HTML markup from pasted content: <p>, <div>, <span>, <b>, <a> tags and more. Perfect for converting rich text or web page content to plain text. The tag counter shows exactly how many tags were removed. Combined with the smart quotes fixer, this handles most web-to-plain-text conversion needs.
Messy text is everywhere: PDFs, emails, web pages, documents. Our text cleaner handles them all.
Clean up text copied from PDFs, Google Docs, or Word that comes with random line breaks, double spaces, and smart quotes. Normalize formatting before pasting into a CMS, email, or publishing platform. Remove invisible characters that cause rendering bugs.
Strip HTML tags from scraped content, normalize line endings between Windows (CRLF) and Unix (LF), remove zero-width characters causing parsing errors, and sanitize user input strings. The ASCII-only mode is invaluable for cleaning data for systems that don't support Unicode.
Prepare text data for CSV imports, database entries, or analysis tools. Clean up exported spreadsheet data, remove non-printable characters that break parsers, normalize whitespace, and standardize case for consistent categorization and matching.
Clean text before pasting into email templates, forms, or social media platforms. Remove rich formatting that breaks plain-text emails, fix smart quotes that display as garbage characters in some email clients, and trim excess whitespace from newsletter drafts.
Paste or type text into the input area and the cleaner processes it instantly through a pipeline of up to 13 operations: twelve toggleable cleaners plus an optional case transform. Operations run in a carefully ordered sequence; for example, HTML tags are stripped before spaces are normalized, and case transformation happens last. The cleaned result appears in the output panel in real time, along with stats showing exactly what was removed.
Smart quotes (also called curly quotes) are the typographically correct quotation marks used in published text: “ and ” instead of the straight ". Word processors like Microsoft Word and Google Docs automatically convert straight quotes to smart quotes. While they look better in print, they can cause problems in code, CSV files, databases, command-line tools, and email clients that don't support Unicode. The cleaner also converts em dashes (—) to double hyphens and ellipses (…) to three dots.
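The conversions described above can be sketched as a chain of regex replacements over the relevant Unicode code points:

```javascript
// Convert typographic characters back to their ASCII equivalents.
function fixTypography(text) {
  return text
    .replace(/[\u201C\u201D]/g, '"')  // curly double quotes -> straight "
    .replace(/[\u2018\u2019]/g, "'")  // curly single quotes -> straight '
    .replace(/\u2014/g, '--')         // em dash -> double hyphen
    .replace(/\u2026/g, '...');       // ellipsis character -> three dots
}
```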
Invisible characters are Unicode code points that produce no visible output but are present in the text data. Common examples include zero-width spaces (U+200B), byte order marks (U+FEFF), soft hyphens (U+00AD), zero-width joiners (U+200D), and directional formatting characters. They sneak into text from web pages, PDFs, word processors, and copy-paste operations. They can cause string comparisons to fail, break JSON parsing, interfere with regular expressions, and produce unexpected whitespace in rendered output.
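Because these code points cluster in a few Unicode ranges, removal can be done with a single character-class regex. The exact character list below is a reasonable sketch, not necessarily the tool's complete set:

```javascript
// Strip zero-width characters, directional marks and overrides,
// word joiners, BOMs, and soft hyphens in one pass.
function stripInvisibles(text) {
  return text.replace(/[\u200B-\u200F\u2060\uFEFF\u00AD\u202A-\u202E]/g, '');
}
```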
This option strips any character outside the basic ASCII range (character codes 0-127). This includes accented characters (e.g., é, ü), emojis, CJK characters, mathematical symbols, and all other non-Latin characters. Use this when your target system only accepts ASCII, such as legacy databases, certain APIs, or systems that lack proper Unicode support. Note that this is an aggressive operation; only enable it when you specifically need ASCII-only output.
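Stripping to the 0-127 range is a one-line negated character class:

```javascript
// Remove every character whose code point is above 127 (non-ASCII).
function toAsciiOnly(text) {
  return text.replace(/[^\x00-\x7F]/g, '');
}
```

Note this deletes rather than transliterates: é becomes nothing, not "e", which is why the option is best reserved for systems that truly require ASCII.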
"Remove blank lines" collapses three or more consecutive newlines into two โ so paragraph breaks are preserved but excessive whitespace between paragraphs is eliminated. "Remove all line breaks" is more aggressive: it replaces every newline with a space, merging all text into a single continuous paragraph. Use blank line removal for general cleanup; use full line break removal when you need single-line output (e.g., pasting into a spreadsheet cell or form field).
Different operating systems use different characters to represent line breaks. Windows uses CRLF (carriage return + line feed, \r\n), macOS and Linux use LF (\n), and old Mac systems used CR (\r). This option converts all line endings to the Unix standard LF format. This prevents issues when sharing text files between systems or when pasting text into tools that expect consistent line endings.
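Normalizing all three styles to LF takes two passes: convert CRLF pairs first so the lone-CR pass doesn't double up newlines:

```javascript
// Convert Windows (\r\n) and old Mac (\r) line endings to Unix (\n).
function normalizeLineEndings(text) {
  return text.replace(/\r\n/g, '\n').replace(/\r/g, '\n');
}
```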
Absolutely, and this is one of the most common use cases. Text copied from PDFs often contains random line breaks in the middle of sentences (because the PDF preserved visual line wrapping), extra spaces, non-breaking spaces that look like regular spaces but aren't, and various invisible characters. Use the "PDF / Word Paste" preset for best results: it removes line breaks, strips HTML, fixes invisible characters, and normalizes all spacing in one click.
The HTML stripper uses a regex pattern to match and remove anything between angle brackets (< and >). This handles standard tags like <p>, <div>, <span>, self-closing tags like <br/>, and tags with attributes. It does not decode HTML entities, so an entity such as &amp; remains &amp; in the output. For basic web-to-plain-text conversion, combining HTML stripping with double space removal and blank line cleanup gives excellent results.
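The angle-bracket approach, plus the tag count the stats panel reports, can be sketched as:

```javascript
// Match anything between < and > (tags with attributes, closers,
// self-closing tags). Entities like &amp; are untouched.
const TAG_RE = /<[^>]*>/g;

function stripHtmlTags(text) {
  const tags = text.match(TAG_RE) || [];
  return { cleaned: text.replace(TAG_RE, ''), tagsRemoved: tags.length };
}
```

Because this is a regex rather than a parser, it works well on pasted fragments but is not a general-purpose HTML sanitizer.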
Yes to all three. The cleaner runs entirely in your browser โ your text is never sent to any server. There are no accounts, no cookies, no tracking, and no limits on text length or usage. When you close the tab, everything is gone. Process sensitive documents, confidential emails, or private data with complete confidence that nothing leaves your device.
Every day, billions of text operations happen across the internet: copy-paste, import, export, convert, upload. And in a remarkable number of those operations, something goes wrong with the text formatting. A paragraph copied from a PDF arrives with line breaks in the middle of every sentence. An email draft pasted from Word contains invisible characters that break a database query. A web page's content, scraped for data processing, is littered with HTML tags and typographic characters that downstream systems cannot interpret. Text cleaning is the silent, essential step that makes digital text actually work.
The clipboard is the most used data transfer mechanism in computing, and it is also one of the most problematic. When you copy text from a rich source like a web page, PDF, or word processor, the clipboard captures not just the visible characters but also formatting metadata, invisible control characters, and layout information. When that text is pasted into a plain-text context (a form field, a code editor, a spreadsheet cell), the formatting is stripped but the artifacts remain: non-breaking spaces that behave differently from regular spaces, soft hyphens that appear in some contexts but not others, zero-width characters that are completely invisible but affect string operations.
PDFs are particularly notorious offenders. Because PDF is a visual layout format rather than a text flow format, the text stored in a PDF reflects how it looks on a printed page, not how it should flow as prose. Each visual line in the PDF is stored as a separate text segment, so copying a paragraph produces text with hard line breaks after every 60-80 characters. Sentences are chopped mid-word, hyphenated words carry their hyphens, and paragraph breaks become indistinguishable from line-within-paragraph breaks. A text cleaner that can rejoin these broken lines into proper paragraphs saves enormous manual editing effort.
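A minimal heuristic for rejoining PDF-style hard-wrapped text: treat blank lines as real paragraph breaks, merge hyphenated line-end breaks, and turn the remaining newlines into spaces. Real PDF extraction can need smarter rules; this is only a sketch.

```javascript
function rejoinWrappedLines(text) {
  return text
    .split(/\n{2,}/)                    // blank lines = paragraph breaks
    .map((para) =>
      para
        .replace(/-\n(?=[a-z])/g, '')   // re-join hy-\nphenated words
        .replace(/\n/g, ' '))           // wrap breaks become spaces
    .join('\n\n');
}
```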
Perhaps the most insidious text contamination comes from characters you literally cannot see. The Unicode standard defines dozens of control and formatting characters that produce no visible output but occupy space in the character stream. The zero-width space (U+200B) was designed for indicating possible line break points in languages without word separators, but it frequently appears in text copied from web pages where it was used for CSS styling or accessibility purposes. The byte order mark (U+FEFF) is supposed to appear only at the beginning of a file to indicate encoding, but it often leaks into the middle of text through careless string concatenation or encoding conversion.
These invisible characters cause real problems. They make two visually identical strings fail equality checks. They break JSON and XML parsers that encounter unexpected code points. They produce mysterious whitespace in rendered output. They cause database uniqueness constraints to fail on records that look identical to human eyes. A systematic invisible character scanner and remover is not a convenience; for developers and data professionals, it is a necessity.
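The equality failure is easy to demonstrate: two strings that render identically can still compare unequal when one carries a zero-width character.

```javascript
const a = 'admin';
const b = 'ad\u200Bmin';  // looks identical, contains a zero-width space

const sameLooking = a === b;                          // false
const cleanedEqual = a === b.replace(/\u200B/g, '');  // true after cleaning
```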
Word processors convert straight quotation marks into typographically correct smart quotes because they look better in published documents. But smart quotes are Unicode characters: they exist outside the basic ASCII range. When smart-quoted text enters systems that expect ASCII (older email clients, CSV parsers, command-line tools, some APIs), the result is character encoding errors: the dreaded question marks, diamonds, or garbage characters that appear when a system encounters a code point it cannot interpret.
The same issue affects em dashes, en dashes, and ellipsis characters. Word processors automatically convert two hyphens into an em dash and three periods into an ellipsis character. These conversions improve typography but create compatibility problems in downstream systems. A good text cleaner converts these typographic characters back to their ASCII equivalents (straight quotes, regular hyphens, and three-dot sequences), ensuring maximum compatibility across all platforms and systems.
Copying text from web pages often brings along HTML markup, either visibly (raw tags in the text) or invisibly (formatting that affects behavior). Some email clients and rich text editors preserve HTML tags when you paste web content, resulting in visible angle-bracket markup mixed into your text. Even when tags are not visible, non-breaking spaces (&nbsp;), line breaks (<br>), and other HTML entities can persist in the pasted text, causing spacing anomalies and rendering issues.
Effective text cleaning requires operations to run in the correct order. Our cleaner applies operations in a carefully designed sequence: line ending normalization first (to ensure consistent newline handling), then HTML tag removal, then invisible character removal, then typographic character conversion, then whitespace normalization, and finally case transformation. This ordering ensures that each operation works on the cleanest possible input from the previous step, preventing interactions where one operation undoes or interferes with another.
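The ordered, toggle-controlled pipeline can be sketched as a list of (name, step) pairs run in sequence, where only the enabled steps fire but the order is fixed. The step names and regexes here are illustrative simplifications:

```javascript
// Pipeline in the order described: line endings, HTML, invisibles,
// typography, whitespace, then case transformation last.
const PIPELINE = [
  ['normalizeLineEndings', (t) => t.replace(/\r\n?/g, '\n')],
  ['stripHtml',            (t) => t.replace(/<[^>]*>/g, '')],
  ['removeInvisibles',     (t) => t.replace(/[\u200B-\u200F\uFEFF\u00AD]/g, '')],
  ['fixSmartQuotes',       (t) => t.replace(/[\u201C\u201D]/g, '"').replace(/[\u2018\u2019]/g, "'")],
  ['collapseSpaces',       (t) => t.replace(/ {2,}/g, ' ')],
  ['toLowerCase',          (t) => t.toLowerCase()],
];

// Run only the toggled-on steps, always in pipeline order.
function clean(text, enabled) {
  return PIPELINE.reduce(
    (t, [name, step]) => (enabled.has(name) ? step(t) : t),
    text
  );
}
```

Because ordering lives in the pipeline itself rather than in the toggles, any combination of enabled steps produces predictable output.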
The toggle-based approach gives you full control over this pipeline. Need to remove HTML tags but keep smart quotes? Just toggle HTML stripping on and smart quote conversion off. Want to convert case without changing whitespace? Enable only the case transform. Every combination of toggles is valid and produces predictable, consistent results because the underlying pipeline handles operation ordering automatically.