How do I extract domains from text, emails, and URLs at once?
Paste everything into the box—CRM exports, log lines, markdown, and plain sentences. The scanner finds http(s) links, protocol-relative // URLs, bracketed links, email addresses, and bare hostnames. Results are deduplicated; turn on “Normalize to registrable domain” if you want one root per site (for example api.mail.google.co.uk → google.co.uk).
What is the difference between a hostname and a registrable (root) domain?
A hostname is the full DNS name (www.blog.example.com). The registrable domain is what you could register under a public suffix—often called the eTLD+1 (example.com, example.co.uk). This tool can show hostnames as extracted, then optionally roll subdomains up to that registrable root for reporting and deduplication.
What input formats does the domain extractor support?
The extractor handles mixed text: bare URLs (https://example.com/path), protocol-relative links (//cdn.example.com), markdown links ([text](https://url)), email addresses (user@domain.com), and plain hostnames. Paste logs, CSV exports, HTML source, or any mixed text.
What is registrable-root domain extraction?
Registrable-root mode strips subdomains and returns only the registrable domain—the part you would register with a registrar. For example, blog.company.co.uk becomes company.co.uk (using the Public Suffix List in PSL mode). Useful for summarizing which companies appear in a URL list.
Can I extract domains from email lists?
Yes. Paste email addresses and the extractor returns the domain portion of each address. With deduplication and optional registrable-root mode, you get a unique company-domain list from any export.
How do I export the extracted domains?
Export as CSV (one domain per line, optional count column), JSON array, or plain newline-separated text. CSV counts help prioritize domains that appeared most often in the source.
Does it handle international domains (IDN) and punycode?
Yes when the input is URL-shaped or uses ASCII punycode labels (xn--…). The browser’s URL parser normalizes hostnames like the address bar. Raw Unicode labels only in prose may be missed—prefer pasted links or punycode for full coverage.
Why are some domains missing from my results?
The extractor skips IPv4/IPv6 literals, drops candidates that fail hostname validation, and ignores strings that never look like a URL, email, or dotted hostname. Enable Maximum accuracy (PSL) under Advanced for unusual TLDs and provider domains like github.io.
How accurate is “Fast (curated list)” vs “Maximum accuracy (PSL)”?
Fast mode uses a curated set of multi-part suffixes (e.g. .co.uk, .com.au) for speed and offline use. PSL mode downloads public suffix data and matches the longest registrable suffix—better for edge cases. Both modes stay in your browser.
Are www and non-www treated as the same domain?
By default www is kept as part of the hostname. You can strip www, dedupe www vs non-www, and normalize case from the options.
Is my data uploaded or stored?
No. Parsing, filtering, and export run entirely in your browser tab. Nothing is sent to EverydayTools servers for this tool.
What are the size limits?
Input is capped at roughly 500k characters so tabs stay responsive. Very large pastes may hit candidate or unique-domain caps; the UI warns you to split the file.