MUTools

Heading Extractor

Heading Extractor pulls a page's title, description, and h1–h6 heading structure in one shot. It presents the page's information design and SEO heading hierarchy in a form that is easy to scan.

Each URL is fetched server-side for HTML parsing. Connections to private IPs or localhost are rejected.

Each fetch times out after 8 seconds and reads only the first 2 MB of the HTML body. Tag filters affect only the on-screen view and CSV; SEO diagnostics are evaluated on the unfiltered page state.
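
The fetch limits described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the tool's actual implementation: the names `is_blocked_address`, `read_capped`, `FETCH_TIMEOUT_SECONDS`, and `MAX_BODY_BYTES` are hypothetical, and treating 2 MB as 2 × 1024 × 1024 bytes is an assumption.

```python
import io
import ipaddress

FETCH_TIMEOUT_SECONDS = 8          # per-URL timeout quoted in the text
MAX_BODY_BYTES = 2 * 1024 * 1024   # "first 2 MB"; binary megabytes assumed


def is_blocked_address(ip_text: str) -> bool:
    """Return True for addresses a server-side fetcher should refuse."""
    ip = ipaddress.ip_address(ip_text)
    # Covers private ranges, localhost, and link-local addresses.
    return ip.is_private or ip.is_loopback or ip.is_link_local


def read_capped(stream, limit: int = MAX_BODY_BYTES) -> bytes:
    """Read at most `limit` bytes from a binary stream; the rest is ignored."""
    chunks = []
    remaining = limit
    while remaining > 0:
        chunk = stream.read(min(65536, remaining))
        if not chunk:          # end of stream before the cap was reached
            break
        chunks.append(chunk)
        remaining -= len(chunk)
    return b"".join(chunks)


# Demo with an in-memory stream standing in for an HTTP response body.
body = read_capped(io.BytesIO(b"<html>" + b"x" * 100), limit=16)
print(len(body), is_blocked_address("127.0.0.1"), is_blocked_address("8.8.8.8"))
```

A real fetcher would presumably resolve the hostname first (for example with `socket.getaddrinfo`), apply the check to every resolved address, and pass the timeout to its HTTP client.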

About Heading Extractor

You can process up to 10 URLs per run and download the results as CSV (UTF-8 with BOM). Common SEO problems are diagnosed automatically: "multiple H1s", "level skipped (H2 followed by H4)", "title over 60 characters", "description over 160 characters", and more.
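
The checks named above are simple to express in code. The following is a rough sketch, assuming headings arrive as a list of levels in document order. The function name `diagnose` and the exact message strings are illustrative, and counting length with `len()` (Unicode code points) is an assumption about how the tool measures characters.

```python
from typing import List, Optional

TITLE_MAX = 60          # limits quoted in the text
DESCRIPTION_MAX = 160


def diagnose(title: Optional[str], description: Optional[str],
             levels: List[int]) -> List[str]:
    """Return diagnostic messages for one page.

    `levels` holds heading levels in document order, e.g. [1, 2, 4].
    """
    issues = []
    if not title:
        issues.append("missing title")
    elif len(title) > TITLE_MAX:
        issues.append("title over 60 characters")
    if not description:
        issues.append("missing description")
    elif len(description) > DESCRIPTION_MAX:
        issues.append("description over 160 characters")

    h1_count = levels.count(1)
    if h1_count == 0:
        issues.append("no H1")
    elif h1_count > 1:
        issues.append("multiple H1s")

    # A skip is any step down the hierarchy of more than one level.
    for prev, cur in zip(levels, levels[1:]):
        if cur > prev + 1:
            issues.append(f"level skipped (H{prev} followed by H{cur})")
    return issues


print(diagnose("Pricing", "Plans and prices.", [1, 2, 4]))
# → ['level skipped (H2 followed by H4)']
```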

Useful for auditing site information design, taking stock of the current state before a rewrite, and studying competitor page structures.

How to use

  1. Paste one URL per line into the input (up to 10).
  2. Click "Extract headings". Each URL is analyzed and the results are shown.
  3. The top "SEO diagnosis" section flags H1 issues, level skips, and title / description length.
  4. Use the H1 / H2 / H3 / … checkboxes to filter which heading levels are shown.
  5. Click "Download CSV" to save results for analysis in Excel or Google Sheets.
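
Steps 4 and 5 amount to filtering by heading level and then serializing to CSV with a byte order mark. Here is a minimal sketch of that idea. The column names (`url`, `tag`, `text`) and the function names are assumptions, since the tool's real CSV layout is not documented here.

```python
import csv
import io
from typing import Iterable, List, Set, Tuple

Heading = Tuple[int, str]  # (level, text), e.g. (2, "Pricing")


def filter_levels(headings: Iterable[Heading], shown: Set[int]) -> List[Heading]:
    """Keep only the heading levels whose checkbox is ticked."""
    return [h for h in headings if h[0] in shown]


def to_csv_bytes(url: str, headings: Iterable[Heading]) -> bytes:
    """Serialize headings as CSV, encoded as UTF-8 with a leading BOM."""
    buffer = io.StringIO(newline="")
    writer = csv.writer(buffer)
    writer.writerow(["url", "tag", "text"])  # hypothetical column names
    for level, text in headings:
        writer.writerow([url, f"h{level}", text])
    # "utf-8-sig" prepends the BOM that spreadsheet apps use to detect UTF-8.
    return buffer.getvalue().encode("utf-8-sig")


headings = [(1, "Home"), (2, "Pricing"), (3, "Plans")]
visible = filter_levels(headings, {1, 2})      # H3 unchecked
data = to_csv_bytes("https://example.com/", visible)
print(data[:3])  # the three BOM bytes
```

The filter is applied only to what is displayed and exported; any diagnostics would still run on the unfiltered `headings` list.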

Use cases

  • Web producers auditing every key page's heading hierarchy before a refresh.
  • SEO leads and marketers studying competitor heading structures.
  • Editors and writers listing the current headings of a target article when drafting a rewrite outline.
  • Operators auditing for pages with multiple H1s or skipped heading levels.
  • Web directors building TOC / sitemap-style deliverables from extracted headings.

Notes

  • Up to 10 URLs per request.
  • Connections to private IP addresses or localhost are refused for safety.
  • Each URL has an 8-second fetch timeout, so slow servers may produce an error.
  • Only the first 2 MB of HTML is read, so headings beyond that point on very large pages are missed.
  • Headings inserted by JavaScript (SPAs) are not captured if they are not in the initial HTML.
  • SEO diagnosis runs on the full page (including headings hidden by the checkbox filter).
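
To illustrate why JavaScript-inserted headings are missed, here is a small static parser built on Python's standard `html.parser`. It is a sketch of the general technique, not the tool's own code; the class name `HeadingParser` and the helper `extract` are hypothetical.

```python
from html.parser import HTMLParser
from typing import List, Optional, Tuple


class HeadingParser(HTMLParser):
    """Collect <title>, meta description, and h1-h6 text from static HTML."""

    HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self) -> None:
        super().__init__(convert_charrefs=True)
        self.title: Optional[str] = None
        self.description: Optional[str] = None
        self.headings: List[Tuple[int, str]] = []
        self._capture: Optional[str] = None  # tag whose text is being collected
        self._buffer: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "title" or tag in self.HEADING_TAGS:
            self._capture = tag
            self._buffer = []
        elif tag == "meta":
            attr = dict(attrs)
            if (attr.get("name") or "").lower() == "description":
                self.description = attr.get("content")

    def handle_data(self, data):
        if self._capture is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag != self._capture:
            return  # inline tags such as </em> inside a heading are ignored
        text = " ".join("".join(self._buffer).split())  # collapse whitespace
        if tag == "title":
            self.title = text
        else:
            self.headings.append((int(tag[1]), text))
        self._capture = None


def extract(html: str) -> HeadingParser:
    parser = HeadingParser()
    parser.feed(html)
    parser.close()
    return parser


SAMPLE_HTML = """<html><head><title>Demo page</title>
<meta name="description" content="A demo."></head>
<body><h1>Welcome</h1><h2>Our <em>plans</em></h2>
<script>
var h = document.createElement("h3");
h.textContent = "Added later";
document.body.appendChild(h);
</script>
</body></html>"""

page = extract(SAMPLE_HTML)
print(page.title, page.description, page.headings)
```

The script in the sample would add an H3 in a browser, but the parser never runs it, so only the H1 and H2 present in the initial HTML are collected.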

FAQ

How many URLs can I process at once?
Up to 10. The limit is fixed at 10 to balance concurrent fetch load and UX. Run multiple batches for larger jobs.

What does the SEO diagnosis check?
Missing title or description, zero or multiple H1s, heading level skips (e.g., H2 followed by H4), titles over 60 characters, and descriptions over 160 characters. Findings are categorized as errors (needs fix) and warnings (recommended).

What apps can open the CSV?
UTF-8 (BOM) output opens cleanly without mojibake in Microsoft Excel, Google Sheets, LibreOffice Calc, Numbers, and similar apps. A per-URL CSV and a combined CSV across URLs are both available.

Can I capture headings generated by JavaScript?
No. The tool parses the initial HTML, so headings inserted after JavaScript runs are not captured. Google's crawler executes JavaScript, so its view may differ from these results.

Can I extract headings from pages behind a login?
No. The fetcher has no credentials and treats every page as public, so login-only or session-only pages are not reachable.