commands.sh

htmlq

all

Use CSS selectors to extract content from HTML files.

More info →

Options (6)

-t, --textboolean

Get the text content of the first paragraph

Example: cat {{path/to/file.html}} | htmlq {{[-t|--text]}} 'p:first-of-type'
-a, --attributeboolean

Find all the links in a page

Example: cat {{path/to/file.html}} | htmlq {{[-a|--attribute]}} href 'a'
-r, --remove-nodesboolean

Remove all images and SVGs from a page

Example: cat {{path/to/file.html}} | htmlq {{[-r|--remove-nodes]}} 'img' {{[-r|--remove-nodes]}} 'svg'
-p, --prettyboolean

Pretty print and write the output to a file

Example: htmlq {{[-p|--pretty]}} {{[-f|--filename]}} {{path/to/input.html}} {{[-o|--output]}} {{path/to/output.html}}
-f, --filenameboolean

Pretty print and write the output to a file

Example: htmlq {{[-p|--pretty]}} {{[-f|--filename]}} {{path/to/input.html}} {{[-o|--output]}} {{path/to/output.html}}
-o, --outputboolean

Pretty print and write the output to a file

Example: htmlq {{[-p|--pretty]}} {{[-f|--filename]}} {{path/to/input.html}} {{[-o|--output]}} {{path/to/output.html}}

Examples (5)

Return all elements of class `card`

cat path/to/file.html | htmlq '.card'

Get the text content of the first paragraph

cat path/to/file.html | htmlq [-t|--text] 'p:first-of-type'

Find all the links in a page

cat path/to/file.html | htmlq [-a|--attribute] href 'a'

Remove all images and SVGs from a page

cat path/to/file.html | htmlq [-r|--remove-nodes] 'img' [-r|--remove-nodes] 'svg'

Pretty print and write the output to a file

htmlq [-p|--pretty] [-f|--filename] path/to/input.html [-o|--output] path/to/output.html
made by @shridhargupta | data from tldr-pages