clean-html.sh - bash script to clean HTML.

1) Convert the stored pages in UTF-8.
2) defecating saved pages of extra spaces, tabs,
     blank lines, scripts, images, meta-information.

     PS: when an <pre> produces limited filtering!

! Not all characters can be transcoding UTF-8. Be careful.

Project Samples

Project Activity

See All Activity >

Categories

HTML/XHTML

License

GNU General Public License version 2.0 (GPLv2)

Follow clean-html-sh

clean-html-sh Web Site

Other Useful Business Software
Enterprise AI Search, Intranet, and Wiki in one platform. Icon
Enterprise AI Search, Intranet, and Wiki in one platform.

Your company’s all-in-one solution for trusted information

Cut through the noise and end information overload with Guru, an all-in-one wiki, intranet, and knowledge base that serves as your company's single source of truth.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of clean-html-sh!

Additional Project Details

Operating Systems

Linux

Languages

Russian

User Interface

Console/Terminal

Programming Language

Unix Shell

Related Categories

Unix Shell HTML XHTML

Registered

2012-09-26