Microsoft
Software
Hardware
Network
Question : Bash scripts: how to strip these html tags
Hi experts,
Here is a html file. How can I print the html tags and "common words" such as "in", "the", "a" into one file "tag_file.txt"; and print others into another file "content_file.txt"?
I use Bash scripts. I am a newbie in Bash field. Any help is highly appreciated!
For example: in the code snippt:
All the html tags such as "
" and "the" are all printed to "tag_file.txt" whereas the rest should be printed to "content_file.txt"
Code Snippet:
1: 2: 3: 4: 5: 6: 7: 8: 9: 10: 11: 12: 13:
3019337
story
the element is +
-
Open in New Window
Select All
Answer : Bash scripts: how to strip these html tags
Do you mean like this?
tr ' >' '\n' < file | grep '<\|\<\(in\|the\|a\)\>' > tag_file.txt
Random Solutions
Monitor recursive a directory for new files
How can i save my bookmarks from (firefox and explorer) to a file in order to restore them on another system.
Customise toolbar through Group Policy + Server 2003
Wake On Lan, Red Hat Linux 8.0
Vision Slogan
Search Engine Optimization Software
SQL select query syntax, attempting to pass a integer variable from vb.net to a sql string select command
concatenate string of %variable% to SQL from VB
I have an Excel spreadsheet that is too large
Excel Table Synchronization- Why does this code not work?