Interesting, you can remove everyhing between bracktes (<>). It works with html and I think it handle xml but didn’t test that for my homework. sed ‘s/< [^>]*//g’ It finds the first < character and then deletes all text up to and including the next > character. Interesting. You could rehtml stuff but it might take several different scripts. There may be better ways to rehtml stuff.
4/29/2009 ~ 1 min read