By the way, you may want to look at the BeautifulSoup Python library for working with html files (and extracting text from them).
I agree with this,but now it look like boiishuvo will destroy the stucture of html.
Should it replace like this or keep <> intact?
>>> s = '<html>'
>>> s.replace('<html>', '***')
'***'
Something like this with regex.
import re
html = '''\
<html>
<head>
<title></title>
</head>
<body>
</body>
</html>'''
print re.sub(r'<.*>', '****', html)
"""Output-->
****
****
****
****
****
****
****
"""