Regular Expressions

Question

tiddster 0 Newbie Poster

14 Years Ago

I am using regular expressions to look for certain keywords in a string. At the moment I have got:

import re
thisText = "<meta http-equiv='content-type' content='text/html; cHarSet=gBk' />"

n = re.compile(r'\b\s*charset=gbk[a-z]*', re.IGNORECASE|re.VERBOSE)
print(n.findall(thisText))

This lets me find the string regardless of the case of the text. I also want it to ignore whitespace around the equals sign, i.e. if the text I am searching contains charset = gbk, I want it to match just as charset=gbk does. Is there any way of doing this?

python

snippsat 661 Master Poster · Answer 1 · 2010-07-26T22:02:27+00:00

Something like this maybe.

import re

html = '''\
"<meta http-equiv='content-type' content='text/html; cHarSet=gBk' />"
'''

# Capture the word after "cHarSet=" (case-sensitive, no spaces around "=")
test_match = re.search(r'cHarSet=(\w*)', html)
print(test_match.group())
#-->cHarSet=gBk
print(test_match.group(1))
#-->gBk
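
To also cover the whitespace and case parts of the question, here is a minimal sketch, not the only way to do it. It assumes the whitespace to ignore is the optional whitespace on either side of the "=" sign, and it puts \s* there. Note that re.VERBOSE only makes the regex engine ignore whitespace written inside the pattern itself; it does not make the pattern skip whitespace in the text being searched, so it is left out here. The names charset_re and find_charset are made up for this example, and the [\w-]+ class is an assumption so that values such as utf-8 are captured whole.

```python
import re

# Sample input with spaces around "=" (adapted from the question).
text = "<meta http-equiv='content-type' content='text/html; cHarSet = gBk' />"

# \s* on each side of "=" allows zero or more whitespace characters there.
# re.IGNORECASE makes "charset" match regardless of case.
# [\w-]+ captures the value, including hyphenated names (an assumption).
charset_re = re.compile(r'charset\s*=\s*([\w-]+)', re.IGNORECASE)


def find_charset(s):
    """Return the charset value found in s, or None if there is none."""
    match = charset_re.search(s)
    return match.group(1) if match else None


print(find_charset(text))
print(find_charset("charset=gbk"))
```

The captured value keeps whatever case the input used, so call .lower() on the result if you want to compare it against a fixed name such as "gbk".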