Halp: Using Regular Expressions

Question

SuperMetroid 0 Newbie Poster

15 Years Ago

Here is the code I have so far:

import re
from urllib.request import urlopen

pg = urlopen('http://www.url.com')
pg_r = pg.read()
pg.close()
print(pg_r)
print(re.search('(?<=http://user.url.com/)\w+', pg_r))

What I *want* to do with re.search is find the word between the strings 'http://user.url.com/' and '"' but I'm not sure how to do that..
Can anyone help me with the regular expression for that?

python

3 Contributors
3 Replies
138 Views
23 Hours Discussion Span
Latest Post 15 Years Ago Latest Post by Ene Uran

All 3 Replies

djidjadji 28 Light Poster

15 Years Ago

re.search() returns a MatchObejct, not the string that matches the regex.
Look up in the docs what you can do with it to get the match string.

A better regex to use is

matchObj = re.search('(?<=http://user.url.com/)[^"]+', pg_r)
if matchObj:
    pass # found a match

SuperMetroid commented: Thanks for helping me with regular expressions! o.o +1

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

SuperMetroid 0 Newbie Poster · Answer 1 · 2009-09-27T08:01:28+00:00

Thank you so much! Problem solved. Also, thanks for the insight into what type is returned.

Ene Uran 638 Posting Virtuoso · Answer 2 · 2009-09-27T22:37:22+00:00

Also in Python3 'pg_r' would be a byte string and module re will complain without converting 'pg_r' to a string first.

Halp: Using Regular Expressions

Recommended Answers Collapse Answers

All 3 Replies

Recommended Answers