Extract href attribute and value from <a> tag

Question

codedhands 0 Light Poster

16 Years Ago

Hi,am trying to search and extract the text

href="http://www.yahoo.com"

from a string
<a href="http://www.yahoo.com" id="link1">.Here is my code:

import re

p=re.compile(r'\b(href="(.*)"){1}\b')
m=p.search('<a href="live.net" link="go2">')
print m.group()

#Prints: href="live.net" link="

The code above

Prints: href="live.net" link="

,but i want to the href="live.net"
I need help on this please

python

3 Contributors
3 Replies
126 Views
17 Hours Discussion Span
Latest Post 16 Years Ago Latest Post by jice

All 3 Replies

sneekula 969 Nearly a Posting Maven

16 Years Ago

p = re.compile(r'(href="(.*?)")')

will do the trick

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

codedhands 0 Light Poster · Answer 1 · 2008-12-17T06:20:42+00:00

codedhands 0 Light Poster

16 Years Ago

Thanks you so much.

jice 53 Posting Whiz in Training · Answer 2 · 2008-12-17T14:53:31+00:00

You should use "*?" instead of "*" : * will take the longest string that match the pattern while *? will take the shortest.
p=re.compile(r'\b(href="(.*?)"){1}\b')

Extract href attribute and value from <a> tag

Recommended Answers Collapse Answers

All 3 Replies

Recommended Answers