HELP! Return URLs in redirect chain

Question

LB22 0 Newbie Poster

15 Years Ago

Hi,

I'm stuck with a certain problem. I'm using URLLIB2 to get the end url of a list of links. This was pretty straightforward. Some of the links I'm probing pass through 1 or more other urls before landing the user at the end destination.

For example, the start url might be 1, but then you only end up at 3 after a quick and probably unnoticeable redirect to 2:

1. http://click.someurl.net/script?query=somedata
2. http://bridge.url.net/redirect?url=http://thenewurl.com
3. http://thenewurl.com

In my code url 1 is passed to my function, and url 3 is returned, but how do I capture/return any URL in between the start and end points?

Thanks in advance for any help!

LB

http-protocol python

2 Contributors
1 Reply
151 Views
1 Week Discussion Span
Latest Post 15 Years Ago Latest Post by pythopian

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

pythopian 10 Junior Poster in Training · Answer 1 · 2009-11-11T21:00:31+00:00

Your post does not provide many details, but I think what you are looking for is a generator function., Instead of returning an URL, yield each URL, like this:

def generateURLs(startingURL):
    url = startingURL
    ...
    while url:
        yield url
        url = getNextURL(url)
...
for url in generateURLs(http://click.someurl.net/script?query=somedata):
    print url