urllib.request.urlopen (python 3)

Question

scru 909 Posting Virtuoso

16 Years Ago

I noticed a weird thing happening when i use this function. If the page i open is in latin-1 encoding, the bytes returned by this function would have some weird junk characters inserted in various places.

However, if i use urlretrieve to fetch the page to disk, there is no junk in the resulting file.

Ideas?

EDIT: I decoded the bytes returned by urllib.request.urlopen with the latin-1 encoding and saved it to a file for comparision; this is how i know the junk data is there.

python

1 Contributor
1 Reply
210 Views
6 Hours Discussion Span
Latest Post 16 Years Ago Latest Post by scru

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

scru 909 Posting Virtuoso Featured Poster · Answer 1 · 2009-01-29T00:38:41+00:00

Never mind, it's a bug in python 3: http://bugs.python.org/issue4631