I've been playing around with this library called 'mechanize' for automated form-filling ( http://wwwsearch.sourceforge.net/mechanize/ ).

Now, I've tried this library across a number of websites, but although it works on some, on others I get an error saying -
mechanize._html.ParseError: ParseError(ParseError(SGMLParseError("unexpected char 'u' in internal subset",),),)

I also tried the offending websites with another similar librarty called ClientForm:

from ClientForm import ParseResponse
from urllib2 import urlopen

response = urlopen("http://www.bairstoweves.co.uk/")
forms = ParseResponse(response, backwards_compat=False)
form = forms[0]
print urlopen(form.click()).read()

-- same error

can anyone tell me what this error means and how I can fix it? as there is no proper documentation out there for these libraries besides the docstrings (not very helpful)

Looks like the library you have does not know how to handle unicode characters.

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.