Question : Python, how can I parse a document object?

In the code below, tidy_page is a document object. How do I save it to a file or read it into something else?

Cheers,

Code Snippet:

import tidy
import urllib2

# Options passed through to tidy
options = dict(output_xhtml=0, add_xml_decl=0, indent=0, tidy_mark=0)

# Download the page to be cleaned up
f = urllib2.urlopen("http://pcs.essex.ac.uk/")
page = f.read()
f.close()

tidy_page = tidy.parseString(page, **options)


Answer : Python, how can I parse a document object?

Pass the document object to str() to convert it to a string, which you can then write to a file:

tidy_page = tidy.parseString(page, **options)
out = open('output.html', 'w')  # save to file 'output.html'
out.write(str(tidy_page))
out.close()
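
The question also asks about reading the result into something else. Here is a rough sketch of both steps in current Python, relying only on the fact that str() works on the document object. FakeDocument is a hypothetical stand-in for whatever tidy.parseString() returns, so the example runs without tidy installed. TitleParser, save_document and extract_title are illustrative names too, not part of any library.

```python
import os
import tempfile
from html.parser import HTMLParser


class FakeDocument(object):
    """Hypothetical stand-in for the object returned by tidy.parseString()."""

    def __init__(self, markup):
        self._markup = markup

    def __str__(self):
        return self._markup


class TitleParser(HTMLParser):
    """Collects the text found inside <title> elements."""

    def __init__(self):
        HTMLParser.__init__(self)
        self.in_title = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.title += data


def save_document(doc, path):
    """Write the string form of doc to path and return that string."""
    text = str(doc)
    with open(path, "w") as out:
        out.write(text)
    return text


def extract_title(doc):
    """Feed the string form of doc to another parser and return the title."""
    parser = TitleParser()
    parser.feed(str(doc))
    parser.close()
    return parser.title


if __name__ == "__main__":
    doc = FakeDocument("<html><head><title>Demo</title></head><body></body></html>")
    target = os.path.join(tempfile.mkdtemp(), "output.html")
    save_document(doc, target)
    print(extract_title(doc))
```

With the real library, you would pass tidy_page in place of the FakeDocument instance. Once you have the string, any code that accepts HTML text can consume it.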

