how to get display content from html file?

I want to get the text content from a html.

exactly I am tring to chansfer a *.html to a *.txt.

get rid of <B>...</B> etc such control characters.

Read the html file into a buffer of some kind, then search and replace tags as you see fit. What kind of code have you written to do this? We could offer more guidance if we saw the way you were doing it.
can you show me some codes?

I am a beginner.I often open file using ifsstream ifs("test.html").

I often use getline(ifs,str) to get content.do you think it is efficient?

also I don't know hot to replace and delete some characters.
Last edited on
Topic archived. No new replies allowed.