How to scrape a page without knowing its HTML structure?

Hi,

I'm building a module for my site that needs to import a user's blog. I can read the blog through its RSS feed, but the feed doesn't give me all the information, so I need to scrape the blog page itself. How can I scrape a page without knowing its HTML structure in advance? Can anyone help me with this? Thanks in advance.
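
For illustration, here is a minimal sketch of one generic approach: a text-density heuristic that does not rely on any site-specific class names or ids. It assumes the post body is the block-level container holding the most direct text, which will not hold for every blog. The names `MainTextFinder` and `extract_main_text`, and the tag sets, are hypothetical choices for this example; only Python's standard `html.parser` module is used.

```python
from html.parser import HTMLParser

# Assumed tag sets for this sketch; adjust them for the pages you actually see.
CONTAINERS = {"div", "article", "section", "main", "td"}
SKIP = {"script", "style", "nav", "header", "footer", "aside"}


class MainTextFinder(HTMLParser):
    """Track the container whose directly owned text is the longest."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.stack = []        # one list of text chunks per open container
        self.skip_depth = 0    # > 0 while inside a tag listed in SKIP
        self.best = ""         # longest container text seen so far

    def handle_starttag(self, tag, attrs):
        if tag in SKIP:
            self.skip_depth += 1
        elif tag in CONTAINERS:
            self.stack.append([])

    def handle_endtag(self, tag):
        if tag in SKIP:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif tag in CONTAINERS and self.stack:
            # Close the innermost container and compare it with the best so far.
            text = " ".join(self.stack.pop())
            if len(text) > len(self.best):
                self.best = text

    def handle_data(self, data):
        chunk = " ".join(data.split())  # collapse runs of whitespace
        if chunk and not self.skip_depth and self.stack:
            # Text is credited only to the innermost open container.
            self.stack[-1].append(chunk)


def extract_main_text(html):
    """Return the text of the container with the most direct text."""
    finder = MainTextFinder()
    finder.feed(html)
    finder.close()
    return finder.best
```

Calling `extract_main_text(page_source)` on a downloaded page returns a plain-text guess at the post body. Because text is credited only to the innermost container, menus and sidebars with a few short links tend to lose to the block that holds the article paragraphs, but badly nested markup or posts split across many sibling containers can defeat the heuristic.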