Extract Data From Website


Brenton S.
May 18th, 2006, 06:14 AM
Hi all,

I need to programmatically extract data from a website, e.g. www.myspace.com. The question is, how would I go about doing this? All responses will be appreciated.

Marc G
May 18th, 2006, 06:33 AM
You can just use the HTTP protocol. See www.codeguru.com for some articles and wrappers for HTTP.
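
To make "use the HTTP protocol" a bit more concrete, here is a minimal sketch of the text a client sends to ask for a page. The function name `buildGetRequest` is made up for this illustration, and the code only builds the request string; opening the socket and reading the response is what the wrappers mentioned above would do for you.

```cpp
#include <string>

// Builds the text of a minimal HTTP/1.1 GET request.
// `host` and `path` are supplied by the caller; nothing here touches the network.
std::string buildGetRequest(const std::string& host, const std::string& path) {
    std::string request;
    // Request line: method, resource path (default to "/"), protocol version.
    request += "GET " + (path.empty() ? std::string("/") : path) + " HTTP/1.1\r\n";
    // HTTP/1.1 requires a Host header.
    request += "Host: " + host + "\r\n";
    // Ask the server to close the connection after the response.
    request += "Connection: close\r\n";
    // An empty line ends the header section.
    request += "\r\n";
    return request;
}
```

Sending this string over a TCP connection to port 80 of the host should get back a status line, headers, and the page body, which is the data you would then parse.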

humptydumpty
May 18th, 2006, 11:49 AM
There is no need to do that; you can simply use the function URLDownloadToFile().
This function downloads bits from the Internet and saves them to a file. You can then do whatever you want with the file.
Thank you
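
A minimal sketch of that call, assuming a Windows build linked against urlmon.lib. The wrapper name `downloadToFile` is hypothetical, and the ANSI variant URLDownloadToFileA is used only to keep the example short. On other platforms the function simply reports failure so the snippet still compiles.

```cpp
#include <string>

#ifdef _WIN32
#include <windows.h>
#include <urlmon.h>
#pragma comment(lib, "urlmon.lib")  // MSVC-specific; other toolchains must link urlmon explicitly
#endif

// Downloads `url` and saves it as `localPath`. Returns true on success.
// Hypothetical helper for illustration; no progress callback is used.
bool downloadToFile(const std::string& url, const std::string& localPath) {
#ifdef _WIN32
    // Arguments: caller (none), URL, target file, reserved (must be 0), status callback (none).
    HRESULT hr = URLDownloadToFileA(nullptr, url.c_str(), localPath.c_str(), 0, nullptr);
    return SUCCEEDED(hr);
#else
    (void)url;
    (void)localPath;
    return false;  // URLDownloadToFile is only available on Windows
#endif
}
```

A call such as `downloadToFile("http://www.example.com/", "page.html")` would leave the page in page.html, which can then be opened and parsed like any other file.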

golanshahar
May 18th, 2006, 12:41 PM
You are right, but you can only use it when you know the URL and the exact file name. If you want to develop a web scraper, you also need to get a list of the files in order to download them. :wave:
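
One way to get such a list is to pull the link targets out of a page you have already downloaded. The sketch below is a deliberately naive text search, not a real HTML parser: the name `extractLinks` is made up, and it only sees lower-case, double-quoted href attributes.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Returns the values of all double-quoted href attributes found in `html`.
// Naive and case-sensitive: single-quoted or unquoted attributes, comments,
// and scripts are not handled.
std::vector<std::string> extractLinks(const std::string& html) {
    static const std::string marker = "href=\"";
    std::vector<std::string> links;
    std::size_t pos = 0;
    while ((pos = html.find(marker, pos)) != std::string::npos) {
        const std::size_t start = pos + marker.size();
        const std::size_t end = html.find('"', start);
        if (end == std::string::npos) {
            break;  // unterminated attribute, stop scanning
        }
        links.push_back(html.substr(start, end - start));
        pos = end + 1;
    }
    return links;
}
```

Each returned string is a candidate file to download next. Relative links would still have to be resolved against the page's own URL.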

Cheers

humptydumpty
May 18th, 2006, 01:18 PM
Absolutely right, but in that case you are going to get all the URLs either from the company for whom you are developing the product or from some other source. You are not going to download an arbitrary number of files or anything like that. We have done a project like this: all the URLs were provided by the product manufacturer, along the lines of "my product will work with these URLs". Or, if he doesn't want to be specific to one URL, then there is nothing extra to do: simply use whatever is in the address bar.

Thank you

golanshahar
May 18th, 2006, 02:00 PM
Ah, that depends on what you need to do. Suppose you want to download a site to your local computer (I think this is what the OP wants); you don’t expect the webmaster of the site, or the company the site belongs to, to give you all the links, right? :D

Then you should use other methods to download it :cool: and ::URLDownloadToFile() can't help in finding the unknown pages ;)
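
As a rough sketch of what "other methods" could look like, the unknown pages can be discovered by following links breadth-first from a start page. Everything named here (`crawl`, `findHrefs`, the `fetch` callback, `maxPages`) is invented for the illustration. The sketch assumes every link is an absolute URL and leaves the actual download to whatever `fetch` does, which could be a call to ::URLDownloadToFile() followed by reading the file.

```cpp
#include <cstddef>
#include <functional>
#include <queue>
#include <set>
#include <string>
#include <vector>

// Collects double-quoted href values from `html` (naive text search, not a parser).
static std::vector<std::string> findHrefs(const std::string& html) {
    static const std::string marker = "href=\"";
    std::vector<std::string> links;
    std::size_t pos = 0;
    while ((pos = html.find(marker, pos)) != std::string::npos) {
        const std::size_t start = pos + marker.size();
        const std::size_t end = html.find('"', start);
        if (end == std::string::npos) break;
        links.push_back(html.substr(start, end - start));
        pos = end + 1;
    }
    return links;
}

// Breadth-first crawl from `startUrl`, visiting at most `maxPages` pages.
// `fetch` returns the body of a page, or an empty string on failure.
// Returns the URLs in the order they were visited.
std::vector<std::string> crawl(const std::string& startUrl,
                               const std::function<std::string(const std::string&)>& fetch,
                               std::size_t maxPages) {
    std::vector<std::string> visitedOrder;
    std::set<std::string> seen;        // URLs already queued, to avoid loops
    std::queue<std::string> pending;   // URLs waiting to be fetched
    seen.insert(startUrl);
    pending.push(startUrl);
    while (!pending.empty() && visitedOrder.size() < maxPages) {
        const std::string url = pending.front();
        pending.pop();
        visitedOrder.push_back(url);
        const std::string body = fetch(url);
        const std::vector<std::string> links = findHrefs(body);
        for (const std::string& link : links) {
            if (seen.insert(link).second) {
                pending.push(link);    // first time this URL is seen
            }
        }
    }
    return visitedOrder;
}
```

The `seen` set is what keeps two pages that link to each other from being downloaded forever, and `maxPages` is a simple safety limit. A real site downloader would also need to resolve relative links and stay within the target site.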


Cheers

Marc G
May 19th, 2006, 12:06 PM
humptydumpty: you are right that URLDownloadToFile() does the job, but technically, URLDownloadToFile is also a wrapper for the HTTP protocol :wave: