Screen Scraping with Perl to get the top 25 Websites and the Web Servers they use


This was a really fun little script I wrote that I thought I would share with everyone. I basically wrote a Perl script using the popular LWP Perl module to screen scrape and obtain the top 25 sites and the web servers they use. Once I screen scrape to get the relevant website, I am able to send a specific request to the server to obtain further information.

Now, I am not saying I ran this script, and, I am not responsible in any way for how you may use this script. Further, you should be aware that some sites may not allow you to use automated processes to capture and use their information; this could be a possible violation of copyright or intellectual property laws. So now that you have been warned, below I display the code I wrote and also what the output would look like if you were to run this script. You can download the full source code from the download section.

Perl Screen Scraping Code

Perl Screen Scraping Code


Output:
Perl Screen Scraping Code Output

Perl Screen Scraping Code Output


Twitt

Tags: , , , , , , , , , , , , , , , , , , , , , , , ,

3 Responses to “Screen Scraping with Perl to get the top 25 Websites and the Web Servers they use”

  1. open source virtualization…

    [...]Humair’s Blogs » Blog Archive » Screen Scraping with Perl to get the top 25 Websites and the Web Servers they use[...]…

  2. Hey we’re Hacksomeone and we would like to give you a great new hack! Gain COD3 Call of duty 3 for free…

    [...]Humair’s Blogs » Blog Archive » Screen Scraping with Perl to get the top 25 Websites and the Web Servers they use[...]…

  3. Welcome we’re Hacksomeone and we could want to present you some brand new hacks! Hacksomeone is an every day upgraded web space who looks the virtual world for you! We’re not love most hacksites sharp at one sort of hack. We actually hunt all around …

    [...]Humair’s Blogs » Blog Archive » Screen Scraping with Perl to get the top 25 Websites and the Web Servers they use[...]…

Leave a Reply

*


eight × 3 =