Archive for the ‘Data Mining’ Category

Screen Scraping with Perl to get the top 25 Websites and the Web Servers they use

Friday, October 21st, 2011

This is a fun little script that I thought I would share. It is a Perl script that uses the popular LWP module to screen scrape a list of the top 25 websites and find out which web servers they use. Once the scrape has given me a website's address, I send a specific request to that server to obtain further information about it. (more…)
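The "specific request" step can be sketched roughly as follows. This is not the Perl/LWP script from the post; it is a minimal Python illustration that assumes the request is an HTTP HEAD request and that the "further information" is the `Server` response header. The function names `server_from_headers` and `fetch_server` are hypothetical, and how the top 25 list itself is scraped is left out because the excerpt does not say where it comes from.

```python
from urllib.request import Request, urlopen


def server_from_headers(raw_headers):
    """Return the Server header value from a raw HTTP header block.

    Returns "unknown" when the server did not send a Server header.
    """
    # The first line is the status line (e.g. "HTTP/1.1 200 OK"), so skip it.
    for line in raw_headers.splitlines()[1:]:
        name, sep, value = line.partition(":")
        # Header names are case-insensitive.
        if sep and name.strip().lower() == "server":
            return value.strip()
    return "unknown"


def fetch_server(host, timeout=10):
    """Send a HEAD request to the host and return its Server header.

    Assumption: plain HTTP on the default port. urlopen raises an
    exception on network failures and on 4xx/5xx responses.
    """
    request = Request(f"http://{host}/", method="HEAD")
    with urlopen(request, timeout=timeout) as response:
        return response.headers.get("Server", "unknown")
```

Calling `fetch_server("www.example.com")` would perform the live request. Many sites hide or rewrite the `Server` header, so the value is only as reliable as the site chooses to make it.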


Facebook Hack – How “shared” is your website on Facebook?

Wednesday, May 25th, 2011

In this post I describe a cool little data mining hack you can use to determine how “shared” a specific website is on Facebook. Let’s say you want to determine how often the “http://www.cnet.com” site has been “shared” on Facebook. Simply type the following into your browser’s address bar: “http://graph.facebook.com/http://cnet.com”. The syntax for checking any site is “http://graph.facebook.com/[website_address]”. See the snapshots below for examples. Have fun! (more…)
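The same lookup can be scripted instead of typed into a browser. The Python sketch below only builds the URL in the syntax described above and, optionally, fetches it. The function names are hypothetical, and it is an assumption that the endpoint answers unauthenticated requests with JSON; the code does not assume any particular field names in the reply.

```python
import json
from urllib.request import urlopen

GRAPH_BASE = "http://graph.facebook.com/"


def graph_share_url(website_address):
    """Build the lookup URL: http://graph.facebook.com/[website_address]."""
    return GRAPH_BASE + website_address


def fetch_share_info(website_address, timeout=10):
    """Fetch the lookup URL and return the decoded JSON reply.

    Assumption: the endpoint replies with a JSON document. No field
    names are assumed here, so the caller inspects the result.
    """
    with urlopen(graph_share_url(website_address), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

For example, `graph_share_url("http://cnet.com")` gives the address quoted in the post, and `fetch_share_info("http://cnet.com")` would perform the live request.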
