Projects
May 3rd, 2005
This script pulls down the current list of events on the Bug Jar web site - can be manipulated later to fit into an RSS feed or iCal Calendar (for example).
This script utilizes the excellent Webscrape utility.
Downloads
Tags: scrape, events, perl
Comment on 'Bug Jar Event Scraper'
May 3rd, 2005
This script pulls down the current list of events on the George Eastman House web site - can be manipulated later to fit into an RSS feed or iCal Calendar (for example).
This script utilizes the excellent Webscrape utility.
Downloads
Tags: scrape, events, films, perl
Comment on 'Eastman House Event Scraper'
February 10th, 2005
This application scrapes the excellent anonymous proxy list available at Proxy 4 Free (I use to use Stay Invisible), filters them, and renders them appropriately for most applications. For example, the SwitchProxy extension to Firefox plays nicely with this.
I've made a copy of the program available for all to use, to which the query string options are:
- c = County - The country by which you want to filter the proxies by, defaults to 'United States'.
- t = Type - The type of proxies that you want, leave blank for all. Defaults to 'anon'.
Updated 2005-02-10: Requests are now cached daily. I was afraid that too many requests might upset the server owners, whom provide such an excellent service.
Updated 2005-11-16: Stay Invisible removed their free proxy lists, so I'm now using a different service that seems comparable.
Links
Downloads
Tags: perl, scrape, proxy, popular
119 Comments on 'Anonymous Proxy List'
January 12th, 2005
There was a post the other day to Boing Boing concerning the fact that Ken Lay is paying to have links to his defense placed on Google and Yahoo. A friend of mine wrote a brief example of how to automatically click the links using Perl and I thought I could probably do better. Below is my attempt at it - uses a proper User Agent, proper referrers and bounces through multiple, anonymous, proxies to get the job done. Thank goodness for WWW::Mechanize.
Downloads
Tags: corp:google, corp:yahoo, humor, perl, scrape, search
Comment on 'Ken Lay Clicker'
October 1st, 2004
The script scrapes the Yahoo TV web site and converts the results into an RSS feed. You can set the station, channel, and duration in the future.
Additionally, I've set up the script, publically, so that anyone can use it. It can be found a the URL below. The configurable options are:
- s=STATION_ID - This is provided by Yahoo, it will look something like this: us_NY31666d.
- c=CHANNEL_ID - This is also provided by Yahoo, and will look something like this: us_COMEDY
- d=DURATION - This is how far in the future you would like the feed to grab, in hours. Defaults to 1.
Links
Downloads
Tags: convertor, perl, rss, scrape, tv
Comment on 'TV2RSS'
September 14th, 2004
This script and module are designed to scrape the course section information made available on the RIT Schedule of Courses. The module is a proper Object-Oriented Perl Module which can be easily ported from one application to another. This script is used for the current version of the Schedule Maker.
Downloads
Tags: module, perl, rit, scrape
Comment on 'RIT::Courses'
September 6th, 2004
This is a partial mirror of one of my favorite Internet pastimes: lowbrow.com. This mirror of over 15,000 entries was made over the course of a night, until I got in trouble by their ISP. Nonetheless, here's the results - plus with the ability to link to a specific post, something the official one lacks.
Link
Tags: db, web, humor, perl, mysql, scrape
Comment on 'Lowbrow Mirror'
September 5th, 2004
This is the script that I wrote to retreive entries from lowbrow.com, one of my favorite Internet pastimes.
The only unmentioned point is that there should be some sort of 'Unique' restriction placed on the database to keep duplicate entries from being entered. The only way to access entries on lowbrow is by randomly retreiving on from the home page - which means lots of duplicates, and it becomes progressively harder to get new entries as you progress.
Downloads
Tags: humor, perl, scrape
Comment on 'Lowbrow Scraper'
·
« Previous entries