Web crawling with Ruby
Friday, June 15th, 2007Learned a little bit about scRUBYt! today (damn thats hard to type!).
It looks like a pretty cool way to do web crawlers. To use it, you define a “learning” crawler using the scRUBYt! DSL, which combined with the actual site at a specific point in time, creates the real crawler to do the dirty work - essentially, example text gets converted to xpath expressions. The cool thing about this is that when the site changes, one has to do minimal changes to create a new working crawler, not to mention that creating the crawler in the first place is a lot easier.