select * from html where url="http://news.ycombinator.com/" and xpath='//tr/td/a[substring(@href,1,4)="http"][@href!="http://ycombinator.com"]'
http://developer.yahoo.com/yql/console/
I think it's pretty crazy that you can now scrape well-marked pages with a SQL-like syntax.
http://developer.yahoo.com/yql/console/