<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://www.feedreader.com" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>Feedreader.com - HTML 2 RSS (web page scraping) - Comments</title>
 <link>http://www.feedreader.com/node/1021</link>
 <description>Comments for &quot;HTML 2 RSS (web page scraping)&quot;</description>
 <language>en</language>
<item>
 <title>hello!</title>
 <link>http://www.feedreader.com/node/1021#comment-4934</link>
 <description>&lt;p&gt;With menwei! Merry Christmas!&lt;/p&gt;
</description>
 <pubDate>Tue, 24 Nov 2009 15:12:53 +0200</pubDate>
 <dc:creator>Lypeerakadalp</dc:creator>
 <guid isPermaLink="false">comment 4934 at http://www.feedreader.com</guid>
</item>
<item>
 <title>Links are working again.</title>
 <link>http://www.feedreader.com/node/1021#comment-4707</link>
 <description>&lt;p&gt;Links are working again. Feel free to download!&lt;/p&gt;
</description>
 <pubDate>Thu, 23 Apr 2009 12:59:17 +0300</pubDate>
 <dc:creator>tom-admin</dc:creator>
 <guid isPermaLink="false">comment 4707 at http://www.feedreader.com</guid>
</item>
<item>
 <title>Link to HTML2RSS does not work</title>
 <link>http://www.feedreader.com/node/1021#comment-4700</link>
 <description>&lt;p&gt;Hi,&lt;/p&gt;
&lt;p&gt; Can you please provide the link to HTML2RSS converter ?&lt;/p&gt;
&lt;p&gt;Thanks&lt;/p&gt;
</description>
 <pubDate>Tue, 14 Apr 2009 00:53:23 +0300</pubDate>
 <dc:creator>pr_arun</dc:creator>
 <guid isPermaLink="false">comment 4700 at http://www.feedreader.com</guid>
</item>
<item>
 <title>cool</title>
 <link>http://www.feedreader.com/node/1021#comment-4480</link>
 <description>&lt;p&gt;cool&lt;/p&gt;
</description>
 <pubDate>Wed, 03 Sep 2008 22:54:28 +0300</pubDate>
 <dc:creator>truut</dc:creator>
 <guid isPermaLink="false">comment 4480 at http://www.feedreader.com</guid>
</item>
<item>
 <title>Pages that require login confirmation</title>
 <link>http://www.feedreader.com/node/1021#comment-3435</link>
 <description>&lt;p&gt;I was wondering whether there is a way to grab info from pages that require login confirmation.&lt;/p&gt;
&lt;p&gt;Thanks for your time and answer.&lt;/p&gt;
</description>
 <pubDate>Tue, 09 Oct 2007 17:35:04 +0300</pubDate>
 <dc:creator>vBm</dc:creator>
 <guid isPermaLink="false">comment 3435 at http://www.feedreader.com</guid>
</item>
<item>
 <title>I got it to work but have a question</title>
 <link>http://www.feedreader.com/node/1021#comment-3414</link>
 <description>&lt;p&gt;I tried to scrape a local paper I don&#039;t feel like leafing through.&lt;/p&gt;
&lt;p&gt;Here is my script&lt;/p&gt;
&lt;p&gt;http://localhost:8182/?serverurl=http://news.mywebpal.com/index.cfm?pnpid=573&amp;amp;feedtitle=Howard County Times%20scraped%20feed&amp;amp;linkfilter=local&amp;amp;encoding&lt;/p&gt;
&lt;p&gt;The linkfilter variable narrows the scope to local news only.  I&#039;m not interested in their other articles.&lt;/p&gt;
&lt;p&gt;I wonder why I couldn&#039;t just use &quot;http://news.mywebpal.com&quot; as the serverurl variable. Will the rest of the URL (index.cfm?pnpid=573) be different each day?&lt;/p&gt;
</description>
 <pubDate>Wed, 03 Oct 2007 22:33:12 +0300</pubDate>
 <dc:creator>jgallihue</dc:creator>
 <guid isPermaLink="false">comment 3414 at http://www.feedreader.com</guid>
</item>
<item>
 <title>musicaplay</title>
 <link>http://www.feedreader.com/node/1021#comment-3326</link>
 <description>&lt;p&gt;solo musica&lt;/p&gt;
</description>
 <pubDate>Mon, 03 Sep 2007 11:55:02 +0300</pubDate>
 <dc:creator>vituccio</dc:creator>
 <guid isPermaLink="false">comment 3326 at http://www.feedreader.com</guid>
</item>
<item>
 <title>Service doesn&#039;t load</title>
 <link>http://www.feedreader.com/node/1021#comment-3317</link>
 <description>&lt;p&gt;When I run the EXE file, the service loads for about a second and then disappears from the services list.&lt;/p&gt;
&lt;p&gt;What do I do?&lt;/p&gt;
</description>
 <pubDate>Fri, 31 Aug 2007 09:25:22 +0300</pubDate>
 <dc:creator>victoria</dc:creator>
 <guid isPermaLink="false">comment 3317 at http://www.feedreader.com</guid>
</item>
<item>
 <title>Question and answer...</title>
 <link>http://www.feedreader.com/node/1021#comment-2384</link>
 <description>&lt;p&gt;Q: How do I add web pages that have an &amp;amp; symbol in the link?&lt;br /&gt;
A: Replace the &amp;amp; symbol with %26.&lt;/p&gt;
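&lt;p&gt;For anyone who prefers to script this, here is a small Python sketch of the same idea. The function name escape_target_url is made up for this example, and the set of characters left untouched is an assumption; the only behaviour taken from the answer above is that an ampersand ends up as %26.&lt;/p&gt;

```python
from urllib.parse import quote


def escape_target_url(page_url):
    """Percent-encode a page URL so that any ampersand in it becomes %26."""
    # Assumption: the characters that give the URL its shape, plus any
    # percent sign from an existing escape, are left as they are.
    # Everything else outside the unreserved set, the ampersand included,
    # is percent-encoded.
    return quote(page_url, safe=":/?=%")
```

&lt;p&gt;The returned string can then be used as the value of the serverurl variable.&lt;/p&gt;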
</description>
 <pubDate>Wed, 20 Jun 2007 17:06:17 +0300</pubDate>
 <dc:creator>tom-admin</dc:creator>
 <guid isPermaLink="false">comment 2384 at http://www.feedreader.com</guid>
</item>
<item>
 <title>HTML 2 RSS (web page scraping)</title>
 <link>http://www.feedreader.com/node/1021</link>
 <description>&lt;p&gt;Have you ever wanted to read a web page in Feedreader, only to find that it has no RSS feed? I have been in this situation many times. So we did a little hacking and released a tool called HTML2RSS to the public.&lt;/p&gt;
&lt;p&gt;So what does this tool do? Basically, it&#039;s a small web server that you can reach at http://localhost:8182. Open that address and you will see a brief usage description. I will copy the example here and explain it:&lt;/p&gt;
&lt;p&gt;http://localhost:8182/?serverurl=http://bbc.co.uk&amp;amp;feedtitle=BBC%20scraped%20feed&amp;amp;linkfilter=news.bbc.co.uk&amp;amp;encoding=gb2312&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;serverurl : URL of the target web page (the one without an RSS feed)&lt;/li&gt;
&lt;li&gt;feedtitle : title of the feed you are creating (if you omit this variable, the feed title will be the web page URL)&lt;/li&gt;
&lt;li&gt;linkfilter : return only links that contain the string given in this variable (if you omit it, all links on the web page are returned)&lt;/li&gt;
&lt;li&gt;encoding : output encoding (must match the encoding of the web page). If you omit this variable, the tool tries to detect the encoding itself, but this does not always work.&lt;/li&gt;
&lt;/ul&gt;
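&lt;p&gt;As a rough illustration of how the four variables combine, here is a minimal Python sketch that assembles a request URL like the example above. The helper name build_feed_url and the choice of which characters to leave unescaped are assumptions made for this sketch, not part of HTML2RSS; the base address is the one given in this post.&lt;/p&gt;

```python
from urllib.parse import quote, urlencode

# Address of the local HTML2RSS web server, as given in the post.
HTML2RSS_BASE = "http://localhost:8182/"


def build_feed_url(serverurl, feedtitle=None, linkfilter=None, encoding=None):
    """Build an HTML2RSS request URL, leaving out any optional variable that is None."""
    params = {"serverurl": serverurl}
    optional = {"feedtitle": feedtitle, "linkfilter": linkfilter, "encoding": encoding}
    for name, value in optional.items():
        if value is not None:
            params[name] = value
    # Assumption: colon, slash, question mark and equals sign stay readable,
    # spaces become %20, and an ampersand inside a value becomes %26.
    return HTML2RSS_BASE + "?" + urlencode(params, safe=":/?=", quote_via=quote)
```

&lt;p&gt;Called with the BBC values from the example, this should give the same request URL as the one shown above. Leaving out feedtitle, linkfilter or encoding simply drops that variable from the URL, so the defaults described in the list apply.&lt;/p&gt;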
&lt;p&gt;That is all. Leave the application running and add the example link to Feedreader as a feed. News will come in :).&lt;/p&gt;
&lt;p&gt;HINT : This tool works with other RSS readers, too :).&lt;/p&gt;
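&lt;p&gt;Since the tool is just a local web server, any HTTP client should be able to fetch the scraped feed as well. The sketch below is only an illustration in Python: it assumes HTML2RSS is running and that its output is ordinary RSS with item and title elements, which this post does not spell out. Both function names are made up for the example.&lt;/p&gt;

```python
import urllib.request
import xml.etree.ElementTree as ET


def item_titles(rss_text):
    """Return the title of every item element found in an RSS document."""
    root = ET.fromstring(rss_text)
    return [item.findtext("title", default="") for item in root.iter("item")]


def fetch_titles(feed_url):
    """Download the scraped feed from the local server and list its item titles."""
    # Assumption: feed_url is an HTML2RSS request URL on localhost:8182.
    with urllib.request.urlopen(feed_url, timeout=10) as response:
        return item_titles(response.read())
```

&lt;p&gt;Passing the example link from above to fetch_titles would print nothing by itself; it returns a list of titles that a script can filter or display as needed.&lt;/p&gt;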
&lt;p&gt;DISCLAIMER : This is not an official product. It is just a tool that we are experimenting with. We hope to develop it a little further (tray option), but development priorities can be hectic :).&lt;br /&gt;
&lt;br&gt;&lt;br /&gt;
&lt;strong&gt;Download&lt;/strong&gt;&lt;br /&gt;
HTML2RSS can be downloaded from &lt;a href=&quot;http://www.feedreader.com/releases/html2rss.exe&quot;&gt;here&lt;/a&gt;.&lt;br /&gt;
&amp;nbsp;&lt;br /&gt;
A newer version of HTML2RSS that runs as a Windows service can be downloaded from &lt;a href=&quot;http://www.feedreader.com/releases/html2rss_service.exe&quot;&gt;here&lt;/a&gt;. Install it from the command line with &quot;html2rss_service.exe /install&quot; and then start it from the Services control panel.&lt;/p&gt;
&lt;p&gt;&lt;!--break--&gt;&lt;/p&gt;
</description>
 <comments>http://www.feedreader.com/node/1021#comment</comments>
 <pubDate>Wed, 20 Jun 2007 16:47:18 +0300</pubDate>
 <dc:creator>tom-admin</dc:creator>
 <guid isPermaLink="false">1021 at http://www.feedreader.com</guid>
</item>
</channel>
</rss>

