<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/xsl" href="../assets/xml/rss.xsl" media="all"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Joshua Hernandez's Blog (Posts about Web Scraping)</title><link>http://joshuahernandezblog.com/</link><description></description><atom:link href="http://joshuahernandezblog.com/categories/web-scraping.xml" rel="self" type="application/rss+xml"></atom:link><language>en</language><copyright>Contents © 2019 &lt;a href="mailto:Joshua.M.S.Hernandez@gmail.com"&gt;Joshua Hernandez&lt;/a&gt; </copyright><lastBuildDate>Wed, 16 Jan 2019 22:40:37 GMT</lastBuildDate><generator>Nikola (getnikola.com)</generator><docs>http://blogs.law.harvard.edu/tech/rss</docs><item><title>Web Scraping with R</title><link>http://joshuahernandezblog.com/blog/MovieProject/web-scraping-with-r/</link><dc:creator>Joshua Hernandez</dc:creator><description>&lt;div tabindex="-1" id="notebook" class="border-box-sizing"&gt;
    &lt;div class="container" id="notebook-container"&gt;

&lt;div class="cell border-box-sizing text_cell rendered"&gt;&lt;div class="prompt input_prompt"&gt;
&lt;/div&gt;
&lt;div class="inner_cell"&gt;
&lt;div class="text_cell_render border-box-sizing rendered_html"&gt;
&lt;p&gt;&lt;i&gt;I go over how to use R to harvest information from web pages. This post chronicles my use of rvest to collect movie information from Rotten Tomatoes and explore how professional critics and general audiences differ.&lt;/i&gt;
&lt;/p&gt;&lt;p&gt;&lt;a href="http://joshuahernandezblog.com/blog/MovieProject/web-scraping-with-r/"&gt;Read more…&lt;/a&gt; (23 min remaining to read)&lt;/p&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;</description><category>data extraction</category><category>R</category><category>rvest</category><category>Web Scraping</category><guid>http://joshuahernandezblog.com/blog/MovieProject/web-scraping-with-r/</guid><pubDate>Fri, 18 Aug 2017 06:14:20 GMT</pubDate></item></channel></rss>