seattle-java-401d1

CF Web Scraping, Threads and Concurrency

Resources

Learning Objectives

Lecture Outline

Code Samples

Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
log(doc.title());
Elements newsHeadlines = doc.select("#mp-itn b a");
for (Element headline : newsHeadlines) {
  log("%s\n\t%s",
    headline.attr("title"), headline.absUrl("href"));
}

Configuration

Add JSoup as a dependency in your build.gradle file.

dependencies {
  compile 'org.jsoup:jsoup:1.11.1'
}