Package org.jsoup.examples
Class HtmlToPlainText
java.lang.Object
org.jsoup.examples.HtmlToPlainText
public class HtmlToPlainText extends Object
HTML to plain-text. This example program demonstrates the use of jsoup to convert HTML input to lightly-formatted plain-text. That is divergent from the general goal of jsoup's .text() methods, which is to get clean data from a scrape.
Note that this is a fairly simplistic formatter -- for real world use you'll want to embrace and extend.
To invoke from the command line, assuming you've downloaded the jsoup-examples jar to your current directory:
java -jar jsoup-examples.jar url [selector]
-
Constructor Summary
Constructors -
Method Summary
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Constructor Details
-
Method Details
-
main
- Throws:
-
IOException
-