java.lang.Object
  org.jsoup.safety.Whitelist
public class Whitelist
Whitelists define what HTML (elements and attributes) to allow through the cleaner. Everything else is removed.
Start with one of the defaults:
- none()
- simpleText()
- basic()
- basicWithImages()
- relaxed()

If you need to allow more through (please be careful!), tweak a base whitelist with:
- addTags(java.lang.String...)
- addAttributes(java.lang.String, java.lang.String...)
- addEnforcedAttribute(java.lang.String, java.lang.String, java.lang.String)
- addProtocols(java.lang.String, java.lang.String, java.lang.String...)

The cleaner and these whitelists assume that you want to clean a body fragment of HTML (to add user-supplied HTML into a templated page), and not to clean a full HTML document. If the latter is the case, either wrap the document HTML around the cleaned body HTML, or create a whitelist that allows html and head elements as appropriate.
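As a rough illustration of cleaning a body fragment, the sketch below passes untrusted HTML through the basic() whitelist. It assumes a jsoup release that still ships the Whitelist class and uses Jsoup.clean(String, Whitelist) as the entry point, which is not described in this section; the class name CleanFragmentExample is hypothetical.

```java
import org.jsoup.Jsoup;
import org.jsoup.safety.Whitelist;

final class CleanFragmentExample {
    // Cleans an untrusted body fragment using the basic() whitelist.
    // Assumption: Jsoup.clean(String, Whitelist) is available in the jsoup version in use.
    static String cleanFragment(String untrustedHtml) {
        return Jsoup.clean(untrustedHtml, Whitelist.basic());
    }

    public static void main(String[] args) {
        String dirty = "<p>Hello <b>world</b><script>alert(1)</script></p>";
        // The script element is not in basic(), so it should not survive cleaning.
        System.out.println(cleanFragment(dirty));
    }
}
```

The cleaned string is a fragment, so it would normally be placed inside a page template rather than served as a complete document.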
If you are going to extend a whitelist, please be very careful. Make sure you understand what attributes may lead to XSS attack vectors. URL attributes are particularly vulnerable and require careful validation. See http://ha.ckers.org/xss.html for some XSS attack examples.
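To make the risk with URL attributes concrete, here is a minimal, library-independent sketch of a scheme allow-list check. It is not jsoup's implementation; the class SchemeCheckSketch, the method hasAllowedScheme, and the chosen set of schemes are all assumptions made for illustration.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

final class SchemeCheckSketch {
    // Hypothetical allow-list; pick the schemes your application actually needs.
    private static final Set<String> ALLOWED =
            new HashSet<>(Arrays.asList("http", "https", "mailto"));

    // Returns true only when the URL parses and carries an allowed scheme.
    // Relative URLs (no scheme) and unparseable values are rejected.
    static boolean hasAllowedScheme(String url) {
        try {
            String scheme = new URI(url.trim()).getScheme();
            return scheme != null && ALLOWED.contains(scheme.toLowerCase(Locale.ROOT));
        } catch (URISyntaxException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(hasAllowedScheme("https://example.com/"));
        System.out.println(hasAllowedScheme("javascript:alert(1)"));
    }
}
```

The point of the sketch is that validation names the schemes that are permitted instead of trying to list every dangerous one, and treats anything unparseable as unsafe.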
| Constructor Summary | |
|---|---|
| Whitelist() | Create a new, empty whitelist. |

| Method Summary | |
|---|---|
| Whitelist | addAttributes(String tag, String... keys): Add a list of allowed attributes to a tag. |
| Whitelist | addEnforcedAttribute(String tag, String key, String value): Add an enforced attribute to a tag. |
| Whitelist | addProtocols(String tag, String key, String... protocols): Add allowed URL protocols for an element's URL attribute. |
| Whitelist | addTags(String... tags): Add a list of allowed elements to a whitelist. |
| static Whitelist | basic(): This whitelist allows a fuller range of text nodes: a, b, blockquote, br, cite, code, dd, dl, dt, em, i, li, ol, p, pre, q, small, strike, strong, sub, sup, u, ul, and appropriate attributes. |
| static Whitelist | basicWithImages(): This whitelist allows the same text tags as basic(), and also allows img tags, with appropriate attributes, with src pointing to http or https. |
| static Whitelist | none(): This whitelist allows only text nodes: all HTML will be stripped. |
| static Whitelist | relaxed(): This whitelist allows a full range of text and structural body HTML: a, b, blockquote, br, caption, cite, code, col, colgroup, dd, dl, dt, em, h1, h2, h3, h4, h5, h6, i, img, li, ol, p, pre, q, small, strike, strong, sub, sup, table, tbody, td, tfoot, th, thead, tr, u, ul. Links do not have an enforced rel=nofollow attribute, but you can add that if desired. |
| static Whitelist | simpleText(): This whitelist allows only simple text formatting: b, em, i, strong, u. |

| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |

| Constructor Detail |
|---|

public Whitelist()
Create a new, empty whitelist.
See also: basic(), basicWithImages(), simpleText(), relaxed()

| Method Detail |
|---|
public static Whitelist none()
This whitelist allows only text nodes: all HTML will be stripped.

public static Whitelist simpleText()
This whitelist allows only simple text formatting: b, em, i, strong, u. All other HTML (tags and attributes) will be removed.

public static Whitelist basic()
This whitelist allows a fuller range of text nodes: a, b, blockquote, br, cite, code, dd, dl, dt, em, i, li, ol, p, pre, q, small, strike, strong, sub, sup, u, ul, and appropriate attributes.
Links (a elements) can point to http, https, ftp, mailto, and have an enforced rel=nofollow attribute.
Does not allow images.

public static Whitelist basicWithImages()
This whitelist allows the same text tags as basic(), and also allows img tags, with appropriate attributes, with src pointing to http or https.

public static Whitelist relaxed()
This whitelist allows a full range of text and structural body HTML: a, b, blockquote, br, caption, cite, code, col, colgroup, dd, dl, dt, em, h1, h2, h3, h4, h5, h6, i, img, li, ol, p, pre, q, small, strike, strong, sub, sup, table, tbody, td, tfoot, th, thead, tr, u, ul.
Links do not have an enforced rel=nofollow attribute, but you can add that if desired.
public Whitelist addTags(String... tags)
Add a list of allowed elements to a whitelist.
Parameters:
tags - tag names to allow
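public Whitelist addAttributes(String tag, String... keys)
Add a list of allowed attributes to a tag. To make an attribute valid for all tags, use the pseudo tag :all, e.g. addAttributes(":all", "class").
Parameters:
tag - The tag the attributes are for
keys - List of valid attributes for the tag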
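The :all pseudo tag can be sketched as follows, starting from simpleText() and permitting a class attribute on every allowed element. Jsoup.clean(String, Whitelist) and the class name AllowClassAttributeExample are assumptions that do not come from this section.

```java
import org.jsoup.Jsoup;
import org.jsoup.safety.Whitelist;

final class AllowClassAttributeExample {
    // Extends simpleText() so that "class" is permitted on any allowed tag.
    // Attributes that were not added (for example onclick) are still removed.
    static String clean(String html) {
        Whitelist whitelist = Whitelist.simpleText()
                .addAttributes(":all", "class");
        return Jsoup.clean(html, whitelist);
    }

    public static void main(String[] args) {
        System.out.println(clean("<b class=\"note\" onclick=\"x()\">hi</b>"));
    }
}
```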
public Whitelist addEnforcedAttribute(String tag, String key, String value)
Add an enforced attribute to a tag. For example, addEnforcedAttribute("a", "rel", "nofollow") will make all a tags output as <a href="..." rel="nofollow">
Parameters:
tag - The tag the enforced attribute is for
key - The attribute key
value - The enforced attribute value
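Since relaxed() does not enforce rel=nofollow on links, one way to add it is sketched below. The use of Jsoup.clean(String, Whitelist) and the class name EnforcedNofollowExample are assumptions for illustration only.

```java
import org.jsoup.Jsoup;
import org.jsoup.safety.Whitelist;

final class EnforcedNofollowExample {
    // Starts from relaxed() and forces rel="nofollow" onto every a element.
    static String clean(String html) {
        Whitelist whitelist = Whitelist.relaxed()
                .addEnforcedAttribute("a", "rel", "nofollow");
        return Jsoup.clean(html, whitelist);
    }

    public static void main(String[] args) {
        System.out.println(clean("<a href=\"http://example.com/\">link</a>"));
    }
}
```

Because each add method returns a Whitelist, further adjustments can be chained onto the same expression.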
public Whitelist addProtocols(String tag, String key, String... protocols)
Add allowed URL protocols for an element's URL attribute. For example: addProtocols("a", "href", "ftp", "http", "https")
Parameters:
tag - Tag the URL protocol is for
key - Attribute key
protocols - List of valid protocols
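A small sketch of building a whitelist from scratch shows how addProtocols combines with addTags and addAttributes. It assumes Jsoup.clean(String, Whitelist) as the entry point; the class name LinkProtocolExample is hypothetical.

```java
import org.jsoup.Jsoup;
import org.jsoup.safety.Whitelist;

final class LinkProtocolExample {
    // Allows only a elements with an href attribute that uses http or https.
    static String clean(String html) {
        Whitelist whitelist = new Whitelist()
                .addTags("a")
                .addAttributes("a", "href")
                .addProtocols("a", "href", "http", "https");
        return Jsoup.clean(html, whitelist);
    }

    public static void main(String[] args) {
        String dirty = "<a href=\"javascript:alert(1)\">x</a> "
                + "<a href=\"https://example.com/\">y</a>";
        // The javascript: href should be dropped; the https link should remain.
        System.out.println(clean(dirty));
    }
}
```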