Package org.jsoup.helper

Class DataUtil

java.lang.Object
org.jsoup.helper.DataUtil

public final class DataUtil
extends Object
Internal static utilities for handling data.
  • Field Details

  • Method Details

    • load

      public static Document load​(File in, String charsetName, String baseUri) throws IOException
      Loads and parses a file to a Document. Files that are compressed with gzip (and end in .gz or .z) are supported in addition to uncompressed files.
      Parameters:
      in - file to load
      charsetName - (optional) character set of input; specify null to attempt to autodetect. A BOM in the file will always override this setting.
      baseUri - base URI of document, to resolve relative links against
      Returns:
      Document
      Throws:
      IOException - on IO error
    • load

      public static Document load​(InputStream in, String charsetName, String baseUri) throws IOException
      Parses a Document from an input steam.
      Parameters:
      in - input stream to parse. You will need to close it.
      charsetName - character set of input
      baseUri - base URI of document, to resolve relative links against
      Returns:
      Document
      Throws:
      IOException - on IO error
    • load

      public static Document load​(InputStream in, String charsetName, String baseUri, Parser parser) throws IOException
      Parses a Document from an input steam, using the provided Parser.
      Parameters:
      in - input stream to parse. You will need to close it.
      charsetName - character set of input
      baseUri - base URI of document, to resolve relative links against
      parser - alternate parser to use.
      Returns:
      Document
      Throws:
      IOException - on IO error
    • readToByteBuffer

      public static ByteBuffer readToByteBuffer​(InputStream inStream, int maxSize) throws IOException
      Read the input stream into a byte buffer. To deal with slow input streams, you may interrupt the thread this method is executing on. The data read until being interrupted will be available.
      Parameters:
      inStream - the input stream to read from
      maxSize - the maximum size in bytes to read from the stream. Set to 0 to be unlimited.
      Returns:
      the filled byte buffer
      Throws:
      IOException - if an exception occurs whilst reading from the input stream.