public final class WebPageMetaDataExtractorUtils
extends java.lang.Object
| Modifier and Type | Method and Description |
|---|---|
static java.lang.String |
extractContent(org.jsoup.nodes.Document document,
java.lang.String attrName,
java.lang.String... cssQueries)
Returns the extracted content for given cssQueries and given attribute name
|
static WebPageMetaData |
getWebPageMetaData(java.lang.String url,
java.lang.String userAgent)
Returns metadata as
WebPageMetaData object by connecting to given url |
static WebPageMetaData |
getWebPageMetaDataFromHtml(java.lang.String html)
Returns metadata as
WebPageMetaData object by traversing given html source |
public static WebPageMetaData getWebPageMetaDataFromHtml(java.lang.String html)
WebPageMetaData object by traversing given html sourcehtml - the html to get meta data fromWebPageMetaData objectpublic static WebPageMetaData getWebPageMetaData(java.lang.String url, java.lang.String userAgent)
WebPageMetaData object by connecting to given urlurl - the url to get meta data fromuserAgent - the user agent to access url (a default user-agent will be used if null)WebPageMetaData objectpublic static java.lang.String extractContent(org.jsoup.nodes.Document document,
java.lang.String attrName,
java.lang.String... cssQueries)
document - the DocumentattrName - the attribute name to search for elements returned by the css queries (Can be empty)cssQueries - the css queries performed to search for elementsCopyright © 2001-2017 Jalios SA. All Rights Reserved.