public final class WebPageMetaDataExtractorUtils
extends java.lang.Object
Modifier and Type | Method and Description |
---|---|
static java.lang.String | extractContent(org.jsoup.nodes.Document document, java.lang.String attrName, java.lang.String... cssQueries) Returns the extracted content for given cssQueries and given attribute name |
static WebPageMetaData | getWebPageMetaData(java.lang.String url, java.lang.String userAgent) Returns metadata as WebPageMetaData object by connecting to given url |
static WebPageMetaData | getWebPageMetaDataFromHtml(java.lang.String html) Returns metadata as WebPageMetaData object by traversing given html source |
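
The varargs cssQueries parameter of extractContent suggests that several lookups can be supplied for one piece of metadata. The sketch below illustrates one plausible reading of that idea, an ordered fallback where the first lookup that yields a non-empty value wins. It is not the Jalios implementation: the real method works on a jsoup Document with CSS queries, whereas this stand-in uses only java.util.regex on an HTML string so that it runs without extra libraries. The class MetaFallbackSketch and its methods metaContent, titleText, firstNonEmpty and extractTitle are hypothetical names introduced for illustration, and the first-match-wins behaviour is an assumption rather than something the documentation states.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical stand-in, not part of the Jalios API.
final class MetaFallbackSketch {

    // Returns the content attribute of the first <meta> tag whose keyAttr
    // equals keyValue, or null if there is none. Simplification: the key
    // attribute must come directly before the content attribute.
    static String metaContent(String html, String keyAttr, String keyValue) {
        Pattern p = Pattern.compile(
            "<meta\\s+" + Pattern.quote(keyAttr) + "=\"" + Pattern.quote(keyValue)
                + "\"\\s+content=\"([^\"]*)\"",
            Pattern.CASE_INSENSITIVE);
        Matcher m = p.matcher(html);
        return m.find() ? m.group(1) : null;
    }

    // Returns the trimmed text of the first <title> element, or null if absent.
    static String titleText(String html) {
        Matcher m = Pattern.compile("<title>([^<]*)</title>", Pattern.CASE_INSENSITIVE)
            .matcher(html);
        return m.find() ? m.group(1).trim() : null;
    }

    // Returns the first candidate that is neither null nor empty, else "".
    static String firstNonEmpty(String... candidates) {
        for (String candidate : candidates) {
            if (candidate != null && !candidate.isEmpty()) {
                return candidate;
            }
        }
        return "";
    }

    // Assumed fallback order: Open Graph title first, then the <title> element.
    static String extractTitle(String html) {
        return firstNonEmpty(
            metaContent(html, "property", "og:title"),
            titleText(html));
    }
}
```

Calling MetaFallbackSketch.extractTitle(html) on a page that declares both an og:title meta tag and a title element returns the og:title value; a page with neither yields an empty string. A regex is only adequate for a toy like this, which is presumably why the real utility relies on jsoup to parse the HTML.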

public static WebPageMetaData getWebPageMetaDataFromHtml(java.lang.String html)
Returns metadata as WebPageMetaData object by traversing given html source
Parameters:
html - the html to get meta data from
Returns:
WebPageMetaData object

public static WebPageMetaData getWebPageMetaData(java.lang.String url, java.lang.String userAgent)
Returns metadata as WebPageMetaData object by connecting to given url
Parameters:
url - the url to get meta data from
userAgent - the user agent to access url (a default user-agent will be used if null)
Returns:
WebPageMetaData object

public static java.lang.String extractContent(org.jsoup.nodes.Document document, java.lang.String attrName, java.lang.String... cssQueries)
Returns the extracted content for given cssQueries and given attribute name
Parameters:
document - the Document
attrName - the attribute name to search for elements returned by the css queries (can be empty)
cssQueries - the css queries performed to search for elements

Copyright © 2001-2017 Jalios SA. All Rights Reserved.