Package com.jalios.util.webpage
Class WebPageMetaDataExtractorUtils
- java.lang.Object
  - com.jalios.util.webpage.WebPageMetaDataExtractorUtils

public final class WebPageMetaDataExtractorUtils extends java.lang.Object

Utils to extract a web page's metadata (title, description, images...).
- Since:
  jcms-9.0.4 && jcms-10
- Author:
  Kevin Bransard

Method Summary
Modifier and Type, Method, Description:
- static java.lang.String extractContent(org.jsoup.nodes.Document document, java.lang.String attrName, java.lang.String... cssQueries)
  Returns the extracted content for the given cssQueries and the given attribute name.
- static WebPageMetaData getWebPageMetaData(java.lang.String url, java.lang.String userAgent)
  Returns metadata as a WebPageMetaData object by connecting to the given url.
- static WebPageMetaData getWebPageMetaDataFromHtml(java.lang.String html)
  Returns metadata as a WebPageMetaData object by traversing the given html source.

Method Detail
getWebPageMetaDataFromHtml
public static WebPageMetaData getWebPageMetaDataFromHtml(java.lang.String html)
Returns metadata as a WebPageMetaData object by traversing the given html source.
- Parameters:
  html - the html to get metadata from
- Returns:
  a WebPageMetaData object
- Since:
  jcms-9.0.4
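
To make "traversing the given html source" more concrete, here is a minimal, stdlib-only sketch of pulling a title and a description out of an HTML string. It is not the JCMS implementation: the class HtmlMetaSketch and its methods are hypothetical names introduced for illustration, and regular expressions stand in for a real HTML parser, so the sketch only handles simple, well-behaved markup.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical, stdlib-only illustration; not the JCMS implementation. */
final class HtmlMetaSketch {

  // Captures the text between <title> and </title>, case-insensitively, across lines.
  private static final Pattern TITLE =
      Pattern.compile("<title[^>]*>(.*?)</title>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

  // Captures the content attribute of <meta name="description" content="...">.
  // Simplification: expects name before content and no quotes inside the value.
  private static final Pattern DESCRIPTION = Pattern.compile(
      "<meta\\s+name=[\"']description[\"']\\s+content=[\"']([^\"']*)[\"']",
      Pattern.CASE_INSENSITIVE);

  static Optional<String> title(String html) {
    return firstGroup(TITLE, html);
  }

  static Optional<String> description(String html) {
    return firstGroup(DESCRIPTION, html);
  }

  // Returns the trimmed first capture group, or empty when there is no match.
  private static Optional<String> firstGroup(Pattern pattern, String html) {
    if (html == null) {
      return Optional.empty();
    }
    Matcher matcher = pattern.matcher(html);
    return matcher.find() ? Optional.of(matcher.group(1).trim()) : Optional.empty();
  }

  public static void main(String[] args) {
    String html = "<html><head><title> Example page </title>"
        + "<meta name=\"description\" content=\"A short summary\"></head><body></body></html>";
    System.out.println(title(html).orElse("(no title)"));
    System.out.println(description(html).orElse("(no description)"));
  }
}
```

In JCMS code the equivalent call would simply be WebPageMetaDataExtractorUtils.getWebPageMetaDataFromHtml(html); the accessors available on the returned WebPageMetaData object are documented on that class, not here.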
getWebPageMetaData
public static WebPageMetaData getWebPageMetaData(java.lang.String url, java.lang.String userAgent)
Returns metadata as a WebPageMetaData object by connecting to the given url.
- Parameters:
  url - the url to get metadata from
  userAgent - the user agent used to access the url (a default user agent is used if null)
- Returns:
  a WebPageMetaData object
- Since:
  jcms-9.0.4
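
The null handling of userAgent can be pictured with the following sketch, which assumes Java 11 or later. It is an illustration only: the class UserAgentSketch, its methods, and the DEFAULT_USER_AGENT value are hypothetical, and the actual default used by JCMS is not stated in this page. The request is built but never sent, so the example needs no network access.

```java
import java.net.URI;
import java.net.http.HttpRequest;

/** Hypothetical illustration of the "default user agent if null" rule; not JCMS code. */
final class UserAgentSketch {

  // Assumed placeholder; the real JCMS default value is not documented here.
  static final String DEFAULT_USER_AGENT = "Mozilla/5.0 (compatible; ExampleBot/1.0)";

  // Falls back to the default only when the caller passes null.
  static String resolveUserAgent(String userAgent) {
    return userAgent != null ? userAgent : DEFAULT_USER_AGENT;
  }

  // Builds (but does not send) a GET request carrying the resolved User-Agent header.
  static HttpRequest buildRequest(String url, String userAgent) {
    return HttpRequest.newBuilder(URI.create(url))
        .header("User-Agent", resolveUserAgent(userAgent))
        .GET()
        .build();
  }

  public static void main(String[] args) {
    HttpRequest request = buildRequest("https://example.com/", null);
    System.out.println(request.headers().firstValue("User-Agent").orElse("(none)"));
  }
}
```

With the documented API, passing null as the second argument, as in WebPageMetaDataExtractorUtils.getWebPageMetaData(url, null), requests this default behavior.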
extractContent
public static java.lang.String extractContent(org.jsoup.nodes.Document document, java.lang.String attrName, java.lang.String... cssQueries)
Returns the extracted content for the given cssQueries and the given attribute name.
- Parameters:
  document - the Document
  attrName - the attribute name to look up on the elements returned by the css queries (can be empty)
  cssQueries - the css queries performed to search for elements
- Returns:
  a value based on cssQueries and attribute name
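
One plausible reading of this contract is a fallback chain: try each query in order and return the first non-empty value, reading the named attribute or, when attrName is empty, the element text. The sketch below illustrates that reading with the JDK only. Everything in it is an assumption: XPath over a well-formed XML document stands in for jsoup CSS queries (the JDK has no CSS selector engine), the class ExtractContentSketch is hypothetical, and returning an empty string when nothing matches is a choice made for this example, not documented JCMS behavior.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

/** Hypothetical fallback-chain illustration using XPath instead of CSS queries. */
final class ExtractContentSketch {

  // Parses a well-formed XML/XHTML string into a DOM document.
  static Document parse(String xhtml) {
    try {
      return DocumentBuilderFactory.newInstance().newDocumentBuilder()
          .parse(new InputSource(new StringReader(xhtml)));
    } catch (Exception e) {
      throw new IllegalArgumentException("Not well-formed XML", e);
    }
  }

  // Tries each query in order; returns the first non-blank value found.
  // Assumption: an empty string is returned when no query yields a value.
  static String extractContent(Document document, String attrName, String... queries) {
    XPath xpath = XPathFactory.newInstance().newXPath();
    for (String query : queries) {
      Node node;
      try {
        node = (Node) xpath.evaluate(query, document, XPathConstants.NODE);
      } catch (Exception e) {
        throw new IllegalArgumentException("Invalid query: " + query, e);
      }
      if (!(node instanceof Element)) {
        continue; // no element matched this query, try the next one
      }
      Element element = (Element) node;
      // Empty attribute name means "use the element's text".
      String value = (attrName == null || attrName.isEmpty())
          ? element.getTextContent()
          : element.getAttribute(attrName);
      if (value != null && !value.trim().isEmpty()) {
        return value.trim();
      }
    }
    return "";
  }

  public static void main(String[] args) {
    Document doc = parse("<html><head><title>Plain title</title>"
        + "<meta property=\"og:title\" content=\"OG title\"/></head></html>");
    System.out.println(extractContent(doc, "content",
        "//meta[@property='twitter:title']", "//meta[@property='og:title']"));
    System.out.println(extractContent(doc, "", "//title"));
  }
}
```

With the real method, the queries would be jsoup CSS selectors such as meta[property=og:title], applied to an org.jsoup.nodes.Document.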