Package com.jalios.util.webpage
Class WebPageMetaDataExtractorUtils
java.lang.Object
  com.jalios.util.webpage.WebPageMetaDataExtractorUtils

public final class WebPageMetaDataExtractorUtils extends java.lang.Object

Utils to extract webpage metadata (title, description, images...).

Since:
  jcms-9.0.4 && jcms-10
Version:
  $Revision: 136288 $
Author:
  Kevin Bransard

Method Summary
Modifier and Type          Method and Description

static java.lang.String    extractContent(org.jsoup.nodes.Document document, java.lang.String attrName, java.lang.String... cssQueries)
                           Returns the extracted content for the given cssQueries and attribute name.

static WebPageMetaData     getWebPageMetaData(java.lang.String url, java.lang.String userAgent)
                           Returns metadata as a WebPageMetaData object by connecting to the given url.

static WebPageMetaData     getWebPageMetaDataFromHtml(java.lang.String html)
                           Returns metadata as a WebPageMetaData object by traversing the given html source.

Method Detail
getWebPageMetaDataFromHtml
public static WebPageMetaData getWebPageMetaDataFromHtml(java.lang.String html)

Returns metadata as a WebPageMetaData object by traversing the given html source.

Parameters:
  html - the html to get metadata from
Returns:
  a WebPageMetaData object
Since:
  jcms-9.0.4

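To make "traversing the html source" concrete, here is a minimal sketch of reading a page's title, description and image with jsoup alone. It is not the JCMS implementation: the class HtmlMetaSketch, its methods, the map keys and the choice of tags (the title element, meta description, og:image) are all assumptions made for illustration, and the actual fields exposed by WebPageMetaData are not shown in this page.

```java
import java.util.LinkedHashMap;
import java.util.Map;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class HtmlMetaSketch {

    // Hypothetical helper: attribute value of the first element matching the
    // query, or "" when nothing matches.
    static String attrOfFirst(Document doc, String cssQuery, String attrName) {
        Element element = doc.selectFirst(cssQuery);
        return element == null ? "" : element.attr(attrName);
    }

    // Parses the HTML source and collects a few common metadata fields.
    // The selected tags are an assumption, not the list used by JCMS.
    static Map<String, String> read(String html) {
        Document doc = Jsoup.parse(html);
        Map<String, String> meta = new LinkedHashMap<>();
        meta.put("title", doc.title());
        meta.put("description", attrOfFirst(doc, "meta[name=description]", "content"));
        meta.put("image", attrOfFirst(doc, "meta[property=og:image]", "content"));
        return meta;
    }

    public static void main(String[] args) {
        String html = "<html><head><title>Example</title>"
            + "<meta name=\"description\" content=\"A sample page\">"
            + "<meta property=\"og:image\" content=\"https://example.com/a.png\">"
            + "</head><body></body></html>";
        System.out.println(read(html));
    }
}
```

With the documented API, the equivalent call would simply be WebPageMetaDataExtractorUtils.getWebPageMetaDataFromHtml(html), which returns a WebPageMetaData object instead of a map.
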
getWebPageMetaData
public static WebPageMetaData getWebPageMetaData(java.lang.String url, java.lang.String userAgent)

Returns metadata as a WebPageMetaData object by connecting to the given url.

Parameters:
  url - the url to get metadata from
  userAgent - the user agent used to access the url (a default user agent is used if null)
Returns:
  a WebPageMetaData object
Since:
  jcms-9.0.4

extractContent
public static java.lang.String extractContent(org.jsoup.nodes.Document document, java.lang.String attrName, java.lang.String... cssQueries)

Returns the extracted content for the given cssQueries and attribute name.

Parameters:
  document - the Document
  attrName - the attribute name to read on the elements returned by the css queries (can be empty)
  cssQueries - the css queries performed to search for elements
Returns:
  a value based on cssQueries and attribute name

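The description of extractContent does not say how several css queries are combined or what an empty attrName means. The sketch below shows one plausible reading, written with jsoup only: queries are tried in order, the first non-empty value wins, and the element text is used when attrName is empty. The class ExtractContentSketch and the method extractFirst are hypothetical, and this behaviour is an assumption rather than the documented contract.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class ExtractContentSketch {

    // Assumed behaviour, not the JCMS implementation: try each query in order
    // and return the first non-empty value. When attrName is null or empty,
    // the element's text is read instead of an attribute.
    static String extractFirst(Document document, String attrName, String... cssQueries) {
        for (String cssQuery : cssQueries) {
            Element element = document.selectFirst(cssQuery);
            if (element == null) {
                continue; // no match for this query, try the next one
            }
            String value = (attrName == null || attrName.isEmpty())
                ? element.text()
                : element.attr(attrName);
            if (!value.isEmpty()) {
                return value;
            }
        }
        return ""; // nothing matched, or every match was empty
    }

    public static void main(String[] args) {
        Document doc = Jsoup.parse("<html><head><title>Plain title</title>"
            + "<meta name=\"description\" content=\"Fallback description\">"
            + "</head><body></body></html>");

        // The Open Graph tag is missing, so the second query supplies the value.
        System.out.println(extractFirst(doc, "content",
            "meta[property=og:description]", "meta[name=description]"));

        // An empty attribute name reads the text of the matched element.
        System.out.println(extractFirst(doc, "", "h1", "title"));
    }
}
```

Ordering the queries from most to least specific (for example an Open Graph tag before a generic meta tag) is a common way to prefer richer metadata while still returning something for plain pages.
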