Package org.apache.nutch.urlfilter.api
Class RegexRule
- java.lang.Object
-
- org.apache.nutch.urlfilter.api.RegexRule
-
public abstract class RegexRule extends Object
A generic regular expression rule.- Author:
- Jérôme Charron
-
-
Method Summary
All Methods Instance Methods Abstract Methods Concrete Methods Modifier and Type Method Description protected boolean
accept()
Return if this rule is used for filtering-in or out.protected String
hostOrDomain()
Return if this rule is used for filtering-in or out.protected abstract boolean
match(String url)
Checks if a url matches this rule.protected String
regex()
Return if this rule's regex.
-
-
-
Constructor Detail
-
RegexRule
protected RegexRule(boolean sign, String regex)
Constructs a new regular expression rule.- Parameters:
sign
- specifies if this rule must filter-in or filter-out. Atrue
value means that any url matching this rule must be accepted, afalse
value means that any url matching this rule must be rejected.regex
- is the regular expression used for matching (seematch(String)
method).
-
RegexRule
protected RegexRule(boolean sign, String regex, String hostOrDomain)
Constructs a new regular expression rule.- Parameters:
sign
- specifies if this rule must filter-in or filter-out. Atrue
value means that any url matching this rule must be accepted, afalse
value means that any url matching this rule must be rejected.regex
- is the regular expression used for matching (seematch(String)
method).hostOrDomain
- the host or domain to which this regex belongs
-
-
Method Detail
-
accept
protected boolean accept()
Return if this rule is used for filtering-in or out.- Returns:
true
if any url matching this rule must be accepted, otherwisefalse
.
-
hostOrDomain
protected String hostOrDomain()
Return if this rule is used for filtering-in or out.- Returns:
- host or domain this regex rule belongs to
-
regex
protected String regex()
Return if this rule's regex.- Returns:
- this regex
-
match
protected abstract boolean match(String url)
Checks if a url matches this rule.- Parameters:
url
- is the url to check.- Returns:
true
if the specified url matches this rule, otherwisefalse
.
-
-