Class PrefixURLFilter
- java.lang.Object
-
- org.apache.nutch.urlfilter.prefix.PrefixURLFilter
-
- All Implemented Interfaces:
Configurable
,URLFilter
,Pluggable
public class PrefixURLFilter extends Object implements URLFilter
Filters URLs based on a file of URL prefixes. The file is named by (1) property "urlfilter.prefix.file" in ./conf/nutch-default.xml, or (2) the attribute "file" in plugin.xml of this plugin.The format of this file is one URL prefix per line.
-
-
Field Summary
-
Fields inherited from interface org.apache.nutch.net.URLFilter
X_POINT_ID
-
-
Constructor Summary
Constructors Constructor Description PrefixURLFilter()
PrefixURLFilter(String stringRules)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description String
filter(String url)
Interface for a filter that transforms a URL: it can pass the original URL through or "delete" the URL by returning nullConfiguration
getConf()
static void
main(String[] args)
void
setConf(Configuration conf)
-
-
-
Constructor Detail
-
PrefixURLFilter
public PrefixURLFilter() throws IOException
- Throws:
IOException
-
PrefixURLFilter
public PrefixURLFilter(String stringRules) throws IOException
- Throws:
IOException
-
-
Method Detail
-
filter
public String filter(String url)
Description copied from interface:URLFilter
Interface for a filter that transforms a URL: it can pass the original URL through or "delete" the URL by returning null
-
main
public static void main(String[] args) throws IOException
- Throws:
IOException
-
setConf
public void setConf(Configuration conf)
- Specified by:
setConf
in interfaceConfigurable
-
getConf
public Configuration getConf()
- Specified by:
getConf
in interfaceConfigurable
-
-