Package org.apache.nutch.urlfilter.fast
URL filter plugin that first does fast exact suffix matches on host/domain
names before applying regular expressions to the path component of a URL. See
FastURLFilter
for a description of
the rule format.-
Class Summary Class Description FastURLFilter Filters URLs based on a file of regular expressions using host/domains matching first.FastURLFilter.DenyAllRule Rule forDenyPath .*
orDenyPath .?
FastURLFilter.DenyPathQueryRule FastURLFilter.DenyPathRule FastURLFilter.Rule