Meta:Spam Blacklist: Difference between revisions

From Perplex City Wiki
Jump to navigationJump to search
No edit summary
No edit summary
Line 45: Line 45:
ku.edu
ku.edu
usc.edu
usc.edu
msoe.edu


  #</pre> <!-- leave this line exactly as it is -->
  #</pre> <!-- leave this line exactly as it is -->

Revision as of 16:37, 29 June 2007

#

 
#
# In simple terms:
#    * Everything from a "#" character to the end of the line is a comment
#    * Every non-blank line is a regex fragment which will only match inside URLs
#
# Internally, a regex is formed which looks like this:
#
#    !http://[a-z0-9\-.]*(line 1|line 2|line 3|....)!Si
#
# A few notes about this format. It's not necessary to add www to the start of
# hostnames, the regex is designed to match any subdomain. Don't add patterns
# to your file which may run off the end of the URL, e.g. anything containing 
# ".*". Unlike in some similar systems, the line-end metacharacter "$" will not
# assert the end of the hostname, it'll assert the end of the page.
#

# Test blacklist
example\.com

# Spammer URL partials added 2006-10-16
casino-games
adipex
tramadol

# Spammer URL partials added 2006-12-22 because of User 1166728278
dgo5d.info
ringtone

# More spam terms, added 2007-06-10
diazepam
xenical
ringtones
wellbutrin
ativan
viagra
lorazepam
vicodin

# Pages that have been hacked (the spam keywords aren't in the URL, so blacklist isn't effective against the link)
stanford.edu/seminar
sunysb.edu
bc.edu
sc.edu
ku.edu
usc.edu
msoe.edu

 #