Regular Expressions for Filtering BOT Traffic?
-
How is it titled in the ISP report exactly?
-
rackspace cloud servers
Maybe my problem is I'm not looking in the right place.
I'm in audience>technology>network and the column shows "service provider."
-
Agreed! That's why I suggest using it in combination with the variables you mentioned above.
-
Ok, try this:
^(microsoft corp|inktomi corporation|yahoo! inc.|google inc.|stumbleupon inc.|rackspace cloud servers)$|gomez
Just added rackspace as another match, it should work if the name is exactly right.
Hope this helps,
Chris
-
Does it need the . before the )
-
Not unless there's a . after the word servers in the name.  The . is escaping the . at the end of stumbleupon inc.
-
Crap.
Well, I guess the vernacular is what I need to know.
Knowing what to put where is the trick isn't it? Is there a dummies guide somewhere that spells this out in kindergarten speak?
I could really see myself botching this filtering business.
-
If you copy and paste my RegEx, it will filter out the rackspace bots. Â If you want to learn more about Regular Expressions, here is a site that explains them very well, though it may not be quite kindergarten speak.
-
I will definitely do that for Rackspace bots, Chris.
Thank you for taking the time to walk me through this and tweak my filter.
I'll give the site you posted a visit.
-
No problem, feel free to reach out if you have any other RegEx related questions.
Regards,
Chris