The Adblock Project Forum Index The Adblock Project
Pull up a seat ...stay a while.
 

Why isn't everyone here compiling the ultimate filter list?
Zachariah



Joined: 21 Jul 2004
Posts: 703
Location: Earth

PostPosted: Sun Jul 25, 2004    Post subject: Reply with quote

kstahl wrote:
Is there any performance difference between these filters:
I am not sure if it would have that kind of info, but
http://www.regular-expressions.info/
is a nice site to help understand regexps.

(It's created by the creator of EditPad Pro, which I'm not affiliated with in any way, but it is my favorite text editor and I have paid for many upgrades over the years.)
_________________
Latest Adblock!

If all else fails try a really fresh install of Firefox
.
NJH



Joined: 13 Nov 2003
Posts: 183
Location: Hampshire, England

PostPosted: Sun Jul 25, 2004    Post subject: Reply with quote

kstahl wrote:
Reminds me of something I've been wondering for a while.

Is there any performance difference between these filters:
Code:
/\Wad(server|s)?\W/
/\Wad(s|server)?\W/


I have a vague memory of the first being better...

kstahl,
I had quite a long thread going with rue towards the end of last year (it may even have been this one), trying to understand why some filters were not working. In the end rue came to the conclusion that you should always state the "bigger" alternative first, so the first of your two options is better. I think the problem I was having was where there was another mandatory part of the filter after the optional bit, so it may have been with a filter like /\Wad(s|server)?abc\W/, where the filter would (excluding the \W's) never trigger on adserverabc: it matches ad, then in the optional bit it matches the s, and having matched the s it would never try the server bit. I came up with another example where it worked OK, but the general conclusion was to do the complex match before the easier one.

Nick
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sun Jul 25, 2004    Post subject: Reply with quote

Thank you, Nick. I must've seen that thread and subconsciously remembered it, because that's the way I've been doing my filters. :)

I think this might be it:

Remember That The Regex Engine Is Eager
http://www.regular-expressions.info/alternation.html
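
The alternation-order question can be tried outside Adblock. The sketch below uses Python's re module as a stand-in; that Adblock's engine of the time behaves the same way is an assumption, and the example.com URLs are made up. It shows what "eager" means when nothing follows the group, and what a backtracking engine does when a mandatory part follows it.

```python
import re

# With nothing required after the group, the engine keeps the first
# alternative that succeeds and never looks at the later ones.
short_first = re.compile(r"ad(s|server)?")
long_first = re.compile(r"ad(server|s)?")

sample = "http://example.com/adserver/img.gif"  # hypothetical URL

eager_short = short_first.search(sample).group(0)
eager_long = long_first.search(sample).group(0)

# With a mandatory tail, a backtracking engine goes back and tries the
# next alternative when its first choice leads to a dead end.
tail_short_first = re.compile(r"\Wad(s|server)?abc\W")
tail_sample = "http://example.com/adserverabc/"  # hypothetical URL
tail_hit = tail_short_first.search(tail_sample) is not None

print(eager_short)  # text matched with "s" listed first
print(eager_long)   # text matched with "server" listed first
print(tail_hit)     # whether the short-first filter still finds adserverabc
```

In this engine, at least, the order changes how much text the group grabs and how much retrying is needed, so putting the longer alternative first remains a reasonable habit.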
_________________
Adblock 0.5.3.042
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8) Gecko/20051111 Firefox/1.5
kalisto_911



Joined: 29 Jul 2004
Posts: 2

PostPosted: Thu Jul 29, 2004    Post subject: Reply with quote

This is my list at the time

Quote:

[Adblock]
http://a.as-us.falkag.net/*
http://pagead2.googlesyndication.com
http://www.phun.org/phun/gfx/banner/*
http://bans.bride.ru/*
*adserver*
http://c1.zedo.com/*
http://www.hotlinkpics.com/*
http://a1444.g.akamai.net/*
http://us.a1.yimg.com/*
http://banners.*
http://images.webattack.com/waimages/filescavban.gif
http://images.webattack.com/ads/*
http://tribalfusion.speedera.net/*
http://www.qksrv.net/image-1432625-10282733
http://www.qksrv.net/image-*
http://jnova.cjt1.net/*
http://ak.maxserving.com/*
http://s0b.bluestreak.com/*
http://kermit.macnn.com/*
http://engage.everyone.net/*
http://a.tribalfusion.com/*
yimg.com/*.js
us.yimg.com/a/
/\/buy_assets\//
/[\W\d](double|fast)click[\W\d]/
/[\W\d]click(stream|thrutraffic|thru|xchange)[\W\d]/
/[\W\d]value(stream|xchange|click)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/\.geocities.com/js_source/(ygNSLib9|pu5geo).js/
/\Wgoogle(adservices|syndication)\W/
*adview.php*
http://media.bonnint.net/ksl/*
http://view.atdmt.com/*
http://media.fastclick.net/*



but now for some reason, even when I disable adblock, no images at all show up on ebay. Anyone know what gives?
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Thu Jul 29, 2004    Post subject: Reply with quote

You've probably restricted image loading in Firefox.

Tools > Options > Web Features

Make sure that "Load Images" is enabled, but "for the originating Web site only" is DISabled.
kalisto_911



Joined: 29 Jul 2004
Posts: 2

PostPosted: Thu Jul 29, 2004    Post subject: Reply with quote

kstahl wrote:
You've probably restricted image loading in Firefox.

Tools > Options > Web Features

Make sure that "Load Images" is enabled, but "for the originating Web site only" is DISabled.


Good Call
Drache
Guest





PostPosted: Mon Aug 02, 2004    Post subject: Banner not getting stopped? Reply with quote

I was going to mention that us.falkag.net is a new advertiser on the scene with scripts, iframes and other junk loading... but Mr 911 previously mentioned it.

Also, I'm using most of the regular expression filters as mentioned in the "advanced" section.

Quote:
*us.falkag.net*
/[\W\d](double|fast)click[\W\d]/
/[\W\d](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W\d]/
/[\W\d](top|bottom|left|right)?banner(s|id=|\d)[\W\d]/
/[\W\d]click(stream|thrutraffic|thru|xchange)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/[\W\d]value(stream|xchange|click)[\W\d]/
/[\W_](b(an|nr)s?|jump|redir(ect|s)?|stat)[\W_]/
/\/buy_assets\//
/\W(cy|r)?c(ou)?nt(er|ed)?\W/
/\Wadvertis(ing|ement)\W/
http://pagead2.googlesyndication.com/
http://view.atdmt.com/*
http://www.ocremix.org/includes/php/banner.php
us.yimg.com/a/
yimg.com/*.js


Why is it not catching http://www.ocremix.org/includes/php/banner.php unless I specify it directly then?
Drache
Guest





PostPosted: Mon Aug 02, 2004    Post subject: Banner Reply with quote

Nevermind! I found one courtesy of Zachariah.

Code:
 /[\W\d_]banner(s|farm)?[\W\d_]/


I've since added this, and it works just fine!
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Mon Aug 02, 2004    Post subject: Reply with quote

It will also work if you change
Code:
/[\W\d](top|bottom|left|right)?banner(s|id=|\d)[\W\d]/

to
Code:
/[\W\d](top|bottom|left|right)?banner(s|id=|\d)?[\W\d]/


The first version requires that "banner" is followed by either an s, id=, or a digit. The modified version makes it optional instead.
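
To see the difference the extra ? makes, here is a small check using Python's re module as a stand-in for Adblock's matcher (an assumption; only the two filters and the ocremix URL come from this thread).

```python
import re

url = "http://www.ocremix.org/includes/php/banner.php"

# Original filter: "banner" must be followed by s, id= or a digit.
required_suffix = re.compile(
    r"[\W\d](top|bottom|left|right)?banner(s|id=|\d)[\W\d]"
)
# Modified filter: the suffix group is optional.
optional_suffix = re.compile(
    r"[\W\d](top|bottom|left|right)?banner(s|id=|\d)?[\W\d]"
)

blocked_before = required_suffix.search(url) is not None
blocked_after = optional_suffix.search(url) is not None

print(blocked_before, blocked_after)
```

In banner.php the character after "banner" is a period, which none of s, id= or a digit can absorb, so only the version with the optional group reaches the closing [\W\d].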
aliumalik



Joined: 20 Jul 2004
Posts: 15

PostPosted: Thu Aug 05, 2004    Post subject: Reply with quote

From what I have filtered through this thread and my personal Adblock usage of around 2 months, these seem to be the best filter options.
Quote:
[Adblock]
googlesyndication
us.yimg.com/a/
/\/buy_assets\//
/[\W\d_](top|bottom|left|right|)?banner(s|id=|\d|_)?[\W\d]/
/[\W\d](double|fast)click[\W\d]/
/[\W\d]click(stream|thrutraffic|thru|xchange)[\W\d]/
/[\W\d]value(stream|xchange|click)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/[\W_](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|v|vert(s|ising|isements)?|xchange)?)[_\W\d]/
/(hot|spy)log/
/[\W_](b(an|nr)s?|jump|redir(ect|s)|stat)[\W_]/
/\W(cy|r)?c(ou)?nt(er|ed)?\W/
/p(artner|ing\.cgi|romotion)/
reklama
/sp(onsor|ymagic)/
/top(100|cto)/


Other than these I have a few site specific filters like
Code:
/\Wa(tdmt|twola|dsdk)\W/
falkag
qksrv
http://bb.connextra.com/selector/*
http://www.resellerratings.com/price-direct-theinquirer.pl
http://www.vibrantmedia.com/images/spacer.gif
http://img.cricinfo.com/cricinfo/scbank/*
http://common.ziffdavisinternet.com/util_get_image/2/0,1311,i=25788,00.gif
http://rad.msn.com/ADSAdClient31.dll


http://rad.msn.com/ADSAdClient31.dll is a very good filter for hotmail users as it completely removes the ad iframe

Now can someone optimize (more) the first set of filters please?
Thanks
Guest






PostPosted: Fri Aug 06, 2004    Post subject: Reply with quote

Just wanted to say Org's list from page 2 is good too. Have you seen it, aliumalik?

Quote:
[Adblock]
.telkku.com/*.swf
.yimg.com/a/
/(ad-flow|adsdk|advertising|bizrate|prohosting|resellerratings|tradedoubler|\.atwola|\.atdmt|valueclick|\.adtrix)\.com/
/(doubleclick|fastclick|spinbox|falkag)\.net/
/(yahoo|geocities)\.com\/(toto\?s|js_source\/)/
/[.\/]adcontent[.\/]/
/[.\/_](page|online)?ad(v|_banner|[zs\?]|click|image.php|image[zs]?|serv(er?)?)?[_\/\?]/
/[\/]banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|.?\d*\.)/
/\/\?DC=/
/\/ad(js.php|size=|frame)/
/\/bnr\//
/_ad[sz]?\.(php|js|gif|jpg|swf)/
/aslframe.html
/bd.m?
/http:\/\/(mainos|rcm-images)\./
/http:\/\/banner[sz]?\./
/http:\/\/r?ad([sz]?|images?|img|tech|serv(er?)?)\./
/imdb\.com\/(google\/|.*\.swf)/
/popup(\d*)?\.js/
http://www.theregister.co.uk/media/
aliumalik



Joined: 20 Jul 2004
Posts: 15

PostPosted: Fri Aug 06, 2004    Post subject: Reply with quote

Most of Org's list seems site specific to me, i.e. yahoo, imdb etc. This is "universal". Choose what works best for you though.
Regards
Ali
fir
Guest





PostPosted: Fri Aug 06, 2004    Post subject: open adblock preferences Reply with quote

how can you open it in order to put it in?
fir
Guest





PostPosted: Fri Aug 06, 2004    Post subject: Reply with quote

i mean adblock preferences
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Fri Aug 06, 2004    Post subject: Reply with quote

fir,

Cut and paste the list into a text file. Go to Tools > Adblock > Preferences and click the Adblock Options button. You'll find the import function in that menu.
aliumalik



Joined: 20 Jul 2004
Posts: 15

PostPosted: Fri Aug 06, 2004    Post subject: Reply with quote

What would be the regexp for any word string? I want to block anyword_kiosk. What would the expression be?
Thanks
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Fri Aug 06, 2004    Post subject: Reply with quote

Regular Expressions.

You could try something like:
Code:
/[a-z]*_kiosk/


Or, if the word can contain digits too:
Code:
/\w*kiosk/

(\w matches underscore as well as digits and letters)

I tested them in RegEx Coach and they both work fine.
http://www.weitz.de/regex-coach/

Great site on RegExs:
http://www.regular-expressions.info/
NJH



Joined: 13 Nov 2003
Posts: 183
Location: Hampshire, England

PostPosted: Fri Aug 06, 2004    Post subject: Reply with quote

kstahl wrote:
Regular Expressions.

You could try something like:
Code:
/[a-z]*_kiosk/


Or, if the word can contain digits too:
Code:
/\w*kiosk/

If you have a good think about it, you might as well match a simple expression
Code:
_kiosk
since the only difference between your second filter and this is that my version will also match things like /_kiosk., which I would guess is not too likely. The regex version would be
Code:
/_kiosk/
.
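
For anyone who wants to compare the three kiosk filters side by side, here is a rough check in Python's re module (a stand-in for Adblock's matcher, which is an assumption; the example.com URLs are invented). A filter is searched for anywhere in the URL, so a leading part that may match nothing adds nothing.

```python
import re

filters = {
    "letters_then_kiosk": re.compile(r"[a-z]*_kiosk"),
    "plain_kiosk": re.compile(r"_kiosk"),
    "word_chars_kiosk": re.compile(r"\w*kiosk"),
}

# Hypothetical URLs, one per interesting case.
urls = [
    "http://example.com/promo_kiosk/a.gif",   # word, underscore, kiosk
    "http://example.com/_kiosk.js",           # underscore directly after /
    "http://example.com/kiosk/index.html",    # kiosk without an underscore
    "http://example.com/news/index.html",     # no kiosk at all
]

results = {
    name: [bool(rx.search(u)) for u in urls]
    for name, rx in filters.items()
}

for name, hits in results.items():
    print(name, hits)
```

Here the first two filters agree on every URL, while the \w*kiosk version also catches a bare "kiosk" because it does not require the underscore.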

Nick
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Fri Aug 06, 2004    Post subject: Reply with quote

Well, your filter could also match #%#&%&%#!!!#"!#")(/==(/&_kiosk.

Seriously, you are of course correct.
Org



Joined: 23 Oct 2003
Posts: 349

PostPosted: Sun Aug 08, 2004    Post subject: Reply with quote

aliumalik wrote:
most of orgs list seems site specific to me i.e yahoo, imdb etc.

Actually, no. Most filters in my list are trying to be as universal as possible without creating false positives. The few site specific filters are there only to prevent false positives. I get just about no unwanted blocks at all, but YMMV depending on the sites you visit. Whenever I get a false positive, I will refine my list.

aliumalik wrote:
Choose what works best for you though.

Exactly. After comparing a few filter lists here it soon becomes obvious why it's very difficult to ship Adblock with any kind of default list.
I



Joined: 08 Aug 2004
Posts: 21

PostPosted: Sun Aug 08, 2004    Post subject: false positive Reply with quote

Org,

Overall I'm very happy with your list... I do get a false positive though, at a site that stupidly uses /images/banner_01.gif, 02.gif, 03.gif etc. for its buttons. I believe it's
Code:
/[\/]banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|.?\d*\.)/
that's blocking it (even though I'm not 100% sure how). I was wondering if you can tell me how to revise this.


Also I saw the crash course on regexps and it gave me a decent idea of how they work. Something that wasn't mentioned was when you use [] brackets and what a ? question mark is used for.

Thanks
Org



Joined: 23 Oct 2003
Posts: 349

PostPosted: Sun Aug 08, 2004    Post subject: Reply with quote

That filter has many conditional elements. "/banner_01.gif" gets triggered by this branch of the RegExp:

[\/]banner.?\d*\.

(Actually you can remove the square brackets from the start, because the character class has only one character. Earlier it used to have more, but they got edited out because of false positives.)

The offending part is this:

.?\d*\.

".?" means zero or one of any character -> this matches "_" of "/banner_01.gif" ("." is any character, and "?" matches the previous element 0 or 1 times)

"\d*" means zero or more digits -> this matches "01" ("\d" is a single digit between 0-9, and * matches the previous element 0 or more times)

"\." matches period -> this matches "." from ".gif" ("." has special meaning, so it must be escaped with "\" to prevent the "any character" meaning)

This is one more proof that everybody should customize their own filters. I added the offending branch specifically to block images like "/banner_01.gif". It works for me but not for you, because we visit different pages.

The best way to revise the filter to match your needs is to either remove the offending block, or change the part that matches "_", ie. this: ".?". You could try if one of these variants of the filter work better for you:

/\/banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.)/

/\/banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|[-.]?\d*\.)/

The latter would match "/banner-01.gif", "/banner.01.jpg" and "/banner01.swf", but not "/banner_01.gif"
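
The three versions of the banner filter can be compared against those file names with a few lines of Python. The re module is only a stand-in for Adblock's matcher here (an assumption), and the variable names are made up for the sketch.

```python
import re

# The filter as posted, with the ".?\d*\." branch at the end.
original = re.compile(
    r"[\/]banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|.?\d*\.)"
)
# First variant: the offending branch removed.
without_branch = re.compile(
    r"\/banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.)"
)
# Second variant: ".?" narrowed to "[-.]?".
narrowed = re.compile(
    r"\/banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|[-.]?\d*\.)"
)

names = ["/banner_01.gif", "/banner-01.gif", "/banner.01.jpg", "/banner01.swf"]

original_hits = [bool(original.search(n)) for n in names]
without_hits = [bool(without_branch.search(n)) for n in names]
narrowed_hits = [bool(narrowed.search(n)) for n in names]

for n, a, b, c in zip(names, original_hits, without_hits, narrowed_hits):
    print(n, a, b, c)
```

Removing the branch lets all four names through, while the narrowed version lets only the underscore name through, so which one to pick depends on whether the numbered banners on your sites are ads.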
I



Joined: 08 Aug 2004
Posts: 21

PostPosted: Sun Aug 08, 2004    Post subject: fixed! Reply with quote

Org,

That worked

(Your filter works great. I added
Quote:
/[\W\d](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W\d]/

that I copied from somewhere else to cover /ads2/ (and I guess some more stuff too).)

thanks a lot
Anystupidassname



Joined: 13 Aug 2004
Posts: 1

PostPosted: Fri Aug 13, 2004    Post subject: Cleanup Reply with quote

Hello All,

Thanks for all of the filter examples everybody.
I'm wondering if one of the RegExp specialists in here would be willing to clean up/combine/discombobulate this little collection. I'll have to read up on RegExp to figure out how it works eventually, but for now I'm being a lazy bastard. I know one of you gurus could do this in 60 seconds flat so please, would you? I'm thinking this could be reduced to 10 lines or so with the proper leetness, removing redundancies, combining expressions etc...
Quote:
[Adblock]
/(new|double|fast|value)click./
/[\/.]ban(ner|nerfarm|neri|image|source)s?[\/.]/
/[\/._-](|dhtm|i)ad(banner||mentor|s|s2|sdk|sv3|server|trix|image|img|log|vt|bureau|counter|v|vert|vertising|vertisement)s?[\/._?-]/
/[\/._]promo(|tion)s?[\/._]/
/[\/\.](smart)?ad(server|s)?\d*[\/\.]/
/[\/](associates|affiliates|us.yimg.com\/a)[\/]/
/[\W\d\/.](203.199.70.2|2o7|atdmt|atwola|bfast|bluestreak|coremetrics|dgm2|falkag|hitbox|marketbanker|qksrv|ru4|tribalfusion|zedo)[\W\d\/.]/
/[\W\d](ad|dime|double|fast|value|click)(stream|s|thrutraffic|thru|xchange|click)[\W\d]/
/[\W\d](double|fast)click[\W\d]/
/[\W\d](onlineads?|ad(banner|click|frame|images?|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|xchange)?)[\W\d]/
/[\W\d]click(stream|thrutraffic|thru|xchange)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/[\W\d]value(stream|xchange|click)[\W\d]/
/[\W_]ad(click|cycle|image|js|server|s|tech|trix|vert(ising)?|words)?[\W\d_]/
/[\W_]banner(s|id=)?[\W\d_]/
/\/buy_assets\//
/\?clickTAG\d?=/
/\D(588|468|234|120)x600?\D/
/\W(affiliates|annons(er)?|associates|marketing|promos|media)\W/
/\W(double|fast|value)click[\W\d]/
/\W(ned|one)stat(basic)?\W/
/\W(page|side|value)ads\W/
/\Wat(dmt|wola)\W/
/\Wgoogle(adservices|syndication)\W/

This is what my list currently looks like. Lame huh?
Quote:
[Adblock]
*&ad.*
*&banner_*
*-ad.*
*-ads.*
*.a1.yimg.com*
*.ad&*
*.ad-*
*.ad.*
*.ad/*
*.adpost.*
*.ads.*
*.ads/*
*.ads_*
*.adv-*
*.adv.*
*.atdmt.com/*
*.banner*
*.banner.*
*.banner/*
*.banners.*
*.banners/*
*.content.ru4.com/images
*.doubleclick.*
*.fineclicks.com*
*.maxserving.com*
*.partner2profit.com/*
*.pued.com*
*.websponsors.com/*
*.zope.com/*
*/BANNER*
*/ab/*
*/ad*.nextag.com*
*/ad-*
*/ad.*
*/ad/*
*/ad2.*
*/ad?*
*/ad_*
*/adclick.*
*/ads.*
*/ads/*
*/ads5.*
*/ads_*
*/adview.*
*/assets/*
*/ban.*
*/banner*
*/banner.*
*/banner/*
*/banner_*
*/banners.*
*/banners/*
*/bans.*
*/counter.*
*/dyad/*
*/imageads.*
*/img.crack*.jpg
*/onlineads/*
*/pagead2.*
*/skyscrapers/*
*/sponsors/*
*100x400*
*120x240*
*125x125*
*160x160*
*468x60*
*70x468*
*728x90*
*88x33*
*ADBOX*
*AdClient*
*AdsPub*
*Banner*.jpg
*Outiemkm*.gif
*Outiemkm*.swf
*_ad.*
*_ads.*
*_ads/*
*_banner*
*adbureau*
*adforce*
*adimage*
*admaster*
*adprofile*
*adquest*
*adreporting*
*ads*120x90*
*ads/*.gif
*ads/*.jpg
*ads0*
*ads1*
*ads2*
*ads3*
*adserve*
*advert*
*advertising*
*ar.atwola.*
*atdmt*
*b.as-us.falkag.net/*
*babescamp*
*banner*.gif
*banner.*
*bannerads*
*bfc.myway.com/search/fiSearchMax.js
*clickxchange.com/*
*di.image.eshop.msn.com/img/*
*dvdrhelp.com/images/ab/*
*dvdrhelp.com/images/ccc*
*dvdsqueeze*
*f.about.com/*
*fastclick*
*gcw*.gif
*gcw*.swf
*gcw.devioustrap*
*goclickfind*
*gsads*
*i.a.cnn.net/cnn/.element/img/1.0/sect/LAW/*
*i.cnn.net/cnn/.element/img/1.1/misc/cb/*
*images.ibsys.com/sh/sponsors/*
*images.icnetwork.co.uk/upl/*
*img.freexxxpages.*
*miniban*
*msnbcmedia.msn.com/i/msnbc/Components/Art/SITEWIDE/In_house_promos/*
*pagead*
*popup.msn.com/*
*qksrv.net/*
*s0b.bluestreak.com/*
*satirewire.com/images/TSTSbutton.jpg
*satirewire.com/images/eoe120-click-here.gif
*services.springstreetnetworks.com/Services/Pod/Images/*
*sextracker*
*sonnerie.net/external*
*spacetarget.com/games/i/*
*sportsbybrooks.com/*
*static.worldnow.com/Revenue/images/*
*tribalfusion.speedera.net/*/media/*
*wiretarget.com*
*wx.gamecopyworld*
http://astalavista.box.sk/adult20.jpg
http://www.independent.co.uk/images/*SKYSCRAPER*
install.xxxtoolbar.com

Thanks so much for any help!
Zachariah



Joined: 21 Jul 2004
Posts: 703
Location: Earth

PostPosted: Fri Aug 13, 2004    Post subject: Reply with quote

not a regex expert (yet) but I'd have to say that your list looks like good test cases to check for making regex's
h00ligan
Guest





PostPosted: Fri Aug 13, 2004    Post subject: maxboxing.com Reply with quote

anyone want to venture a stab at maxboxing.com.. the right nav stuff i starred.. it's like 30 flash pieces on one page with no rhyme or reason..

bah, any ideas? - aside from a couple of /nav_right/*.swf and per element filters?
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Fri Aug 13, 2004    Post subject: Reply with quote

h00ligan,

I don't think there is any smart way to block ads at that site. You'll have to do as you said, use /nav_right/*.swf and some specific filters.
asdf
Guest





PostPosted: Tue Aug 17, 2004    Post subject: Reply with quote

I'd really like to see a really good filter list I could recommend to people. The one in the advanced filter thread is a bit lacking, and hasn't been updated with the newer suggestions from this thread.

Would someone look over my filter list and make recommendations?
Quote:
[Adblock]
.iad.
/(\/|_)?ad(bureau|Wrapper|system|content|mentor|count|space|client|image|word|cycle|frame|view|tech|trix|img|sys|vt|_|\.)/
/(ad-flow|adsdk|advertising|bizrate|prohosting|resellerratings|tradedoubler|\.atwola|\.atdmt|valueclick|\.adtrix)\.com/
/(hot|spy|ad)log/
/(yahoo|geocities)\.com\/(toto\?s|js_source\/)/
/[.\/_](page|online)?ad(v|_banner|[zs\?]|click|image.php|image[zs]?|serv(er?)?)?[_\/\?]/
/[\/]banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|.?\d*\.)/
/[\W\d](double|value|human|specific|banner|sponsor|fast|com|ad)click[\W\d]/
/[\W\d](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W\d]/
/[\W\d](side|bottom|right|left|top)ad[\W\d]/
/[\W\d]click(thrutraffic|xchange|stream|ability|thru|zs)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/[\W\d]value(stream|xchange|click|ad)[\W\d]/
/[\W\d_](ad)?serv(edby|ing)?[\W\d_]/
/[\W\d_](banner|ad)farm[\W\d_]/
/[\W\d_](bottom|side|left|right|top)?banner(source|farm|adfiles|id=|s|\d)?[_\W\d]/
/[\W_](b(an|nr)s?|jump|redir(ect|s)?|stat)[\W_]/
/\/\?DC=/
/\/bnr\//
/\/buy_assets\//
/\?clickTAG\d?=/
/\D\d{2,3}x\d{2,3}\D/
/\W(affiliates|annons(er)?|associates|marketing|promos|media)\W/
/\W(cy|r)?c(ou)?nt(er|ed)?\W/
/\W(ned|one)stat(basic)?\W/
/\Wa(tdmt|twola|dsdk)\W/
/\Wgoogle(adservices|syndication)\W/
/_ad[sz]?\.(php|js|gif|jpg|swf)/
/ban(ner|ners|man)/
/p(romotion|artner|ing\.cgi)/
/sads
/sp(onsor|ymagic)/
/top(100|cto)/
/track(er|ing)?/
207.net
a1.yimg.com
adcentriconline.com
addfreestats.com
addupp.com
advertising.com
affiliatebot.com
bfast.com
blogads.com
click-fr.com
clickability.com
clickzs.com
comclick.com
counter
dtmpub.com
extreme-dm
falkag
gator.com
hbads
hitbox
humanclick.com
ia.imdb.com
imdb.com/google
interpolls.com
liveperson.net
maxserving.com
nedstatbasic.net
netshelter.net
popup
qksrv.net
rcm-images.amazon.com
rcm.amazon.com
reklama
sageanalyst.net
setadcookie
sitemeter.com
specificclick.net
spinbox.net
superstats.com
textads
tradepub.com
trafic.ro
valuead.com
vibrantmedia.com
webtrendslive
yimg.com/*/bb/*
yimg.com/a/
bene



Joined: 17 Aug 2004
Posts: 123
Location: Home, I think

PostPosted: Wed Aug 18, 2004    Post subject: Reply with quote

.iad.
Not a RegExp, that's very specific.
/(\/|_)?ad(bureau|Wrapper|system|content|mentor|count|space|client|image|word|cycle|frame|view|tech|trix|img|sys|vt|_|\.)/
The "(\/|_)?" is unnecessary. It's been shown that forwardslash doesn't need a preceding backslash - update to "(/|_)?". However, this section of the regex says "find a character that is either / or _, or don't". The "or don't" comes from the question mark, which says "match 0 or 1 times", which isn't particularly useful at the start of a RegExp - it effectively means that any character can precede what follows. Use ? inside your RegExp, but not at the front or back - you can see proper use in a later filter - |ima?g(es?)?| says |img|imge|imges|imag|image|images|, without iterating them all.
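
Both points can be checked quickly. This is a sketch in Python's re module (a stand-in for Adblock's matcher, which is an assumption); the shortened ad(bureau|system) filter and the x.example URLs are made up for illustration.

```python
import re

# Which spellings does the ima?g(es?)? fragment accept in full?
fragment = re.compile(r"ima?g(es?)?")
candidates = ["img", "imge", "imges", "imag", "image", "images",
              "imgs", "imags", "im"]
accepted = [w for w in candidates if fragment.fullmatch(w)]

# A leading optional group does not change what a search finds.
with_prefix = re.compile(r"(\/|_)?ad(bureau|system)")
without_prefix = re.compile(r"ad(bureau|system)")
urls = [
    "http://x.example/adbureau/1.gif",
    "http://x.example/_adsystem.js",
    "http://x.example/badsystem.js",
    "http://x.example/news.html",
]
prefix_hits = [bool(with_prefix.search(u)) for u in urls]
plain_hits = [bool(without_prefix.search(u)) for u in urls]

print(accepted)
print(prefix_hits, plain_hits)
```

Dropping the optional prefix leaves the results unchanged; making it mandatory (no question mark) is what would narrow the filter.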
/(ad-flow|adsdk|advertising|bizrate|prohosting|resellerratings|tradedoubler|\.atwola|\.atdmt|valueclick|\.adtrix)\.com/
This filters for advertising.com, which is also explicitly included later. You should have one RegExp for all domains you want to exclude - I find it easiest to do "/(domain1|domain2|domain3)\.(com|net)/" If you want to be specific filter "adsdk.com" but not "badsdk.com", modify to be "/\.(domain1|domain2|domain3)\.(com|net)/"
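
A short illustration of the two domain styles, again using Python's re as a stand-in for Adblock's matcher (an assumption). Only three of the domains are used to keep it short, and the URLs are invented.

```python
import re

loose = re.compile(r"(adsdk|atdmt|atwola)\.(com|net)")
strict = re.compile(r"\.(adsdk|atdmt|atwola)\.(com|net)")

hosts = [
    "http://www.adsdk.com/x.js",    # the domain we want to block
    "http://www.badsdk.com/x.js",   # unrelated domain containing "adsdk"
    "http://adsdk.com/x.js",        # same domain with no subdomain
]

loose_hits = [bool(loose.search(h)) for h in hosts]
strict_hits = [bool(strict.search(h)) for h in hosts]

print(loose_hits)
print(strict_hits)
```

One thing to watch with the stricter form: it requires a period before the name, so a URL that uses the bare domain with no subdomain slips past it.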
/(hot|spy|ad)log/
/(yahoo|geocities)\.com\/(toto\?s|js_source\/)/
These clean up some of the content on Geocities pages?
/[.\/_](page|online)?ad(v|_banner|[zs\?]|click|image.php|image[zs]?|serv(er?)?)?[_\/\?]/
The "\/" should be just "/", unless you are trying to also include "\", which would be "\\/". In "[.\/_]", I think that the "." is a typo - it should either be "[\./_]" or just, simply ".". The rest makes this a pretty potent filter anyways.
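
For what it's worth, here is how square brackets treat these characters in Python's re module. Adblock's own engine may differ in detail, so treat this as a sketch under that assumption. In this engine a period inside [] is a literal period, and \/ inside [] means just a forward slash.

```python
import re

escaped = re.compile(r"[.\/_]ad")
unescaped = re.compile(r"[./_]ad")

# Each probe is one character followed by "ad"; the last one starts
# with a single backslash.
probes = [".ad", "/ad", "_ad", "xad", "\\ad"]

escaped_hits = [bool(escaped.fullmatch(p)) for p in probes]
unescaped_hits = [bool(unescaped.fullmatch(p)) for p in probes]

print(escaped_hits)
print(unescaped_hits)
```

Both spellings accept a period, a slash or an underscore before "ad" and nothing else, so in this engine the class does not behave like the "any character" dot.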
/[\/]banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|.?\d*\.)/
"\/" should be "\\/" (to get both) or just "/" (to get just /). The last section "|.?\d*\.)" does some damage to the specific nature of this RegExp - it now matches "banner" followed by any character then a period, or "banner" followed by a period.
/[\W\d](double|value|human|specific|banner|sponsor|fast|com|ad)click[\W\d]/
/[\W\d](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W\d]/
I've updated this one with "|se?rv(er|e)?|" instead of "|serv(er|e)?|", "|vt?|" instead of "|v|"
/[\W\d](side|bottom|right|left|top)ad[\W\d]/
/[\W\d]click(thrutraffic|xchange|stream|ability|thru|zs)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/[\W\d]value(stream|xchange|click|ad)[\W\d]/
/[\W\d_](ad)?serv(edby|ing)?[\W\d_]/
/[\W\d_](banner|ad)farm[\W\d_]/
/[\W\d_](bottom|side|left|right|top)?banner(source|farm|adfiles|id=|s|\d)?[_\W\d]/
/[\W_](b(an|nr)s?|jump|redir(ect|s)?|stat)[\W_]/
/\/\?DC=/
/\/bnr\//
/\/buy_assets\//
/\?clickTAG\d?=/
.
You could include this in the above "click(thrutraffic|xchange|..." as "click(tag|thrutraffic|xchange|...", which would be less specific, but reduce the number of filters.

/\D\d{2,3}x\d{2,3}\D/
/\W(affiliates|annons(er)?|associates|marketing|promos|media)\W/
/\W(cy|r)?c(ou)?nt(er|ed)?\W/
/\W(ned|one)stat(basic)?\W/
/\Wa(tdmt|twola|dsdk)\W/
/\Wgoogle(adservices|syndication)\W/
/_ad[sz]?\.(php|js|gif|jpg|swf)/
Might want to see how this could be integrated into the filter that is currently "/[.\/_](page|online)?ad(v|_banner|...". It may fit.
/ban(ner|ners|man)/
This filter makes any previous filter based on matching "banner" redundant. This filter says: "If it has 'banner', exclude. If it has 'banners', exclude. If it has 'banman', exclude." If you are going to keep this one, you'll want to drop a couple filters from above.
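
A rough way to see the overlap, with Python's re standing in for Adblock's matcher (an assumption) and invented example.com URLs:

```python
import re

broad = re.compile(r"ban(ner|ners|man)")
bounded = re.compile(
    r"[\W\d_](bottom|side|left|right|top)?banner"
    r"(source|farm|adfiles|id=|s|\d)?[_\W\d]"
)

urls = [
    "http://example.com/img/topbanner_1.gif",
    "http://example.com/banners/a.gif",
    "http://example.com/shop/bannerfish.html",  # not an ad
]

broad_hits = [bool(broad.search(u)) for u in urls]
bounded_hits = [bool(bounded.search(u)) for u in urls]

# Everything the bounded filter catches is also caught by the broad one.
covered = all(b for b, n in zip(broad_hits, bounded_hits) if n)

print(broad_hits, bounded_hits, covered)
```

The broad filter also catches the third URL, so keeping it trades the boundary checks of the earlier filters for more false positives.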
/p(romotion|artner|ing\.cgi)/
This one seems a bit confused - it's going for "promotion", "partner" and "ping.cgi". Maybe it is intended to be like this: /p(romotion|artner(ing)?)\.cgi/, which says "promotion.cgi", "partner.cgi", "partnering.cgi".
/sads
Ummm... Check this?
/sp(onsor|ymagic)/
/top(100|cto)/
/track(er|ing)?/
Below here can be consolidated, cleaned up, and the like.

207.net
a1.yimg.com
adcentriconline.com
addfreestats.com
addupp.com
advertising.com
affiliatebot.com
bfast.com
blogads.com
click-fr.com
clickability.com
clickzs.com
comclick.com
counter
dtmpub.com
extreme-dm
falkag
gator.com
hbads
hitbox
humanclick.com
ia.imdb.com
imdb.com/google
interpolls.com
liveperson.net
maxserving.com
nedstatbasic.net
netshelter.net
popup
qksrv.net
rcm-images.amazon.com
rcm.amazon.com
reklama
sageanalyst.net
setadcookie
sitemeter.com
specificclick.net
spinbox.net
superstats.com
textads
tradepub.com
trafic.ro
valuead.com
vibrantmedia.com
webtrendslive
yimg.com/*/bb/*
yimg.com/a/
asdf
Guest





PostPosted: Thu Aug 19, 2004    Post subject: Ultimate adblock filter list Reply with quote

Quote:
.iad.

Not a RegExp, that's very specific.
I know. I forget what it's for.
Quote:
/(hot|spy|ad)log/
/(yahoo|geocities)\.com\/(toto\?s|js_source\/)/

These clean up some of the content on Geocities pages?
The second one does.
Quote:
/[.\/_](page|online)?ad(v|_banner|[zs\?]|click|image.php|image[zs]?|serv(er?)?)?[_\/\?]/

The "\/" should be just "/", unless you are trying to also include "\", which would be "\\/". In "[.\/_]", I think that the "." is a typo - it should either be "[\./_]" or just, simply ".". The rest makes this a pretty potent filter anyways.
What about the end part?
[_\/\?]
?
Quote:
/[\/]banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|.?\d*\.)/

"\/" should be "\\/" (to get both) or just "/" (to get just /). The last section "|.?\d*\.)" does some damage to the specific nature of this RegExp - it now matches "banner" followed by any character then a period, or "banner" followed by a period.
Why is that a bad thing? What would you suggest?
Quote:
/_ad[sz]?\.(php|js|gif|jpg|swf)/

Might want to see how this could be integrated into the filter that is currently "/[.\/_](page|online)?ad(v|_banner|...". It may fit.
OK, help would be appreciated. I'm still a complete newb at RegEx. I only know a bit of the very basics.
Quote:
This one seems a bit confused - it's going for "promotion", "partner" and "ping.cgi". Maybe it is intended to be like this: /p(romotion|artner(ing)?)\.cgi/, which says "promotion.cgi", "partner.cgi", "partnering.cgi".
OK, fixed.
Quote:
/sads

Ummm... Check this?
I added that myself. It was the only filter that would work for a certain site. Don't remember which. It's not intended to be regex or anything.


Thanks very much for your help. Here's an updated version of my list, which has a few extra tweaks you didn't specifically point out. Could you review it again?

asdf's new list wrote:
[Adblock]
.iad.
/(adcentriconline|addfreestats|vibrantmedia|interpolls|liveperson|nedstatbasic|tradedoubler|sageanalyst|advertising|sitemeter|spinbox|superstats|ad-flow|tradepub|blogads|click-fr|bizrate|prohosting|dtmpub|gator|qksrv|adsdk|addup|\.atwola|\.atdmt|valueclick|\.adtrix|bfast|207)\.(com|net)/
/(hot|spy|ad)log/
/(yahoo|geocities)\.com\/(toto\?s|js_source\/)/
//\?DC=/
//bnr//
//buy_assets//
/[/]banner([sz]?/|it/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|.?\d*\.)/
/[\./_](page|online)?ad(v|[zs\?]|click|image.php|image[zs]?|serv(er?)?)[_\/\?]/
/[\W\d](click|value|dime|ad)xchange[\W\d]/
/[\W\d](double|value|human|specific|sponsor|dime|fast|com|ad)click[\W\d]/
/[\W\d](onlineads?|ad(click|-?flow|frame|ima?g(es?)?|_id|js|log|se?rv(er|e)?|stream|_string|s|trix|type|vertisements?|vt?|vert|farm)?)[\W\d]/
/[\W\d](value|side|bottom|right|text|left|top|set)ad[\W\d]/
/[\W\d]click(stream|ability|thru|tag|zs)[\W\d]/
/[\W\d]value(stream|ad)[\W\d]/
/[\W\d_](ad)?serv(edby|ing)?[\W\d_]/
/[\W_](b(an|nr)s?|jump|redir(ect|s)?|stat)[\W_]/
/\D\d{2,3}x\d{2,3}\D/
/\W(affiliates|annons(er)?|associates|marketing|promos)\W/
/\W(cy|r)?c(ou)?nt(er|ed)?\W/
/\W(ned|one)stat(basic)?\W/
/\Wa(tdmt|twola|dsdk)\W/
/\Wgoogle(adservices|syndication)\W/
/_ad[sz]?\.(php|js|gif|jpg|swf)/
/[\W\d_]ad(bureau|Wrapper|system|content|mentor|cookie|count|space|client|image|word|cycle|frame|view|track|tech|trix|farm|img|sys|vt|_|\.)[\W\d_]/
/ban(ner|ners|man)/
/p(romotion|artner(ing)?)\.cgi)/
/sads
/sp(onsor|ymagic)/
/top(100|cto)/
/track(er|ing)?/
a1.yimg.com
counter
extreme-dm
falkag
hbads
hitbox
ia.imdb.com
imdb.com/google
popup
/rcm(-images)?\.amazon\.com/
reklama
trafic.ro
webtrendslive
yimg.com/*/bb/*
yimg.com/a/
Back to top
Guest






PostPosted: Thu Aug 19, 2004    Post subject: Reply with quote

Ok, what about the defaults for Active Directory Services? Eg: subsite.ads.domain.com

Here is my current filter list; yes, I know there are overlapping rules. Note the unblocking of .edu: those sites don't normally have ads, so excluding them reduces false positives.

Quote:

[Adblock]
/(ad|live|net|video|view)banners?/
/(hot|spy)log/
/[\W\d](double|fast)click[\W\d]/
/[\W\d]click(stream|thrutraffic|thru|xchange)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/[\W\d]value(stream|xchange|click)[\W\d]/
/[\W\d_](top|bottom|left|right|)?banner(s|id=|\d|_)[\W\d]/
/[^\w|&|=|\+](fast|smart|value)?clicks?(2net|adhere|over|trade|xchange)?[\W](?!\w+\.edu)/
/[^\w|&|=|\+](html|live|main|net|show|view)?ad[sv]?(ales|bot|center|click|client|content|council|count|data|ert|ertise?r?s?|ertising|erve?r?|iew|gifs?|id|images?|info|juggler|link|log|man|max|net|optimis?z?er|pics|popup|proof|redire?c?t?)?[\W_](?!\w+\.edu)(?!aware)/
/[^\w|&|=|\+][^&][^\+]promos?t?e?(ions?)?[\W_](?!\w+\.edu)/
/[^\w|&|=|\+]sponsors?[\W_]/
/\D\d{2,3}x\d{2,3}\D/
/affiliates/
/banner(?=ad|exchange|swap)/
/linkshare/
/promos?\.gif/
/yimg.com/a[^u]/
adsdk.com
atdmt.com
atwola.com
bfast.com
clk_thru
doubleclick.com
flycast.com
hitbox.com
http://www.wpdfd.com/*.swf
imgis.com
instacontent.net
java.yahoo.com/a
linkexchange.com
pagecount
partner2profit.com
qksrv.net
rd.yahoo.com/M=
spinbox
Back to top
asdf
Guest





PostPosted: Sat Aug 21, 2004    Post subject: Reply with quote

For some reason, this:
Quote:
http://aolwpnscom.112.2o7.net/b/ss/aolwpnscom/1/G.6-PD-R/s68074003878132?[AQB]&ndh=1
. . . . . . . . . . &t=20/7/2004%2020%3A55%3A54%205%200&cc=USD&g=http%3A//www.netscape.com/
. . . . . . . . . . &r=http%3A//aasted.org/adblock/viewtopic.php%3Ft%3D604&s=1024x768&c=16&j=1.3
. . . . . . . . . . &v=N&k=Y&bw=1014&bh=538&p=Default%20Plugin%3B&[AQE]

is not being blocked by this filter:
Quote:
/(adcentriconline|addfreestats|vibrantmedia|interpolls|liveperson|nedstatbasic|tradedoubler
. . . . . . . . . . |sageanalyst|advertising|sitemeter|spinbox|superstats|ad-flow
. . . . . . . . . . |tradepub|blogads|click-fr|bizrate|prohosting|dtmpub|gator
. . . . . . . . . . |qksrv|adsdk|addup|\.atwola|\.atdmt|valueclick|\.adtrix
. . . . . . . . . . |bfast|207)\.(com|net)/
.
Help?
Back to top
cell1527



Joined: 16 Jul 2004
Posts: 44

PostPosted: Sat Aug 21, 2004    Post subject: Reply with quote

asdf:
You need to change 207 to 2o7, that will block it.
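A quick way to see the difference is to run both spellings against the reported URL. This is a sketch in Python's re module, not Adblock itself, and the filter is shortened to three alternatives to keep it readable; the URL is the one from asdf's post with the query string dropped.

```python
import re

URL = "http://aolwpnscom.112.2o7.net/b/ss/aolwpnscom/1/G.6-PD-R/s68074003878132"

# Shortened versions of the filter: most alternatives are left out.
with_digit_zero = re.compile(r"(qksrv|bfast|207)\.(com|net)", re.IGNORECASE)
with_letter_o = re.compile(r"(qksrv|bfast|2o7)\.(com|net)", re.IGNORECASE)

# The host is "2o7.net" with a letter o, so only the second spelling can match.
print("207:", bool(with_digit_zero.search(URL)))
print("2o7:", bool(with_letter_o.search(URL)))
```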
Back to top
asdf
Guest





PostPosted: Sat Aug 21, 2004    Post subject: Reply with quote

Ah, lol. Thanks.
I think I did something with my filters that screwed up adblock, because now it says "Unloaded" even though it's enabled. Weird thing is, some of the filters are still working.
Back to top
asdf
Guest





PostPosted: Sat Aug 21, 2004    Post subject: Reply with quote

OK, now this:
http://www.hindustantimes.com/on/img/indianmarriages170x40.gif
is not being blocked by this:
/\D\d{2,3}x\d{2,3}\D/
Or this:
/[\W\w\D]\d{2,3}x\d{2,3}[\W\w\D]/
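For what it's worth, the first filter does match that URL when tested as a plain regular expression. The sketch below uses Python's re module as a stand-in for Adblock's JavaScript engine (an assumption; the two agree on this simple syntax), which suggests the regex itself is not what is failing.

```python
import re

# The size filter, without the surrounding slashes.
SIZE = re.compile(r"\D\d{2,3}x\d{2,3}\D", re.IGNORECASE)

url = "http://www.hindustantimes.com/on/img/indianmarriages170x40.gif"

# search() returns the leftmost match, or None if there is none.
hit = SIZE.search(url)
print(hit.group(0) if hit else "no match")
```

The matched text is the non-digit "s", the size "170x40" and the following dot.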
Back to top
server_error
Guest





PostPosted: Sat Aug 21, 2004    Post subject: Reply with quote

Just started using Firefox + Adblock, man I wish I made the switch sooner.... anyways, I didn't see this mentioned on there, but I found a way to block those annoying-as-hell in text advertisements (You know, where it makes ad links out of random words, making it look like real content, like they have at Anandtech and a bunch of other sites)

Just add this:
lmcd.us.intellitxt.com

Otherwise the 2 lists posted in the sticky block out damn near everything, great work guys.
Back to top
asdf
Guest





PostPosted: Sat Aug 21, 2004    Post subject: Reply with quote

OK, according to a program called RegEx Coach I got, this filter:
/\D\d{2,3}x\d{2,3}\D/
should match on:
http://www.hindustantimes.com/on/img/indianmarriages170x40.gif
http://www.hindustantimes.com/on/img/113x50.gif
Etc.
Yet it is not. Might it be because it says "Unloaded" on the status bar? But then why would most of the other filters work?
Here is my current filter list, which I've been tweaking.
Quote:
[Adblock]
.iad.
/(yahoo|geocities)\.com/(toto\?s|js_source/)/
//\?DC=/
//bnr//
//buy_assets//
/[\./_](page|online)?ad(v|[zs\?]|click|image.php|image[zs]?|serv(er?)?)[_/\?]/
/[\W\d](ad)?serv(edby|ing)?[\W\d]/
/[\W\d](adcentriconline|addfreestats|vibrantmedia|interpolls|liveperson|nedstatbasic|tradedoubler|sageanalyst|advertising|sitemeter|spinbox|superstats|ad-flow|tradepub|blogads|click-fr|bizrate|prohosting|dtmpub|gator|qksrv|adsdk|addup|\.atwola|\.atdmt|valueclick|\.adtrix|bfast|zedo|\.dsdk|2o7)\.(com|net)/
/[\W\d](ad|banner)?pop(layer|under|out|up)?[\W\d]/
/[\W\d](click|value|dime|ad)xchange[\W\d]/
/[\W\d](double|value|human|specific|sponsor|dime|fast|com|ad)click[\W\d]/
/[\W\d](hot|spy|ad)log[\W\d]/
/[\W\d](onlineads?|ad(click|-?flow|frame|ima?g(es?)?|_id|js|log|se?rv(er|e)?|stream|_string|s|trix|type|vertisements?|vt?|vert|farm)?)[\W\d]/
/[\W\d](value|ad)stream[\W\d]/
/[\W\d](value|side|bottom|right|page|text|left|top|set)ad[\W\d]/
/[\W\d]ad(bureau|Wrapper|system|content|mentor|cookie|count|space|client|image|word|cycle|frame|view|track|tech|trix|here|farm|img|sys|vt)[\W\d]/
/[\W\d]ban(ner|ners|man)[\W\d]/
/[\W\d]click(ability|thru|tag|zs)[\W\d]/
/[\W\d]p(romotion|artner(ing)?)\.cgi)/
/[\W\d]sp(onsor|ymagic)[\W\d]/
/[\W\d]top(100|cto)[\W\d]/
/[\W\d]track(er|ing)?[\W\d]/
/[\W](b(an|nr)s?|jump|redir(ect|s)?|stat)[\W]/
/\D\d{2,3}x\d{2,3}\D/
/\W(affiliates|annons(er)?|associates|marketing|promos)\W/
/\W(cy|r)?c(ou)?nt(er|ed)?\W/
/\W(ned|one)stat(basic)?\W/
/\Wad(\W|x)/
/\Wgoogle(adservices|syndication)\W/
/_ad[sz]?\.(php|js|gif|jpg|swf)/
/rcm(-images)?\.amazon\.com/
/sads
a1.yimg.com
clk_thru
counter
extreme-dm
falkag
hbads
hitbox
ia.imdb.com
imdb.com/google
intellitxt.com
rd.yahoo.com/M=
reklama
spinbox
trafic.ro
webtrendslive
yahoo.com/a
yimg.com/*/bb/*
yimg.com/a/
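One thing a syntax check turns up in the list above: the promotion/partner filter has one more closing parenthesis than opening ones. The sketch below compiles three of the filters with Python's re module and reports the ones that fail. Whether Adblock's engine reacts the same way, and whether this has anything to do with the "Unloaded" status, is not established in this thread; treat it as a hint to check, nothing more.

```python
import re

# Three filters copied from the list above, without the surrounding slashes.
filters = [
    r"\D\d{2,3}x\d{2,3}\D",
    r"[\W\d]sp(onsor|ymagic)[\W\d]",
    r"[\W\d]p(romotion|artner(ing)?)\.cgi)",
]


def compile_errors(patterns):
    """Return {pattern: error message} for every pattern that fails to compile."""
    errors = {}
    for pattern in patterns:
        try:
            re.compile(pattern, re.IGNORECASE)
        except re.error as exc:
            errors[pattern] = str(exc)
    return errors


broken = compile_errors(filters)
for pattern, message in broken.items():
    print(pattern, "->", message)
```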
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sat Aug 21, 2004    Post subject: Reply with quote

"Unloaded" means that Adblock is not properly installed.

Did you by chance download Adblock from update.mozilla.org and are you starting up Firefox with the status bar disabled?

If yes and yes, get the updated Adblock from:
http://adblock.mozdev.org/dev.html
_________________
Adblock 0.5.3.042
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8) Gecko/20051111 Firefox/1.5
Back to top
asdf
Guest





PostPosted: Sat Aug 21, 2004    Post subject: Reply with quote

I got it from update.mozilla.org, and the status bar is always visible. I don't know why it wouldn't be installed properly as it was working fine before. It seems to have gone wonky after adding some filters which I've removed.
I've also found that this filter isn't working either:
/[\W\d]sp(onsor|ymagic)[\W\d]/
Doesn't match things with "sponsor" in it.
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sun Aug 22, 2004    Post subject: Reply with quote

Well, if it says "Unloaded" something is not working.

Try downloading it from the Adblock homepage instead.
Back to top
asdf
Guest





PostPosted: Thu Aug 26, 2004    Post subject: Reply with quote

I did; it still says Unloaded. Most of the filters work, but some don't. And some seem to be working selectively. Strange.
Back to top
Guest
Guest





PostPosted: Tue Aug 31, 2004    Post subject: Advanced Filters Reply with quote

FYI:

I copied the filters in the locked Advanced Filters sticky. I found out that some trailers in apple.com/trailers are blocked. If you click on the movie, all you'll see is the movie info and the movie poster, not the "Click here to play trailer" above it. This affects movies that do not have a fancy trailer home page.
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Tue Aug 31, 2004    Post subject: Reply with quote

Can you give us an example of a trailer that is being blocked?
Back to top
NJH



Joined: 13 Nov 2003
Posts: 183
Location: Hampshire, England

PostPosted: Tue Aug 31, 2004    Post subject: Reply with quote

I would like to bet that the last three trailers like http://a1.phobos.apple.com/Music/y2004/m08/d02/h18/s05.aczrxzok.60x60-75.jpg are being blocked because of the filter
Code:
/\D\d{2,3}x\d{2,3}\D/

To find out, go to your page, then click on the Adblock element on the status bar. Any entries in red are being blocked. Select one of the blocked entries and the blocking filter will show up in the lower text box. If it is this filter, get rid of it. It seems to cause more problems than it solves.
Back to top
Guest
Guest





PostPosted: Wed Sep 01, 2004    Post subject: Reply with quote

kstahl wrote:
Can you give us an example of a trailer that is being blocked?


It's 9/1/2004

Go to the main trailers page and under Newest Trailers, these are the ones that are blocked:

Enduring Love
Tom Dowd & the Language of Music
CLOSER <-- This is not blocked because it has a "trailer homepage"
Benji Off the Leash
Shaun of the Dead
The Motorcycle Diaries
Her Majesty

I think all the movies with no trailer pages are blocked.

NJH found the problem. I just can't figure out why that code was the culprit since when I looked at the AdBlock properties it blocked:

http://movies.apple.com/movies/qt_posters/qtstart5a_480x228.mov

I don't see any "D" or "X" in the URL that's blocked.
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Wed Sep 01, 2004    Post subject: Reply with quote

Guest wrote:
NJH found the problem. I just can't figure out why that code was the culprit since when I looked at the AdBlock properties it blocked:

http://movies.apple.com/movies/qt_posters/qtstart5a_480x228.mov

I don't see any "D" or "X" in the URL that's blocked.


NJH guessed correctly of course, the filter
Code:
/\D\d{2,3}x\d{2,3}\D/

is triggering on _480x228. in the URL you provided.

It's a RegEx filter and can be roughly translated to
"NOT a digit - two OR three digits - the letter x - two OR three digits - NOT a digit"

I also recommend removing that filter, or replacing it with something a bit more specific.
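The rough translation can be written out as a commented regex and run against the URL from the quoted post. This is a sketch in Python's re module, not Adblock's engine; VERBOSE is used only so that each piece can carry a comment.

```python
import re

SIZE = re.compile(
    r"""
    \D        # NOT a digit
    \d{2,3}   # two OR three digits
    x         # the letter x
    \d{2,3}   # two OR three digits
    \D        # NOT a digit
    """,
    re.VERBOSE | re.IGNORECASE,
)

url = "http://movies.apple.com/movies/qt_posters/qtstart5a_480x228.mov"

# The \D and \d in the filter are character classes, not the literal letters D and d.
hit = SIZE.search(url)
print(hit.group(0))
```

The matched text is "_480x228.", the underscore and the dot being the two non-digits.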


Last edited by kstahl on Wed Sep 01, 2004; edited 2 times in total
Back to top
cell1527



Joined: 16 Jul 2004
Posts: 44

PostPosted: Wed Sep 01, 2004    Post subject: Reply with quote

Why don't they edit the advanced filter post and get rid of that filter? All it does is cause problems.
Back to top
JediJPrice
Guest





PostPosted: Wed Sep 01, 2004    Post subject: you guys probably don't want to hear this... Reply with quote

thought this was pretty funny... I used the filters in the sticky post and found that this whole webpage was blocked...
http://forums.mozillazine.org/index.php?sid=a8a0ad50e0fa2422f399480fcc5bb572

I'm going to have to delve into this a little deeper....
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Wed Sep 01, 2004    Post subject: Reply with quote

JediJPrice,
http://aasted.org/adblock/viewtopic.php?p=3354#3354
Back to top
rue
Developer


Joined: 22 Oct 2003
Posts: 752

PostPosted: Wed Sep 01, 2004    Post subject: Reply with quote

cell1527:
Why don't they edit the advanced filter post and get rid of that filter...
.
Fixed. You'll now find kstahl's size-specific filter.
Back to top
Guest






PostPosted: Fri Sep 03, 2004    Post subject: filter speed Reply with quote

I would think that complex regular expressions are slow to match, and that the fastest text to match would be 'left anchored' simple text.

I'm saying that 100 of these

www.adban.com/*

would be better than one of these

/[\W\d](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W\d]/

But I could be wrong.

Thanks for adblock,
Tom.
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Fri Sep 03, 2004    Post subject: Reply with quote

Well, you are correct that "left anchored" simple filters are fast.

But unless you have some hard performance data to back up your statement with, I think you are wrong about hundreds of them being better than one RegEx.
Back to top
rue
Developer


Joined: 22 Oct 2003
Posts: 752

PostPosted: Fri Sep 03, 2004    Post subject: Reply with quote

Simple filters are converted to full Regular Expressions, and the wildcard (*) becomes: .* Those small filters would be faster, on an individual basis, granted they didn't employ many wildcards. But, the 'individual basis' isn't what matters.
.
There's increased overhead on looping through more Regular Expression objects -- the number Tom threw out was 100. The other consideration is accuracy. Tom's example of the left-anchor is a url-host. The filter it's presuming to replace isn't specific to the host.
.
Since it really isn't documented anywhere, I'll mention again: where applicable, you can speed filter-lookup by including the full-syntax url host. It must begin with the http(s):// and include one forward-slash *before* any wildcards. Adblock stores these filters in a separate hash, potentially bypassing all others, if the host matches.
.
http://stuff.com/cgi* -> faster
http://stuff*/cgi -> not faster
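The idea described above can be sketched as follows. This is not Adblock's actual code: the names FilterStore, host_key and simple_to_regex are invented for this illustration, and the details (substring matching, falling back to the general list when the host bucket has no hit) are assumptions.

```python
import re
from urllib.parse import urlsplit


def simple_to_regex(simple_filter):
    """Escape a simple filter and turn each '*' wildcard into '.*'."""
    parts = simple_filter.split("*")
    return re.compile(".*".join(re.escape(p) for p in parts), re.IGNORECASE)


def host_key(simple_filter):
    """Return 'scheme://host' if a slash follows the host before any wildcard."""
    m = re.match(r"(https?://[^/*]+)/", simple_filter, re.IGNORECASE)
    return m.group(1).lower() if m else None


class FilterStore:
    def __init__(self, simple_filters):
        self.by_host = {}   # host key -> regexes, looked up directly
        self.general = []   # everything else, scanned one by one
        for f in simple_filters:
            key = host_key(f)
            if key:
                self.by_host.setdefault(key, []).append(simple_to_regex(f))
            else:
                self.general.append(simple_to_regex(f))

    def blocks(self, url):
        parts = urlsplit(url)
        key = f"{parts.scheme}://{parts.netloc}".lower()
        # Fast path: only the filters stored under this exact host are tried.
        if any(rx.search(url) for rx in self.by_host.get(key, [])):
            return True
        # Slow path: loop over every remaining filter.
        return any(rx.search(url) for rx in self.general)


store = FilterStore(["http://stuff.com/cgi*", "http://stuff*/cgi"])
print(sorted(store.by_host), len(store.general))
print(store.blocks("http://stuff.com/cgi-bin/ad.gif"))
```

The first filter lands in the host bucket; the second has a wildcard before the first slash after the host, so it stays in the general list.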
Back to top
Guest






PostPosted: Fri Sep 17, 2004    Post subject: Problems with filter string "ad" and gmail Reply with quote

I grabbed the filters suggested in the locked thread including this one:
/[\W\d](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W\d]/
and just now discovered that it was causing a problem with Gmail. The problem is that the whole big mess after "ad" is optional, so the regex can match the bare string "ad" anywhere, as long as it has a non-word character or a digit on either side. That sounds good in theory, but in practice it has one big problem: the letters a and d are valid hex digits, so any GUID or other hex code containing "ad" between two digits can trigger a match. The best solution I can see is to stop treating digits as delimiters, so that "ad" surrounded by digits (as in hex identifiers) is let through. In the case above, that means removing the \d from the beginning and end of the regex, resulting in:
/[\W](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W]/

I also tried to expand and optimize it a bit more:
[\W](online)?ad(s|[-_]?(banner|click|flow|frame|ima?g(es?)?|id|js|key|log|serv(er?)?|str(eam|ing)|trix|type|v(ert(isements?)?)?|e?xchange)?)?[\W]
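The effect of dropping the \d can be seen on a reduced form of the filter that keeps only the bare "ad" case. This is a sketch in Python's re module; the first URL is the forum link reported earlier in this thread (used here only because it contains a hex session id), and the second is made up.

```python
import re

# Reduced forms of the two filters: the optional suffixes are left out.
with_digits = re.compile(r"[\W\d]ad[\W\d]", re.IGNORECASE)
without_digits = re.compile(r"[\W]ad[\W]", re.IGNORECASE)

hex_url = "http://forums.mozillazine.org/index.php?sid=a8a0ad50e0fa2422f399480fcc5bb572"
ad_url = "http://example.com/ad/banner.gif"  # hypothetical ad URL

for name, rx in (("with \\d", with_digits), ("without \\d", without_digits)):
    # Report which of the two URLs each variant triggers on.
    print(name, "| hex id:", bool(rx.search(hex_url)), "| /ad/:", bool(rx.search(ad_url)))
```

With the \d delimiters the hex id triggers the filter through "0ad5"; without them only the real /ad/ path does.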
Back to top
Guest






PostPosted: Fri Sep 17, 2004    Post subject: Re: Problems with filter string "ad" and gmail Reply with quote

Anonymous wrote:
I also tried to expand and optimize it a bit more:
[\W](online)?ad(s|[-_]?(banner|click|flow|frame|ima?g(es?)?|id|js|key|log|serv(er?)?|str(eam|ing)|trix|type|v(ert(isements?)?)?|e?xchange)?)?[\W]

Oops, obviously, that last one should still be surrounded by /'s.. I took it straight out of RegEx Coach and forgot to put the /'s back in.

/[\W](online)?ad(s|[-_]?(banner|click|flow|frame|ima?g(es?)?|id|js|key|log|serv(er?)?|str(eam|ing)|trix|type|v(ert(isements?)?)?|e?xchange)?)?[\W]/


Daniel Einspanjer
Back to top
Guest






PostPosted: Fri Sep 17, 2004    Post subject: Re: Problems with filter string "ad" and gmail Reply with quote

Crap. I see now that kstahl had already found and discussed this problem. I should have searched the forum more. It still wouldn't hurt to update that sticky post with either of the changes suggested by us.
Back to top
quazi



Joined: 21 Sep 2004
Posts: 1

PostPosted: Tue Sep 21, 2004    Post subject: Reply with quote

just to give it even more bad press, /\D\d{2,3}x\d{2,3}\D/ is blocking the main logo at Ars Technica:
Quote:
http://arstechnica.com/images/logo251x70.gif


Other than that, asdf has an excellent filter list!
Back to top
New Adblock Convert
Guest





PostPosted: Thu Sep 23, 2004    Post subject: So, what's the "Ultimate List?" Reply with quote

I've skimmed through this thread, but haven't found it yet. Can someone submit the full-text of what they think the "Ultimate List" based on all the great testing here is? I will never understand regular expressions, I just want to power up my AdBlock....
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Thu Sep 23, 2004    Post subject: Reply with quote

The ultimate list is the one you create yourself, because it will block ads on sites you visit and not be cluttered up with unnecessary filters you'll never need.

Start with simple filters. You can switch to REs later, when you feel comfortable with them.
Back to top
guest
Guest





PostPosted: Sat Sep 25, 2004    Post subject: Reply with quote

I suggest starting with this list and adding on it.

[Adblock]
/[\W\d][aA]d(s?|v|net|ward|serv(e(rs?)?)?)[\W\d]/
/[\W\d]atwola[\W\d]/
/[\W\d]ban((ner)?s?)[\W\d]/
/[\W\d](hot|spy)log[\W\d]/
/[\W\d]top(cto)?[\W\d]/
/[\W\d]sp(onsor|ymagic)[\W\d]/
/[\W\d]p(artner|ing)[\W\d]/
/[\W\d]hit[\W\d]/
/[\W\d]doubleclick[\W\d]/
/[\W\d](cnt|cycounter|count(er)?)[\W\d]/
/[\W\d]smartserve[\W\d]/
/[\W\d]paypal[\W\d]/
http://pagead2.googlesyndication.com
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sat Sep 25, 2004    Post subject: Reply with quote

Adblock is case insensitive.

Here are some of my filters.

Code:
[Adblock]
.falkag.net
.imrworldwide.com
.tradedoubler.com
.vibrantmedia.com
/[\W_]ad(bot|cycle|frame|serv(er)?|s|trix|vert(isers|ising|s)?|view|v|x)?[\W\d_]/
/[\W_]banner(id|s)?[\W\d_]/
/\D(728|588|468|234|120)x(600?|120|90)\D/
/\W(blog|central|page|side|value)ads?\W/
/\W(double|fast|value)click\W/
/\Wat(ax|dmt|wola)\W/
http://us.a1.yimg.com/us.yimg.com/a/


I get no false positives on sites I frequently visit.
Back to top
guest
Guest





PostPosted: Sat Sep 25, 2004    Post subject: Reply with quote

Let's concentrate on the RegEx part as sites are really user-specific.
I've integrated the changes into my list
except for the /\D(728|588|468|234|120)x(600?|120|90)\D/
line as it would block valid windows as well.
I'd appreciate any RegEx additions.

Here's the updated list:

[Adblock]
/[\W\d_](cnt|cycounter|count(er)?)[_\W\d]/
/[\W\d_](hot|spy)log[_\W\d]/
/[\W\d_]ad(s?|v|x|bot|jumper|cycle(frame)?|net|ward|trix|vert(isers|ising|s)?|view|serv(e(rs?)?)?)[_\W\d]/
/[\W\d_](blog|central|page|side|value)ads?[_\W\d]/
/[\W\d_]at(ax|dmt|wola)[_\W\d]/
/[\W\d_]ban((ner)?(id|s)?)[_\W\d]/
/[\W\d_](double|fast|value)click[_\W\d]/
/[\W\d_]hit[_\W\d]/
/[\W\d_]p(artner|ing)[_\W\d]/
/[\W\d_]paypal[_\W\d]/
/[\W\d_]reklama[_\W\d]/
/[\W\d_]smartserve[_\W\d]/
/[\W\d_]sp(onsor|ymagic)[_\W\d]/
/[\W\d_]top(cto)?[_\W\d]/
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sat Sep 25, 2004    Post subject: Reply with quote

guest wrote:
Let's concentrate on the RegEx part as sites are really user-specific.
I've integrated the changes into my list
except for the /\D(728|588|468|234|120)x(600?|120|90)\D/
line as it would block valid windows as well.


Those are typical sizes for ad banners. I can't even remember the last time I got a false positive from them. Remember that they don't trigger on the actual size of the images, only on parts of the URLs.
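To compare the size-specific filter with the generic one, both can be run against URLs that came up in this thread plus one banner-style URL. This is a sketch in Python's re module rather than Adblock itself, and the ads.example.com URL is hypothetical.

```python
import re

GENERIC = re.compile(r"\D\d{2,3}x\d{2,3}\D", re.IGNORECASE)
SPECIFIC = re.compile(r"\D(728|588|468|234|120)x(600?|120|90)\D", re.IGNORECASE)

urls = [
    "http://arstechnica.com/images/logo251x70.gif",
    "http://movies.apple.com/movies/qt_posters/qtstart5a_480x228.mov",
    "http://ads.example.com/img/468x60.gif",  # hypothetical banner URL
]

# Each row: (url, generic filter triggers, size-specific filter triggers).
rows = [(u, bool(GENERIC.search(u)), bool(SPECIFIC.search(u))) for u in urls]
for url, generic_hit, specific_hit in rows:
    print(f"generic={generic_hit!s:5} specific={specific_hit!s:5} {url}")
```

The generic filter triggers on all three URLs, while the size-specific one triggers only on the 468x60 banner; in both cases only the text of the URL is examined.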

guest wrote:
/[\W\d_]ad(s?|v|x|bot|jumper|cycle(frame)?|net|ward|trix|vert(isers|ising|s)?|view|serv(e(rs?)?)?)[_\W\d]/


Move the ? from after the "s" to after the whole parenthesis, otherwise it won't work for URLs like http://ad.whatever.com.

Then I'd remove the leading \d. It has been known to cause problems with some forum URLs as it will trigger on things like "...ssdf987ad8df897d..."

(That only applies if you follow the first advice)
Back to top
guest
Guest





PostPosted: Sun Sep 26, 2004    Post subject: Reply with quote

Updated the list according to kstahl's remarks.
Also changed the top regex as top.html got a false positive.
More suggestions (new regex, changes to mine) ?

[Adblock]
/[\W\d_](cnt|cycounter|count(er)?)[_\W\d]/
/[\W\d_](hot|spy)log[_\W\d]/
/[\W_]ad(s|v|x|bot|jumper|cycle(frame)?|net|ward|trix|vert(isers|ising|s)?|view|serv(e(rs?)?)?)?[_\W\d]/
/[\W\d_](blog|central|page|side|value)ads?[_\W\d]/
/[\W\d_]at(ax|dmt|wola)[_\W\d]/
/[\W\d_]ban((ner)?(id|s)?)[_\W\d]/
/[\W\d_](double|fast|value)click[_\W\d]/
/[\W\d_]hit[_\W\d]/
/[\W\d_]p(artner|ing)[_\W\d]/
/[\W\d_]paypal[_\W\d]/
/[\W\d_]reklama[_\W\d]/
/[\W\d_]smartserve[_\W\d]/
/[\W\d_]sp(onsor|ymagic)[_\W\d]/
/[\W\d_]top(100|cto)[_\W\d]/
/\D(728|588|468|234|120)x(600?|120|90)\D/
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sun Sep 26, 2004    Post subject: Reply with quote

Just some nitpicks. :)

guest wrote:
/[\W_]ad(s|v|x|bot|jumper|cycle(frame)?|net|ward|trix|vert(isers|ising|s)?|view|serv(e(rs?)?)?)?[_\W\d]/
/[\W\d_](blog|central|page|side|value)ads?[_\W\d]/

You can combine those two into a single filter as it'd supposedly be faster. Personally, I use two filters for the sake of readability. I doubt the speed gain is even noticeable. Still, it'd look something like this:
Code:
/[\W_](blog|...)?ad(s|...)?[_\W\d]/


Also, nested optional parentheses are bad, performance wise. This first filter below will be faster than the second. Again, you have to balance performance and readability.
Code:
/\Wad(servers|server|serve|serv)?\W/
/\Wad(serv(e(rs?)?)?)?\W/
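The two spellings accept exactly the same strings, so the choice between them is only about speed and readability. The sketch below checks that equivalence on a handful of made-up path fragments using Python's re module; it does not measure performance, and timings in Adblock's JavaScript engine may differ anyway.

```python
import re

flat = re.compile(r"\Wad(servers|server|serve|serv)?\W", re.IGNORECASE)
nested = re.compile(r"\Wad(serv(e(rs?)?)?)?\W", re.IGNORECASE)

# Hypothetical path fragments; the last two should not trigger either filter.
samples = ["/ad/", "/adserv/", "/adserve/", "/adserver/", "/adservers/",
           "/adserverx/", "/address/"]

# Each row: (sample, flat filter triggers, nested filter triggers).
results = [(s, bool(flat.search(s)), bool(nested.search(s))) for s in samples]
for sample, flat_hit, nested_hit in results:
    print(f"{sample:13} flat={flat_hit!s:5} nested={nested_hit}")
```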


guest wrote:
/[\W\d_]hit[_\W\d]/

Personally, I think this risks false positives, but whatever...

guest wrote:
/[\W\d_]paypal[_\W\d]/

As long as you don't use PayPal... :?
Back to top
Guest






PostPosted: Tue Sep 28, 2004    Post subject: Reply with quote

Here is my rather big list. I gathered the entries for months. With it I can surf almost ad-free on English and German sites. I thought it may be time to boil this list down to the essentials. I am new to regex, so there is still some work to do.
Code:
[Adblock]
*&adsize=*
The big regex doesn't catch this.
Code:

*.adbureau.net/*
*.addfreestats.com/*
*.adlegend.com/*
*.adnetz.net/*
*.adsdk.com/*
Maybe I should make a regex that catches these ".ad*." domains.
Code:

*.affiliate.de/*
*.akamai.net/*
*.allsponsor.de/*
*.atdmt.com/*
*.atwola.com/*
*.austrobanner.com/*
*.bfast.com/*
*.bonzi.com/*
*.clicksor.com/*
*.comclick.com/*
*.espotting.com/*
*.falkag.de/*
*.falkag.net/*
*.fastclick.net/*
*.gator.com/*
*.httpads.com/*
*.intellitxt.com/*
*.ivwbox.de/*
*.linkexchange.com/*
*.maxserving.com/*
*.mediaplazza.com/*
*.mediavantage.de/*
*.qksrv.net/*
*.qksz.net/*
*.sexlist.com/*
*.sextracker.com/*
*.shortnews.de/*
*.singleberater.com/*
*.sonnerie.net/*
*.speedera.net/*
*.valueclick.com/*
*.vibrantmedia.com/*
*.yimg.com/a/*
*.zanox-affiliate.de/*
*.zedo.com/*
*/!advert_*
Because of the _ this won't be caught by the big regex.
Code:

*//*.imdb.com/*.swf*
*//ad1.*
*//adserver2.*
*//adsnew.*
The big regex needs some work to catch those numbers after "ad" "adserver" or whatever. "adsnew" needs to be added.
Code:

*//affiliate.*
*//banner.*
*//banners.*
*//bans.*
Got to make a regex for banners.
Code:

*//bizad.*
*//extads.*
*//hbads.*
Hmm, "ad" at the end could also be caught by a regex.
Code:

*//promo.ebay.com/*
*//rcm-de.amazon.de/*
*//rcm-images.amazon.com/*
*//rcm-uk.amazon.co.uk/*
*//rcm.amazon.com/*
*//werbung.*
*/?DC=*
*/AD_Banner/*
*/AddBanner/*
*/Bannerads/*
*/EIGENWERBUNG/*
*/Werbung/*
*/_banner/*
*/adalive/*
*/adcycle.cgi?*
*/adplace.js*
*/ads4_*
*/adscript*
*/adv1/*
*/adview.php?*
Some more improvements for the big regex required
Code:

*/affiliate/*
*/affiliate_show_banner.php?*
*/affiliatebanners/*
*/affiliates/*
*/banman.asp?*
*/banner.cgi?*
*/banner.gif
*/banner.html
*/banner.jpg
*/banner.js?*
*/banner.php?*
*/banner/*
*/banners/*
*/bannerscript/*
*/bannerwerbung/*
*/banniere/*
I don't like banners :)
Code:

*/content_ad/*
*/flashbanner/*
*/getad.php3?*
*/google.php?*
*/google/box?*
*/google_ads.cgi?*
*/img_banner/*
*/ivw/*
*/jsbanner.php?*
*/linkbox/*
*/pagead/*
*/phpads/*
*/refreshAllAd.js
*/showban.asp?*
*/sponsor/*
*/sponsoren/*
*/viewbanner.php?*
*/webadImg/*
*_banner_*
*banner.swf
*clickxchange.com/*
*geocities.com/js_source/*
Quote:

/[\W](online)?ad(s|[-_]?(banner|click|flow|frame|ima?g(es?)?|id|js|key|log|serv(er?)?|str(eam|ing)|trix|butler|type|v(ert((isements|s|ising)?)?)?|e?xchange)?)?[\W]/
The BIG regex. I extended it to "adverts", "advertising" and some other things, always afraid of messing it up. I checked my old list against it and was able to delete some entries.
Quote:

/include\.ebaystatic\.com(?![a-z\/]+(\/myebay\/|common_functions\.js|globals\.js|\/contact\/|help_strings\.js))/

The last one is for improved eBay performance; it is still a work in progress.

I am not afraid of false positives, but banner blocking is always a bit difficult because some webmasters give perfectly innocent pictures names like *banner* or put them into directories with "banner" in the name.
Back to top
Mugros



Joined: 28 Sep 2004
Posts: 38

PostPosted: Tue Sep 28, 2004    Post subject: Reply with quote

I was the one with the last big list. I fired up the Regex Coach and compressed my list. Here is the result. (The regex entries are in reverse chronological order: the expression I entered last comes first.)

[Adblock]
*.yimg.com/a/*
*//*.imdb.com/*.swf*
*//bans.*
*//promo.ebay.com/*
*/?DC=*
*/showban.asp?*
*geocities.com/js_source/*

-- special stuff that didn't fit into more general expressions.

/[\W]\/google(\/box|\.php)[\W]/

-- some google box stuff, also kind of special. Couldn't resist making an expression for it

/[\W]\/(linkbox|ivw|sponsor(en)?)\/[\W]/

-- /<whatever>/ stuff

/[\W](akamai|allsponsor|atdmt|atwola|austrobanner|bfast|bonzi|espotting|falkag|gator|intellitxt|ivwbox|linkexchange|maxserving|
mediaplazza|mediavantage|qksrv|qksz|sexlist|sextracker|shortnews|sonnerie|speedera|vibrantmedia|zedo)\.(net|com|de)[\W]/

-- domain blocking

/[\W]((value|com|fast)click|click(sor|xchange))\.(net|com|de)[\W]/

-- "*click" and "click*" domains. I don't know if this expression is very good, but I didn't want to block "click" itself (I don't visit it though).

/[\W]ad(bureau|netz|sdk|legend)\.(net|com|de)[\W]/

-- "ad*" domains

/[\W]rcm(-[a-z]+)?\.amazon[\W]/

-- amazon stuff

/[\W_](EIGEN|banner)?[wW](erbung|ERBUNG)([0-9])?[_\W]/

-- german for "ads"

/[\W_]affiliate(s|banners)?([0-9])?[_\W]/

-- "affiliate" stuff

/[\W_](AD_|flash|js|view)?[bB]anner(s|ads|script)?([0-9])?[_\W]/

-- "banner" stuff

/[\W_](http|get|web|page|php|content_|online|ext|hb|biz)?ad(s|[-_]?(banner|snew|size|cycle|click|flow|frame|[iI]ma?g(es?)?|id|js|
key|log|serv(er?)?|str(eam|ing)|script|trix|place|alive|view|butler|type|v(ert((isements|s|ising)?)?)?|e?xchange)?)?([0-9])?[_\W]/

-- the huge "ad" blocking expression

/include\.ebaystatic\.com(?![a-z\/]+(\/myebay\/|common_functions\.js|globals\.js|\/contact\/|help_strings\.js))/

-- still the experimental ebay script blocker

(PS: i wrapped the long lines)

All lines together:
Quote:
[Adblock]
*.yimg.com/a/*
*//*.imdb.com/*.swf*
*//bans.*
*//promo.ebay.com/*
*/?DC=*
*/showban.asp?*
*geocities.com/js_source/*
/[\W]\/google(\/box|\.php)[\W]/
/[\W]\/(linkbox|ivw|sponsor(en)?)\/[\W]/
/[\W](akamai|allsponsor|atdmt|atwola|austrobanner|bfast|bonzi|espotting|falkag|gator|intellitxt|ivwbox|linkexchange|maxserving|mediaplazza|mediavantage|qksrv|qksz|sexlist|sextracker|shortnews|sonnerie|speedera|vibrantmedia|zedo)\.(net|com|de)[\W]/
/[\W]((value|com|fast)click|click(sor|xchange))\.(net|com|de)[\W]/
/[\W]ad(bureau|netz|sdk|legend)\.(net|com|de)[\W]/
/[\W]rcm(-[a-z]+)?\.amazon[\W]/
/[\W_](EIGEN|banner)?[wW](erbung|ERBUNG)([0-9])?[_\W]/
/[\W_]affiliate(s|banners)?([0-9])?[_\W]/
/[\W_](AD_|flash|js|view)?[bB]anner(s|ads|script)?([0-9])?[_\W]/
/[\W_](http|get|web|page|php|content_|online|ext|hb|biz)?ad(s|[-_]?(banner|snew|size|cycle|click|flow|frame|[iI]ma?g(es?)?|id|js|key|log|serv(er?)?|str(eam|ing)|script|trix|place|alive|view|butler|type|v(ert((isements|s|ising)?)?)?|e?xchange)?)?([0-9])?[_\W]/
/include\.ebaystatic\.com(?![a-z\/]+(\/myebay\/|common_functions\.js|globals\.js|\/contact\/|help_strings\.js))/
Back to top
Michel
Guest





PostPosted: Sun Oct 03, 2004    Post subject: this is my filter Reply with quote

The idea is that adverts usually are a clickable image. And there will be a combination of an url and a image extension in the source code. Hence:

/img.src="http://[0-9a-zA-Z._-]+/?[0-9a-zA-Z]+[.gif|.jpg|.jpeg|.swf|.png]+/

For a first attempt not really bad, i think. Far from perfect, but it is a start.

Michel
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Tue Oct 05, 2004    Post subject: Reply with quote

Adblock can only block on URLs, not source code. So that img.src serves no purpose. And Adblock is case insensitive, so a-z is the same thing as A-Z.

Also, wouldn't your filter block every image, not just ads?
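Part of the reason the filter is so broad is the square brackets at the end: inside [...] the dots, pipes and letters form a character class (any single one of those characters), not a list of extensions. The sketch below shows the difference in Python's re module; the grouped version is only a guess at what was intended, not something posted in this thread.

```python
import re

# The extension part exactly as written in the filter: a character class.
as_written = re.compile(r"[.gif|.jpg|.jpeg|.swf|.png]+", re.IGNORECASE)

# Presumably the intent: a dot followed by one of the listed extensions.
grouped = re.compile(r"\.(gif|jpe?g|swf|png)", re.IGNORECASE)

# "pigs" is built only from letters that appear in the class, so the class accepts it.
print(bool(as_written.fullmatch("pigs")))
print(bool(grouped.search("pigs")))
print(grouped.search("banner.jpeg").group(1))
```

Even with the grouped form, the filter would still match any image URL with those extensions, ad or not.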
Back to top
Mr. Dude
Guest





PostPosted: Tue Oct 05, 2004    Post subject: Excellent! Reply with quote

Thank you to all of the people who submitted the filters! I've been using them for while now and they are great! No more experts-exchange.com clutter! I can actually find the questions!
Back to top
Mugros



Joined: 28 Sep 2004
Posts: 38

PostPosted: Wed Oct 06, 2004    Post subject: Reply with quote

kstahl wrote:
And Adblock is case insensitive, so a-z is the same thing as A-Z.

I noticed this while fine-tuning my filters. So my list is now:

[Adblock]
*.yimg.com/a/*
*//*.imdb.com/*.swf*

*//bans.*
*//promo.ebay.com/*
*/?DC=*
*/showban.asp?*
*geocities.com/js_source/*
/[\W]((value|com|fast)click|click(sor|xchange))\.(net|com|de)[\W]/
/[\W](akamai|allsponsor|atdmt|atwola|austrobanner|bfast|bonzi|espotting|falkag|gator|intellitxt|ivwbox|linkexchange|maxserving|mediaplazza|mediavantage|qksrv|qksz|sexlist|sextracker|shortnews|sonnerie|speedera|vibrantmedia|zedo)\.(net|com|de)[\W]/
/[\W]ad(bureau|netz|sdk|legend)\.(net|com|de)[\W]/
/[\W]rcm(-[a-z]+)?\.amazon[\W]/
/[\W_](AD_|flash|js|view)?banner(s|ads|script)?([0-9])?[_\W]/
/[\W_](eigen|banner)?werbung([0-9])?[_\W]/
/[\W_](http|get|web|page|php|content_|online|ext|hb|biz)?ad(s|[-_]?(banner|snew|size|cycle|click|flow|frame|[iI]ma?g(es?)?|id|js|key|log|serv(er?)?|str(eam|ing)|script|trix|place|alive|view|butler|type|v(ert((isements|s|ising)?)?)?|e?xchange)?)?([0-9])?[_\W]/
/[\W_]affiliate(s|banners)?([0-9])?[_\W]/
/\/(linkbox|ivw|sponsor(en)?)\//
/\/google(\/box|\.php)[\W]/
/include\.ebaystatic\.com(?![a-z\/]+(\/myebay\/|common_functions\.js|globals\.js|\/contact\/|help_strings\.js))/
Back to top
anti00Zero



Joined: 08 Oct 2004
Posts: 1

PostPosted: Fri Oct 08, 2004    Post subject: Reply with quote

I modified the advanced filters a bit because of problems, and because they were not blocking popups and ads.

[Adblock]
*banner*
/(hot|spy)log/
/(new|double|fast|value)click./
/[\/.]*popup*[\/.]/
/[\/.]az[\/.]/
/[\/.]ban(image|source)s?[\/.]/
/[\/._-](|dhtm|i)ad(click|mentor|s|s2|sdk|sv3|server|trix|image|img|log|vt|bureau|counter|v|vert|vertising|vertisement)s?[\/._?-]/
/[\/._]promo(|tion)s?[\/._]/
/[\/](associates|affiliates|us.yimg.com\/a)[\/]/
/[\W\d\/.](2o7|atdmt|atwola|bfast|bluestreak|coremetrics|dgm2|falkag|hitbox|marketbanker|qksrv|ru4|tribalfusion|zedo)[\W\d\/.]/
/[\W\d]click(stream|thrutraffic|thru|xchange)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/[\W\d]value(stream|xchange|click)[\W\d]/
/[\W_](b(an|nr)s?|jump|stat)[\W_]/
/\/buy_assets\//
/\D(588|468|234|120)x(600?|120|90)\D/
/\W(cy|r)?c(ou)?nt(er|ed)?\W/
/p(artner|ing\.cgi|romotion)/
/sp(onsor|ymagic)/
/top(100|cto)/
googlesyndication
us.yimg.com/a/
Back to top
NJH



Joined: 13 Nov 2003
Posts: 183
Location: Hampshire, England

PostPosted: Fri Oct 08, 2004    Post subject: Reply with quote

Quote:

/[\/._-](|dhtm|i)ad(click|mentor|s|s2|sdk|sv3|server|trix|image|img|log|vt|bureau|counter|v|vert|vertising|vertisement)s?[\/._?-]/


I saw this filter, or something similar, a few days ago in another post, and I would love an explanation of the first bit in brackets, (|dhtm|i), which looks malformed to me because there is nothing before the first |. Does this achieve the same as (dhtm|i)?, in that it allows nothing to precede ad?
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Fri Oct 08, 2004    Post subject: Reply with quote

NJH, yes it does.
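
For anyone who wants to check this themselves, here is a small sketch. It uses Python's re module rather than Adblock (an assumption; the empty-alternative syntax behaves the same way there), a shortened version of the quoted filter, and made-up example.com URLs.

```python
import re

# Shortened version of the quoted filter; only the optional prefix differs.
empty_alt = re.compile(r"[/._-](|dhtm|i)ad(s|server|vert)s?[/._?-]", re.IGNORECASE)
optional = re.compile(r"[/._-](dhtm|i)?ad(s|server|vert)s?[/._?-]", re.IGNORECASE)

# Hypothetical URLs covering each prefix case plus one non-ad URL.
samples = [
    "http://example.com/ads/top.gif",      # no prefix before "ad"
    "http://example.com/iads/top.gif",     # "i" prefix
    "http://example.com/dhtmads.js",       # "dhtm" prefix
    "http://example.com/upload/file.zip",  # not an ad, should not match
]

# Both spellings must agree on every sample.
for url in samples:
    assert bool(empty_alt.search(url)) == bool(optional.search(url))

results = [bool(empty_alt.search(url)) for url in samples]
print(results)
```

The two spellings only differ in which alternative is tried first, so the block/no-block decision comes out the same.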
_________________
Adblock 0.5.3.042
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8) Gecko/20051111 Firefox/1.5
Back to top
Perplexed
Guest





PostPosted: Fri Oct 08, 2004    Post subject: Reply with quote

I'm having a problem with one of the filters from this thread and the Advanced Filters thread that I hope you folks can help me out with. The filter below is blocking the left frame of a phpMyAdmin installation on one of my sites, which makes it difficult to manage the SQL database there without turning Adblock off. I'm hoping one of you can recommend an adjustment that might help prevent Adblock's hiding of that frame.

Quote:
/[\W\d](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?
|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W\d]/

(Note: The filter is usually on one line, but I broke it up into two for easier display in this post due to the length.)

These are the URLs phpMyAdmin uses that are being blocked. I've removed the domain name from the start and an MD5 hash from the end for security reasons, but I may be able to post them later if needed.
Quote:
/stuff/myadmin/left.php?lang=en-iso-8859-1&server=1&hash=
/stuff/myadmin/queryframe.php?lang=en-iso-8859-1&server=1&hash=


I hope you can help me out. :)
Back to top
Org



Joined: 23 Oct 2003
Posts: 349

PostPosted: Fri Oct 08, 2004    Post subject: Reply with quote

My guess: the hash you removed has hexadecimal numbers. Given enough random hex numbers, one will sooner or later contain part:

number a d number

since a and d are valid hex chars. That will match your filter:

\d ad \d
Back to top
Mugros



Joined: 28 Sep 2004
Posts: 38

PostPosted: Sat Oct 09, 2004    Post subject: Reply with quote

Org wrote:
My guess: the hash you removed has hexadecimal numbers. Given enough random hex numbers, one will sooner or later contain part:

number a d number

since a and d are valid hex chars. That will match your filter:

\d ad \d

I confirm that:
The expression matches cd868ad79fe. To be precise, it matches 8ad7 in this case.
With this expression several sites will not display correctly.
I would remove both \d.
Look at my previous posts. There is a list of some entries that I combined into a regex. The regex searches for a non-word character (or underscore), then "ad-whatever", then optionally a single digit, and then again a non-word character (or underscore). This would only give a false positive on a very small hexadecimal number, which is unlikely to appear as a session ID or the like. E.g. it matches "=ad7&" but not "=2ad5&" or "=ad8322525&".
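
The difference can be sketched with Python's re module standing in for Adblock (an assumption; Adblock uses JavaScript regular expressions). Both patterns below are cut-down versions of the filters in this thread, keeping only a few of the "ad" suffixes, so treat them as illustrations rather than the real filters.

```python
import re

# Cut-down version of the filter Perplexed quoted: digits allowed on both sides.
with_digits = re.compile(
    r"[\W\d](onlineads?|ad(banner|click|s|v|vert)?)[\W\d]", re.IGNORECASE
)

# Cut-down version of my filter: non-word character or underscore on both sides,
# with at most one digit after the "ad" part.
bounded = re.compile(
    r"[\W_]ad(s|[-_]?(banner|click|serv(er?)?)?)?([0-9])?[_\W]", re.IGNORECASE
)

hash_url = "hash=cd868ad79fe"

hit = with_digits.search(hash_url)
print(hit.group())                      # the part of the hash that triggers the filter
print(bounded.search(hash_url))         # no match with the bounded version

for text in ["=ad7&", "=2ad5&", "=ad8322525&"]:
    print(text, bool(bounded.search(text)))
```

Only "=ad7&" is matched by the bounded version; the hash and the two longer values pass through.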
Back to top
drkasbd



Joined: 09 Oct 2004
Posts: 3

PostPosted: Sat Oct 09, 2004    Post subject: Org's list question Reply with quote

I'm using Org's list:

Quote:
[Adblock]
.telkku.com/*.swf
.yimg.com/a/
/(ad-flow|adsdk|advertising|bizrate|prohosting|resellerratings|tradedoubler|\.atwola|\.atdmt|valueclick|\.adtrix)\.com/
/(doubleclick|fastclick|spinbox|falkag)\.net/
/(yahoo|geocities)\.com\/(toto\?s|js_source\/)/
/[.\/]adcontent[.\/]/
/[.\/_](page|online)?ad(v|_banner|[zs\?]|click|image.php|image[zs]?|serv(er?)?)?[_\/\?]/
!/[\/]banner([sz]?\/|it\/|\.cgi|\.pl|farm\/|link\.|_pysty\.gif|source\.|.?\d*\.)/
/\/\?DC=/
/\/ad(js.php|size=|frame)/
/\/bnr\//
/_ad[sz]?\.(php|js|gif|jpg|swf)/
/aslframe.html
/bd.m?
/http:\/\/(mainos|rcm-images)\./
!/http:\/\/banner[sz]?\./
/http:\/\/r?ad([sz]?|images?|img|tech|serv(er?)?)\./
/imdb\.com\/(google\/|.*\.swf)/
/popup(\d*)?\.js/
http://www.theregister.co.uk/media/


This list appears to work great! But this is blocking some of the weatherunderground banners on my home page, which have a graphic like this:

<img src="http://banners.wunderground.com/banner/infobox_both/language/www/US/FL/Lake_City.gif" alt="Click for Lake City, Florida Forecast" height="108" width="144">

To fix this I have to disable the two lines (above) that are prefixed with exclamation points. But this definitely weakens the ad defense. Is there a simple way to add certain banners to a "whitelist"? Or is there another simple way to allow this type of banner?

Thanks, Kris
Back to top
idbehold



Joined: 03 Oct 2004
Posts: 174

PostPosted: Sat Oct 09, 2004    Post subject: Reply with quote

I'm using asdf's filter; I just removed a lot of the duplicates and put a lot of the stuff on one line:

Quote:
[Adblock]
/(rd\.yahoo|yahoo|yimg|geocities)\.com\/(toto\?s|js_source\/|M=|a)/
//\?DC=/
/[\./_](page|online)?ad(v|[zs\?]|click|image.php|image[zs]?|serv(er?)?)[_/\?]/
/[\W\d](ad|banner)?(pop|serv)(layer|under|out|up|edby|ing)?[\W\d]/
/[\W\d](clk_|click)(ability|thru|thrutraffic|tag|zs|-fr|stream|xchange)[\W\d]/
/[\W\d](counter|extreme-dm|falkag|hbads|hitbox|reklama|webtrandslive|buy_assets|vibrantmedia|interpolls|liveperson|nedstatbasic|tradedoubler|sageanalyst|sitemeter|superstats|tradepub|blogads|bizrate|prohosting|dtmpub|gator|qksrv|\.atwola|\.?atdmt|valueclick|\.adtrix|bfast|zedo|\.dsdk|2o7|jnova\.cjt1)\.(com|net)/
/[\W\d](double|value|human|specific|sponsor|dime|fast|com|ad)(click|xchange)[\W\d]/
/[\W\d](hot|spy|ad)log[\W\d]/
/[\W\d](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange|farm)?)[\W\d]/
/[\W\d](t|i)(raffic|a|mdb|ntellitxt)\.(ro|imdb\.com|com\/google)[\W\d]/
/[\W\d](value|side|bottom|right|page|text|left|top|set)(ad|stream|xchange|click)[\W\d]/
/[\W\d]ad(vertising|sdk|-flow|dup|dfreestats|centriconline|bureau|Wrapper|system|content|mentor|cookie|count|space|client|image|word|cycle|frame|view|track|tech|trix|here|farm|img|sys|vt|vision|vernet)\.?(com|net)?[\W\d]/
/[\W\d]s?p(onsor|ymagic|inbox|romotion|artner(ing)?)[\W\d]/
/[\W\d]top(100|cto)[\W\d]/
/[\W\d]track(er|ing)?[\W\d]/
/[\W_](b(an(ner|man)|nr)s?|jump|redir(ect|s)?|stat)[\W_]/
/\D\d{2,3}(x\d{2,3}|by\d{2,3}|_mast)\D/
/\W(affiliates|annons(er)?|associates|marketing|promos)\W/
/\W(ned|one)stat(basic)?\W/
/\Wgoogle(adservices|syndication)\W/
/_ad[sz]?\.(php|js|gif|jpg|swf)/
/rand_(image|image2)/
/rcm(-images)?\.amazon\.com/
Back to top
Tommy
Guest





PostPosted: Sat Oct 09, 2004    Post subject: Reply with quote

How do I block "ad" and "ads" but not "upload" and "download"? I can't build an exception filter :(
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sat Oct 09, 2004    Post subject: Reply with quote

Create more specific filters.

Code:
/ad.
/ads.
*/ad/*
*/ads/*


etc.
_________________
Adblock 0.5.3.042
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8) Gecko/20051111 Firefox/1.5
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sat Oct 09, 2004    Post subject: Re: Org's list question Reply with quote

drkasbd wrote:
Is there a simple way to add certain banners to a "whitelist"? Or is there another simple way to allow this type of banner?


drkasbd,

Whitelists are not yet implemented, but they are coming.
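
In the meantime, a negative lookahead like the one in the ebaystatic filter earlier in this thread might serve as a per-filter exception. The following is only a sketch, run with Python's re module rather than inside Adblock (so untested there), applied to the two "!" lines from Org's list. The example.net ad URL is made up, and the leading ^ on the second pattern is my own assumption about how to exempt a whole host.

```python
import re

# Org's host-based "!" line, with a lookahead that skips wunderground.com.
banner_host = re.compile(r"http://banner[sz]?\.(?!wunderground\.com)", re.IGNORECASE)

# Org's path-based "!" line, anchored so the wunderground host is exempt as a whole.
banner_path = re.compile(
    r"^(?!http://banners\.wunderground\.com/).*"
    r"/banner([sz]?/|it/|\.cgi|\.pl|farm/|link\.|_pysty\.gif|source\.|.?\d*\.)",
    re.IGNORECASE,
)

wanted = ("http://banners.wunderground.com/banner/infobox_both/"
          "language/www/US/FL/Lake_City.gif")
unwanted = "http://banners.example.net/468x60.gif"  # hypothetical ad URL

for name, pattern in [("host", banner_host), ("path", banner_path)]:
    print(name, bool(pattern.search(wanted)), bool(pattern.search(unwanted)))
```

Both lines need the exception, because the wunderground URL matches the host filter (banners.) and the path filter (/banner/).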
_________________
Adblock 0.5.3.042
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8) Gecko/20051111 Firefox/1.5
Back to top
Guest






PostPosted: Sat Oct 09, 2004    Post subject: Reply with quote

kstahl wrote:
Create more specific filters.

Code:
/ad.
/ads.
*/ad/*
*/ads/*


etc.

I know, but it doesn't block this: http://ads.koszulki.com/?id=545 (go to xp.net.pl). When I use /ad(?![a-d])/ everything is OK, but it also blocks upload and download.
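
The over-blocking can be reproduced outside Adblock. The sketch below uses Python's re module as a stand-in (an assumption; Adblock uses JavaScript regular expressions) and two made-up example.com URLs. The second pattern is one possible tightening, requiring a non-word character on each side of "ad" or "ads"; it is an illustration, not a filter taken from any posted list.

```python
import re

# The lookahead only rules out "ada".."add"; "upload/" and "download." still match.
loose = re.compile(r"ad(?![a-d])", re.IGNORECASE)

# Possible tightening: "ad" or "ads" must sit between non-word characters.
bounded = re.compile(r"\Wads?\W", re.IGNORECASE)

urls = [
    "http://ads.koszulki.com/?id=545",
    "http://example.com/upload/file.zip",  # hypothetical
    "http://example.com/download.php",     # hypothetical
]

loose_hits = [bool(loose.search(u)) for u in urls]
bounded_hits = [bool(bounded.search(u)) for u in urls]
print(loose_hits)
print(bounded_hits)
```

The loose pattern hits all three URLs, while the bounded one only hits the ads.koszulki.com address.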
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Sat Oct 09, 2004    Post subject: Reply with quote

The big Flash ad can be blocked by filtering out the script launching it:
Code:
/adlayer.

_________________
Adblock 0.5.3.042
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8) Gecko/20051111 Firefox/1.5
Back to top
Mugros



Joined: 28 Sep 2004
Posts: 38

PostPosted: Sat Oct 09, 2004    Post subject: Reply with quote

Anonymous wrote:
kstahl wrote:
Create more specific filters.

Code:
/ad.
/ads.
*/ad/*
*/ads/*


etc.

I know, but it doesn't block this: http://ads.koszulki.com/?id=545 (go to xp.net.pl). When I use /ad(?![a-d])/ everything is OK, but it also blocks upload and download.

Why don't you use the posted "ad" filters?
Here is mine:
/[\W_](http|get|web|page|php|content_|online|ext|hb|biz)?ad(s|[-_]?(banner|snew|size|cycle|click|flow|frame|[iI]ma?g(es?)?|id|js|key|log|serv(er?)?|str(eam|ing)|script|trix|place|alive|view|butler|type|v(ert((isements|s|ising)?)?)?|e?xchange)?)?([0-9])?[_\W]/
Use Regex Coach (http://www.weitz.de/regex-coach/) to practise regular expressions.
Note that Adblock is case insensitive, and that you have to remove the slashes around the expression in Regex Coach.
Back to top
drkasbd



Joined: 09 Oct 2004
Posts: 3

PostPosted: Sat Oct 09, 2004    Post subject: Reply with quote

For what it's worth, when I discovered this forum, I tried the two "advanced" regex lists that are in Rue's stickied post. The first, entitled "Murder, she wrote", worked really well. BUT... I quickly found that it was blocking several important things on a secure web site at Penn State that I use to post homework, grades and so on. I didn't have time to figure out which regex line specifically was the problem, but the list didn't seem to be quite complete either.

Then I tried the second one listed in the sticky (by Guest), and this was allowing too much. Then I searched this thread, and found that Org's list seemed to do the best (except that it excluded a few banners that I actually wanted to see). But I found that disabling the two lines in Org's list that blocked "banners" still did better than most.

I briefly tried asdf's list, and found that it performed pretty similarly to Org's list (both are listed in this thread), but I didn't try to edit either to allow specific banners. Anyway, it seems both are pretty good lists. I look forward to having whitelist support, though! But that's my pitiful 2 cents...
Back to top
G



Joined: 10 Oct 2004
Posts: 550

PostPosted: Sun Oct 10, 2004    Post subject: Filterset.G Reply with quote

Just to throw my hat in the ring, check out the latest revision of Filterset.G
_________________
Filterset.G
Back to top
drkasbd



Joined: 09 Oct 2004
Posts: 3

PostPosted: Sun Oct 10, 2004    Post subject: Re: Filterset.G Reply with quote

G wrote:
Just to throw my hat in the ring, check out the latest revision of Filterset.G


I'm not sure which set is "G", but your most recent list seems to work like a charm for me, and I didn't have to disable anything. :D
Back to top
G



Joined: 10 Oct 2004
Posts: 550

PostPosted: Sun Oct 10, 2004    Post subject: Reply with quote

"Filterset.G" is just the general name of the filter.

I'm very careful to avoid any false positives (which is why it's so long), so there shouldn't be a need to take any lines out. If you find any problems (false positives, unblocked ads), let me know.
_________________
Filterset.G
Back to top
Ieremiou
Guest





PostPosted: Mon Oct 11, 2004    Post subject: Reply with quote

Nice filter set, G.
I'll keep an eye on this one :)
Back to top
Perplexed
Guest





PostPosted: Mon Oct 11, 2004    Post subject: Reply with quote

Org wrote:
My guess: the hash you removed has hexadecimal numbers. Given enough random hex numbers, one will sooner or later contain part:

number a d number

since a and d are valid hex chars. That will match your filter:

\d ad \d


You were right. The hash contains "2ad4" right in the middle. Thanks for the help! :)

I'm going to try using Filterset.G for a while. It doesn't have this little problem I had before, so I'll see how it does.
Back to top
G



Joined: 10 Oct 2004
Posts: 550

PostPosted: Mon Oct 11, 2004    Post subject: Hex Reply with quote

If you find that a hex string is being blocked, let me know and I'll see what I can do to fix it - I'd rather see a dozen ads than have one false positive.
_________________
Filterset.G
Back to top
Guest
Guest





PostPosted: Mon Oct 11, 2004    Post subject: what is wrong with filter Reply with quote

I cannot see the map on this officemax link.
http://www.officemax.com/max/solutions/Shanghai?loadPage=http%3A%2F%2Fwww.officemax.com%2Fcgi-bin%2Fmqinterconnect%3Flink%3Dmap2%26show%3Dtnail%26id%3DStore%2520%2523394%26streetaddress%3D3839%2520Emery%2520Street%26recordid%3DStore%2520%2523394%26name%3DEmeryville%26store%3DEast%2520Bay%2520Bridge%2520Center%26address%3D3839%2520Emery%2520Street%26city%3DEmeryville%26state%3DCA%26zip%3D94608%26phone%3D510-652-1140%26hours%3DM-F%25208-9%2C%2520Sa%25209-8%2C%2520Su%252010-7%252a%26distance%3D2.42%26level%3D9&docPart=shanghai&containerQuery=bvc%3Dyes&containerPage=http%3A%2F%2Fwww.officemax.com%2Fmax%2Fsolutions%2Fcustserv%2FStoreLocator.jsp

This is the filter that is blocking it. (It is a slight modification of one in the 'Advanced Filters' sticky notes.)
/[\W\d](onlineads?|indiads|ad(banners?|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|vertisings?|vert|vt|v|xchange)?)[\W\d]/

Can someone help me locate the problem in this filter and show how to fix it?

Thanks in advance.
Back to top
G



Joined: 10 Oct 2004
Posts: 550

PostPosted: Tue Oct 12, 2004    Post subject: Reply with quote

It's a problem with Mapquest maps - they often contain "ad" in the URL, as that one does.
_________________
Filterset.G
Back to top
G



Joined: 10 Oct 2004
Posts: 550

PostPosted: Tue Oct 12, 2004    Post subject: Fix Reply with quote

To fix it, just put [^%] at the very beginning of your filter.
_________________
Filterset.G
Back to top
Guest
Guest





PostPosted: Tue Oct 12, 2004    Post subject: thanks Reply with quote

thanks G.
that helps.

just curious which 'ad'? as in 'streetaddress' or 'address' ?

what exactly is [^%] doing? removing % from string to match?
'The Regex Coach' did not find anything in original string and filter combination!!
Back to top
G



Joined: 10 Oct 2004
Posts: 550

PostPosted: Tue Oct 12, 2004    Post subject: Reply with quote

Deep in the URL for the map image is the string "%3ad67%". Your filter is matching it as follows: non-letter character [\W] followed by digit [\d] followed by "ad" followed by digits [\d] followed by non-letter character [\W].

By adding [^%], you tell the filter not to block a string if it starts with % (overriding the [\W] for that character).

I don't know what Regex Coach is, so I can't help you in that department.
_________________
Filterset.G
Back to top
kstahl
Support


Joined: 02 Jan 2004
Posts: 1202
Location: Stockholm, Sweden

PostPosted: Tue Oct 12, 2004    Post subject: Reply with quote

I think you are mistaken about what his filter says.

[\W\d] means "Either a non-word character OR a digit".

So the part that matches is "...%3ad67%..."

The best solution is to change [\W\d] at the beginning of the filter to just \W.
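
Here is a small sketch of the difference, using Python's re module as a stand-in for Adblock (an assumption; Adblock uses JavaScript regular expressions). The fragment is the piece of the map image URL quoted above, and the example.com ad URL is made up.

```python
import re

# The "ad" alternation from the filter in question, shared by both variants.
CORE = (r"(onlineads?|indiads|ad(banners?|click|-?flow|frame|ima?g(es?)?|_id|js|log"
        r"|serv(er|e)?|stream|_string|s|trix|type|vertisements?|vertisings?|vert|vt|v|xchange)?)")

# Original: a non-word character OR a digit may precede the "ad" part.
original = re.compile(r"[\W\d]" + CORE + r"[\W\d]", re.IGNORECASE)

# Suggested change: only a non-word character may precede it.
stricter = re.compile(r"\W" + CORE + r"[\W\d]", re.IGNORECASE)

fragment = "%3ad67%"                       # piece of the map image URL
ad_url = "http://example.com/ads/top.gif"  # hypothetical ad URL

false_hit = original.search(fragment)
print(false_hit.group())                   # what the original filter latches onto
print(stricter.search(fragment))           # no match after the change
print(stricter.search(ad_url).group())     # ordinary ad paths are still caught
```

The leading digit "3" is what lets the original filter match; once only \W is allowed there, the fragment passes and the /ads/ path is still blocked.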
_________________
Adblock 0.5.3.042
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8) Gecko/20051111 Firefox/1.5
Back to top
G



Joined: 10 Oct 2004
Posts: 550

PostPosted: Tue Oct 12, 2004    Post subject: Reply with quote

Sorry, you're right - I was thinking of my own filter. Regardless, including a digit possibility is very important to effective filtering - a decent RegExp of this sort should start with [\W]\d*
_________________
Filterset.G
Back to top
Mugros



Joined: 28 Sep 2004
Posts: 38

PostPosted: Tue Oct 12, 2004    Post subject: Re: what is wrong with filter Reply with quote

Guest wrote:
I cannot see the map on this officemax link.

http://www.officemax.com/max/solutions/Shanghai?loadPage=http%3A%2F%2Fwww.officemax.com%2Fcgi-bin%2Fmqinterconnect%3Flink%3Dmap2%26show%3Dtnail%26id%3DStore%2520%2523394%26streetaddress%3D3839%2520Emery%2520Street%26recordid%3DStore%2520%2523394%26name%3DEmeryville%26store%3DEast%2520Bay%2520Bridge%2520Center%26address%3D3839%2520Emery%2520Street%26city%3DEmeryville%26state%3DCA%26zip%3D94608%26phone%3D510-652-1140%26hours%3DM-F%25208-9%2C%2520Sa%25209-8%2C%2520Su%252010-7%252a%26distance%3D2.42%26level%3D9&docPart=shanghai&containerQuery=bvc%3Dyes&containerPage=http%3A%2F%2Fwww.officemax.com%2Fmax%2Fsolutions%2Fcustserv%2FStoreLocator.jsp


This is the filter that is blocking it. (It is a slight modification of one in the 'Advanced Filters' sticky notes.)
/[\W\d](onlineads?|indiads|ad(banners?|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|vertisings?|vert|vt|v|xchange)?)[\W\d]/

Can someone help me locate the problem in this filter and show how to fix it?

I don't see why the given regex should match the posted URL. The regex matches strings like 4ad6 but there are none in the URL. Regex coach also doesn't show a match.
Quote:
The best solution is to change [\W\d] at the beginning of the filter to just \W

I also recommend this. My "ad" regex is [\W_]<<ad>>([0-9])?[_\W], where <<ad>> stands for the long "ad" alternation in the filter I posted. It is based on the real-life entries I collected and posted. I have never encountered ads that had a digit before the "ad". Allowing digits, especially more than one digit, on both sides of the core string can give false positives.
Back to top