Jump to content

Talk:Spam blacklist

From Meta, a Wikimedia project coordination wiki
This is an archived version of this page, as edited by Billinghurst (talk | contribs) at 00:17, 29 January 2024 (lyricstubes.com: Added using SBHandler). It may differ significantly from the current version.

Latest comment: 11 months ago by Billinghurst in topic Proposed additions
Shortcut:
WM:SPAM
WM:SBL
The associated page is used by the MediaWiki Spam Blacklist extension, and lists regular expressions which cannot be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any Meta administrator can edit the spam blacklist; either manually or with SBHandler. For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.

Proposed additions
Please provide evidence of spamming on several wikis. Spam that only affects a single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format. Please check back after submitting your report, there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate - that is very often the case.
Other discussion
Troubleshooting and problems - If there is an error in the blacklist (i.e. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.
#wikimedia-external-linksconnect - Real-time IRC chat for co-ordination of activities related to maintenance of the blacklist.
Whitelists
There is no global whitelist, so if you are seeking a whitelisting of a url at a wiki then please address such matters via use of the respective Mediawiki talk:Spam-whitelist page at that wiki, and you should consider the use of the template {{edit protected}} or its local equivalent to get attention to your edit.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.


Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived quickly. Additions and removals are logged · current log 2025/01.

SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days and sections whose most recent comment is older than 7 days.

Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.

lava678.asia



Online casino spam, see spamcheck. --Count Count (talk) 15:09, 21 January 2024 (UTC)Reply

@Count Count: Added Added as stem to Spam blacklist. -- — billinghurst sDrewth 22:26, 21 January 2024 (UTC)Reply

korte-immobilien.de



Crosswiki real estate spam. --Count Count (talk) 11:49, 22 January 2024 (UTC)Reply

@Count Count: Added Added to Spam blacklist. -- — billinghurst sDrewth 21:51, 22 January 2024 (UTC)Reply

nuoixe.vn



Xwiki spam - XXBlackburnXx (talk) 14:21, 22 January 2024 (UTC)Reply

@XXBlackburnXx: Added Added to Spam blacklist. -- — billinghurst sDrewth 21:51, 22 January 2024 (UTC)Reply

clickbuy.com.vn



Xwiki spam [1] - 94rain Talk 17:38, 22 January 2024 (UTC)Reply

@94rain: Added Added to Spam blacklist. -- — billinghurst sDrewth 21:50, 22 January 2024 (UTC)Reply

gtnfoods.com.vn



Xwiki spam [2] - Ninhvuz (talk) 04:37, 23 January 2024 (UTC)Reply

@Ninhvuz: Added Added to Spam blacklist. -- — billinghurst sDrewth 20:50, 23 January 2024 (UTC)Reply

fb88.ngo



Online casino spam, see [3]. --Count Count (talk) 19:40, 24 January 2024 (UTC)Reply

@Count Count: Added Added to Spam blacklist. -- — billinghurst sDrewth 20:43, 24 January 2024 (UTC)Reply

kamagra-premium.net



persistent spam - XXBlackburnXx (talk) 06:13, 25 January 2024 (UTC)Reply

@XXBlackburnXx: Added Added to Spam blacklist. -- — billinghurst sDrewth 13:28, 25 January 2024 (UTC)Reply

lyricstubes.com



Persistent xwiki spam, mainly on the English speaking projects. XXBlackburnXx (talk) 19:38, 28 January 2024 (UTC)Reply

@XXBlackburnXx: Added Added to Spam blacklist. -- — billinghurst sDrewth 00:17, 29 January 2024 (UTC)Reply

Proposed additions (Bot reported)

This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports, please check the records and the link thoroughly, it may report good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (less than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).

Sysops
  • If the report contains links to less than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
COIBot's currently open XWiki reports
List Last update By Site IP R Last user Last link addition User Link User - Link User - Link - Wikis Link - Wikis
minecraftwiki.net 2025-01-14 17:27:21 COIBot 151.101.128.194 R Carbonaro.
Josu PV
VitAlv13
Сале
1970-01-01 05:00:00 669 0

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section. Use a suitable 3rd level heading and display the domain name as per this example {{LinkSummary|targetdomain.com}}. Please do not add the protocol part of domain name, eg. http

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also recurring requests for repeatedly proposed (and refused) removals.

Notes:

  • The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.
  • This page is for the removal of domains from the global blacklist, not for removal of domains from the ===blacklists of individual wikis. For those requests please take your discussion to the pertinent wiki, where such requests would be made at Mediawiki talk:Spam-blacklist at that wiki. Search spamlists — remember to enter any relevant language code

actor.com

The listed \bactor(?:suriya|arya)?\.com\b includes all domain names ending with actor.com. In particular www.timowagner-actor.com is blocked, for which there does not seem to be a valid reason. I think removing the second question mark solves this issue. --Lymantria (talk) 15:42, 21 January 2024 (UTC)Reply

Indeed, it seems I made a mistake 2009-11-29. I don't see a reason why I made this non-capturing regexp group optional. I'll fix that. Sorry. -- seth (talk) 22:13, 24 January 2024 (UTC)Reply
@Lymantria: Done -- seth (talk) 22:23, 24 January 2024 (UTC)Reply

passionate-about.com

This is a photographer's project with several interesting portraits of scientists, which would to my mind be useful references on wikipedia, e.g for physicists like John Ellis, Fabiola Gianotti and Jack Steinberger. --SKraml 21 January 2024

@SKraml: I am guessing that this will be collateral damage for a block on about.com. We will need to amend the regex and run some checks, and I cannot do that immediately, though will look to get to this during the week. You could ask for a local whitelist at the wiki where you are editing to get around this earlier.  — billinghurst sDrewth 22:24, 21 January 2024 (UTC)Reply
@Billinghurst: Hmm, maybe we're going in circles. I responded to a request at the enwiki-whitelist that it may be better to fix it here. Does meta even have a whitelist? If not, replacing the existing entry \babout\.com\b with a regex like \b[^-]about\.com\b should fix it. We've done similar things on the enwiki blacklist. If you prefer that it be whitelisted locally, let me know. Anachronist (talk) 21:48, 24 January 2024 (UTC)Reply
I'll fix it. -- seth (talk) 22:20, 24 January 2024 (UTC)Reply
I prefer (?<=//|\.)about\.com\b, because that's actually what we want.
\b[^-]about\.com\b would not match about.com any longer, but domains such as aabout.com or zabout.com. We could use (?<!-)\babout\.com\b, but that would be very indirect/implicite.
Done -- seth (talk) 22:27, 24 January 2024 (UTC)Reply

sunwing.ca

Currently

\bsunwin\w{0,2}\.

is blocked, because of [4] (@Billinghurst). I'd like to change this to

\bsunwin(?:tx|z|)\.

or at least

\bsunwin(?!g\.)\w{0,2}\.

because sunwing.ca (which seems to be the official website of an airline) is blocked as a false positive. Any objections or preferences? -- seth (talk) 21:41, 22 January 2024 (UTC)Reply

Definitely a false positive. I will need to go back and look at the history of this beast. I do know that it was one of our gambling sites. @Lustiger seth:  — billinghurst sDrewth 22:00, 22 January 2024 (UTC)Reply
I don't understand that change. I thought you wanted to block sunwintx.com, sunwinz.bio, sunwin.town, sunwintx.com, and sunwinz.org (and alike). But they are not blocked any longer now. -- seth (talk) 23:44, 22 January 2024 (UTC)Reply
@Lustiger seth: You have admin rights, you don't need me to fix it, this isn't my page. I backed it off as you wanted it backed off and missing spam is less important than missing good links. Life is so busy and said I would have to go back and look at the history, and I won't have that opportunity anytime soon, so I quickly backed it off.  — billinghurst sDrewth 20:36, 23 January 2024 (UTC)Reply
Ah, ok, i see. I thought might have done that intentionally. Then I'll do something which I think matches more the mentioned domains. -- seth (talk) 23:45, 23 January 2024 (UTC)Reply
Done -- seth (talk) 23:50, 23 January 2024 (UTC)Reply
@Lustiger seth: As a general comment. If we are getting false positives for non-targeted sites when implementing broader "spam" regexes, I think that we are better to move into fix and note space, rather than the need to overly discuss. The aim is to stop the gambling spam, not prevent legitimate sites. I run the regexes through the global search to check, and perfection is not guaranteed as the tool does have its limitations. :-(  — billinghurst sDrewth 20:42, 24 January 2024 (UTC)Reply
  • If there are several ways to solve a problem and it's not clear which is the best (or at least a good) solution, then i'll always ask. It's worth striving for good solutions.
  • Which global search do you mean?
-- seth (talk) 22:06, 24 January 2024 (UTC)Reply
Agreed, though we can back off overly problematic regex to something safe, and then have the discussion about the best means forward. Typically, these are persistent low frequency attacks, so I was more trying to say let us keep the regex safer first, and improve.  — billinghurst sDrewth 00:15, 29 January 2024 (UTC)Reply

bindy.com



This is a notable product and the namesake domain of a Wikipedia article, bindy. — The preceding unsigned comment was added by 216.154.2.168 (talk) 04:25, 27 January 2024 (UTC)Reply

 Declined per Spam blacklist/About#Requests for delisting we „de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects“. bindy.com was frequent subject so xwiki spam (e.g. [5]), and there is no article about the website, just a lot of spam at en:Bindy [6] --Johannnes89 (talk) 08:29, 27 January 2024 (UTC)Reply

Troubleshooting and problems

This section is for comments related to problems with the blacklist (such as incorrect syntax or entries not being blocked), or problems saving a page because of a blacklisted link. This is not the section to request that an entry be unlisted (see Proposed removals above).

Discussion

This section is for discussion of Spam blacklist issues among other users.