Charcoal’s goal is to detect and delete every spam post on the Stack Exchange network that violates the self-promotion guidelines. Our bot, SmokeDetector, uses a variety of methods to detect this spam, including watching for any mentions of particular keywords/websites in our series of curated watchlists/blacklists. These lists encompass a large number of keywords and domain names, and it’s not always clear why a particular item was added to the list. This information should help clarify that; if you need more detail, feel free to ping a member of the Charcoal team or email us.
For a summary (TL;DR) of this page, scroll down.
From our past experience, websites that have appeared on Stack Exchange in confirmed spam posts are likely to appear again in future spam posts, so we “watch” them upon spotting any potentially suspicious appearance. This brings any future posts containing the link to our attention, so we can further review the posts to determine if they’re actually unsuitable on Stack Exchange.
That says, being on the watchlist doesn’t imply that the website is definitely spam, but rather one of our members thinks it might be correlated to spam, and is worth future attention.
In the future, if the website turns out to be legitimate (i.e. not spam) then we remove it from the watchlist. However, if this is not the case, and the website continues to appear in future spam posts, then we may decide to ‘upgrade’ the site from the watchlist to the blacklist. The blacklist is subject to several stronger criteria than the watchlist, as documented here - at present, we require that a website appear in a sufficient number of spam posts with no legitimate standing.
If a website appears on our blacklist, it means that the website is a strong indicator of spam on the Stack Exchange network. As such, blacklisted websites carry a greater weight in our system.
Similar to the watchlists, websites may be removed from the blacklist; however, this is a rarer occurence. Any external parties requesting for a domain to be removed from the blacklist must follow the defined process.
Not necessarily. Our watchlists/blacklists are designed to help identify content on the Stack Exchange network that would be classified by their rules as spam: “exists only to promote a product or service, and does not disclose the author’s affiliation.”.
There are several websites on our lists that are generally considered legitimate (e.g. fiverr), but are watched because they are commonly used in self-promotion on Stack Exchange. Similarly, there are many scammy or otherwise unsavory websites that do not appear on our lists, because they have not been seen in Stack Exchange spam before. Because of this, we do not recommend using our lists to judge the legitimacy of a website, rather we suggest that you perform further research of your own.
Glad you asked. Here is the search page for our backend database, metasmoke. Using this tool, you can access data for every post that we have ever classified, including archived post text, feedback on whether or not it was spam, why it was caught, and any general comments. The domain tracking tool may also be of use to track down related posts. You can use these tools to investigate the context in which any websites/keywords have been used on Stack Exchange in spam.
We also suggest exploring other avenues of research:
When in doubt, it may be worth getting in contact with the website/company directly - if the service is legitimate, then they should be more than happy to answer your concerns. It is always possible that their spam posts on Stack Exchange were simply a misunderstanding of the self-promotion rules. Whatever the case, it is up to you to make an informed decision about the service based on all the information avaliable.
As a summary:
Hopefully this page has answered all of your concerns. If you have any further questions, feel free to contact [email protected]
and we will be happy to help.
This page does not constitute legal advice. Charcoal does not condone or take any responsibility for any contact, interactions or business deals between any third parties (including Stack Exchange), as a result of, or in relation to, the information presented on this page.