AI uBlock Blacklist

(github.com)

132 points | by rdmuser 9 hours ago

16 comments

quiet35 1 hour ago
I like the idea and even considered contributing to the list, but this stopped me:
> NAQ (Never Asked Questions)
> My website is on your list!
> Cry about it.
That's quite a suspicious attitude. Clearly the maintainer believes he is infallible. I understand the emotions behind this, but this is not how a public blacklist should be maintained.
[-]
- TonyTrapp 1 hour ago
  Yuuup. My personal website has been inaccessible to a few friends, they thought my server was down. It turned out they had some blocklist (not related to AI) installed on their PiHole, and for whatever reason my website was on that list. It is, in fact, to this day, because my request to unblock it went completely unanswered. I still don't know why the website is on the list.
  [-]
  - jorvi 15 minutes ago
    Go to the Adguard GitHub (or use the extension) and report it. And get all your friends to switch to Adguard extension and Adguard Home (Pi Hole alternative) as blockers.
    Easylist and its sublist are notorious for being extremely poorly maintained and ignoring issues opened against it. Adguard is much more active in maintaining its lists. Especially Adguard its language blocklists have much, much less breakage and missed ads than Easylist.
  - VladVladikoff 51 minutes ago
    Perhaps it got hacked and was hosting malware without you being aware? They are pretty good at hiding it from the site owner (showing the original website to you, but not to others).
    [-]
    - TonyTrapp 47 minutes ago
      The server is and has been clean the whole time. I don't even run WordPress or anything similar on that server that would be a common hacking target. If it was hacked, I'm pretty sure Google Safe Browsing would be the first to flag the site, not some random PiHole list.
- NeutralCrane 12 minutes ago
  Also seems a bit hypocritical given the screed about how such a list is necessary because the AI content might output hallucinations or damaging content without review.
  But if it’s the author’s blocklist that is wrong, unverified, and causing harm to others? Cry about it.
dhayabaran 1 hour ago
The false positive problem gets worse over time too. Domains get sold, sites pivot, old content gets removed. A blocklist with no removal process and a "cry about it" attitude in the FAQ is basically a one-way reputational blackhole. At minimum it needs an expiry or re-review mechanism. Even browser safe browsing lists re-check URLs periodically.
throwatdem12311 1 hour ago
Ublock Origin also already has an “AI widget” blocklist you can enable. Literally the only extension that keeps me on Firefox because of how useless it is on Chromium.
rdmuser 9 hours ago
A new more grounded list focused on specifically blocking content farms and similar low quality sites.
A nice alternative to this very broad anti ai list: https://github.com/laylavish/uBlockOrigin-HUGE-AI-Blocklist
Edit: Oh I should mention I found it through reddit and there is some good discussion there where they describe how they find stuff etc: https://www.reddit.com/r/uBlockOrigin/comments/1r9uo3j/autom...
[-]
- Dwedit 3 hours ago
  The broad list seems to just be a hater list. It's not trying to cover cases of deception (passing off AI material as if it's something else), as it includes sites which are very open about what kind of content is on there.
  [-]
  - malfist 6 minutes ago
    Would you say the same about a block list that blocks anything else? I don't care how obvious an ad is, I don't want to see it. Same with social widgets or cookie consent banners, or newsletter sign-ups.
    But I wouldn't call the person that maintains the news letter popup block list as "newsletter hater"
- smusamashah 1 hour ago
  So there is a spreadsheet of websites. That is very interesting. There was an article here sometime ago about a media group who have so many super SEOd websites. They all have common footer text. I searched and added as many as I could find in uBlacklist. I have a gist listing them and how I searched for them. You might find that useful.
  Edit: https://gist.github.com/SMUsamaShah/6573b27441d99a0a0c792431...
- xnx 4 hours ago
  Hasn't been updated in 5 months
  [-]
  - rdmuser 4 hours ago
    Oh good point I also overlooked that with the anti ai list.
    The big anti ai list also seems to be focused on hiding links from ddg/bing/google where this new more focused list just blocks sites. I tend to like block ones vs hiding because they pop up a nice warning no matter where I came from and I can still decide to ignore it if I want so they is more user agency instead of just quietly hiding a unclear chunk of the net from search engines.
ramon156 1 hour ago
I would rather have a whitelist that adds a nice tag at the end of the link, indicating that overall it has high quality content. This also forces you to periodically check the sites you've whitelisted
amelius 2 hours ago
At least we're not yet in the phase where we have a whitelist for the internet.
[-]
- papichulo2023 2 hours ago
  We were close but the app dominance declined.
ossa-ma 51 minutes ago
Glad we're moving in this direction, I've also got a tool that I use to determine if writing is AI using common tropes and reconstruct the OG prompt from it: https://tropes.fyi/aidr
[-]
- mh- 48 minutes ago
  Haha, that's a neat idea. Thanks for sharing.
  https://tropes.fyi/aidr/b184cf3a
  https://tropes.fyi/aidr/9b132f92
dimava 1 hour ago
Also check the https://botblock.ai/ , AI extension to detect AI replies on twitter
[-]
- add-sub-mul-div 47 minutes ago
  That's a curious one, Twitter is worthless anyway. Before AI bots proliferated, the change to rank paid accounts high in replies turned it into a de facto entry level $8/month advertising tier.
lifthrasiir 9 hours ago
Not necessarily disagreeing the whole principle...
> All I hear is skill issue. Imagine needing an AI to write stuff.
Grammarly users (and underrepresented non-English speakers) would complain.
[-]
- QuadmasterXLII 4 hours ago
  There’s not a single group who’s ever been told skill issue that didn’t complain
  [-]
  - tclancy 1 hour ago
    Sure, but there also plenty of times “get gud!” is used for gate keeping. Life is on a continuum, man.
- duskdozer 3 hours ago
  If you don't know English and you want to write English anyway, please just use a machine translator.
  [-]
  - mrweasel 1 hour ago
    From experience: If you don't know Danish, please don't ever use machine translators to translate from English. Regardless of what some people may think, they make mistakes, so many mistakes.
    I get why it's tempting, good translators are expensive, and few and far between. A friend of my is a professional translator and she's not exactly in need of work, but a lot of customers look at her prices and opt for machine translations instead and the result not always impressive. Errors range from wrong words, bad sentence structure to an inability to correctly translate cultural references.
    [-]
    - embedding-shape 1 hour ago
      Right, makes sense for Danes, or other population where English knowledge is basically ubiquitous. But I'm think it might look differently in other places, if the choice is between "Badly translated but I can understand 95% of it" and "In a language I don't understand at all, maybe 1% I could figure out", then the choice might be a bit different.
      [-]
      - ploum 47 minutes ago
        nope, let the user does the translation, with his own choice of tool and being thus perfectly aware of the shortcomings.
        I know that some people translate my French posts to read them. That’s really cool. But I would never post something I didn’t write myself (but I use spellcheking tools. I even sometimes disagree with them)
  - victorbjorklund 2 hours ago
    And the machine translator is using AI to translate the text
  - GaggiX 3 hours ago
    Why? A model correcting your errors is a powerful tool to learn the language, much better than just writing the phrase in your native language.
  - UqWBcuFx6NV4r 3 hours ago
    …what? no? why?
- rdmuser 9 hours ago
  Personally I find that I prefer badly written english or auto-translated stuff written in languages foreign to me over ai generated or even just ai polished works I've seen. There is just so much more character, depth and variance there vs ultra ai generic or slop text.
  That being said this project seems focused on content farms not people who just need a little help writing so this whole conversation is a bit of a side tangent.
  [-]
  - flkiwi 3 hours ago
    One of my coworkers is EXTREMELY capable but functionally almost illiterate. He’s recently discovered that he can put an idea in Copilot and have it generate an email. So now instead of brief, correct, but difficult to parse emails we receive 20-paragraph, bulleted, formatted OpenAI slop. It’s been a very strange thing to see, like someone getting extraordinarily bad cosmetic surgery.
    [-]
    - ploum 45 minutes ago
      "One of my coworkers is EXTREMELY capable but functionally almost illiterate."
      I cannot imagine what it means. To me it reads like "I know someone who can run very fast but has no legs."
      [-]
      - vogu66 11 minutes ago
        Capable doesn't mean capable of office work though, I could see someone with a language disorder doing electronics and have trouble with words, not numbers. Or someone who has trouble with written words specifically doing most of their learning with classes and videos.
    - dawnerd 1 hour ago
      Sam, and when you ask them a deeper question about it on a call they usually have no idea. It’s making people very lazy.
    - wolvoleo 1 hour ago
      Yeah I hate it when people do that and I always call them out on it.
      Unfortunately our company is trying to be "AI First" so they'll just point to that and continue their bullshit.
      Our company literally promotes AI slop over personally made content even if it's mediocre crap. All they care about is rising usage numbers of things like copilot in office.
    - SpicyLemonZest 1 hour ago
      I have a similar coworker, but he's not great at prompting, so 10% of the time the AI version of himself makes confident assertions that he did not intend and are clearly not true. Genuinely no idea what I'm supposed to do about it.
      [-]
      - flkiwi 58 minutes ago
        Exactly right. He’s good at what he does, except communicating, and people are beginning to associate him with AI slop they don’t have time to read rather than the excellent work he does for them.
    - OGEnthusiast 1 hour ago
      [dead]
  - lifthrasiir 9 hours ago
    I mean, I know it is probably tongue in cheek but that never-asked-question was particularly out of place. Massively generated AI contents are usually not THAT thoughtful anyway.
- dangus 2 hours ago
  This specific list from this specific author isn’t worth using since they refuse to remove items from the list if domain ownership changes.
  E.g., bought a domain that previously hosted AI content.
  E.g., Whitehouse.com used to be a porn site, now it’s not.
- jofzar 4 hours ago
  I use Grammarly at work (it's mostly to make sure our brand guidelines are kept) and I don't find that it (defaultly) corrects too far into the ai slop territory. It's mostly just making sure your sentence is correct.
  Op is going after AI slop bot farms like android authority
  [-]
  - smohare 1 hour ago
    [dead]
- rererereferred 5 hours ago
  I mean, the reason we use grammarly is because we recognize we have a skill issue.
meindnoch 1 hour ago
Also need a rule that filters out HN submissions from that Simon Wilson guy.
[-]
- nicbou 1 hour ago
  Why? He posts high-quality content that's interesting if you care about that field. It's not my cup of tea, but it's pretty far from what this list tries to block.
  [-]
  - selridge 8 minutes ago
    Because they are scared of the future. That’s why.
- eclipticplane 52 minutes ago
  His articles are _about_ AI though, not AI slop?
  [-]
semiinfinitely 29 minutes ago
Tragic twist: repo was entirely AI generated
[-]
- mixtureoftakes 22 minutes ago
  media.tenor.com/oW5zO_6gu5gAAAAi/theomegaoof-emoji.gif
filldorns 8 minutes ago
Come on guys, 2026 and you still using "blacklist". Why not BlockList?
Dwedit 2 hours ago
What happens if a legitimate site (forums, wiki, etc) gets mass-spammed with slop?
[-]
- harladsinsteden 2 hours ago
  I ceases to be legitimate.
afcool83 4 hours ago
Admirable idea and execution…but it does apply opposing evolutionary/economic pressure for AI-slop to become less detectable over time. AI will learn and adapt.
Metaphorically speaking, it’s the Borg we’re dealing with, not the Klingons. All Janeway did was slow the Borg’s progress.
[-]
- alansaber 3 minutes ago
  It's actually rather difficult for SoTA models to shift tone without losing performance on various datasets, so not such a one-sided arms race.
- mapontosevenths 3 hours ago
  Cory Doctorow wrote a story ~20 years ago about how the first sentient machines would be spam bots because their job is to pass as human, and anti-spam systems provide competitive evolutionary pressure.
  He may not be too far off.
  [-]
  - tetris11 2 hours ago
    This one?
    https://craphound.com/overclocked/Cory_Doctorow_-_Overclocke...
    [-]
    - mapontosevenths 1 hour ago
      I think that's the one. I was a bit off on the timing, it's not 20 yet. Great read either way.
      From the story:
      “Spam-filters, actually. Once they became self-modifying, spam-filters and spam-bots got into a war to see which could act more human, and since their failures invoked a human judgement about whether their material were convincingly human, it was like a trillion Turing-tests from which they could learn. From there came the first machine-intelligence algorithms, and then my kind.”
- smohare 1 hour ago
  [dead]
metalman 2 hours ago
flip it, and build green(organic) lists perhaps work towards having sites than dont just, not use AI, but never talk about it it's not just AI, search is a scam, no mojo in the world can extract the contact info for the business next door and the mountains of porncoin, scamulous garbage and hate news taking up a full 50% of whats left, does in fact make a determined effort to greenwall a section of the web something to consider
firebot 2 hours ago
Firefox already feeling more responsive.