srctree

Gregory Mullen parent d230f0e2 2039a6e5
draft of persona non grata

inline split

filename was Deleted added: 134, removed: 1, total 133

@@ -0,0 +1,133 @@

+++

layout = "post"

title = "Persona Non Grata"

email = "non-grata"

date = 2025-04-22

tags = ["infosec", "security", "drama", "llm", "abuse"]

draft = false

#trunc = 300

summary = """I had to stop myself from giving this article it's previous working title: Stop

Externalizing The Costs Of Your Service Directly Into Your Users Face. This is

the follow up I promised to write, when I omitted the longer complaint I had

about Anubis"""

+++

I had to stop myself from giving this article it's previous working title:

> Stop Externalizing The Costs Of Your Service Directly Into Your Users

> Face[^cmpwn].

[^cmpwn]: [Please stop externalizing your costs directly into my

face](https://drewdevault.com/2025/03/17/2025-03-17-Stop-externalizing-your-costs-on-me.html)

This is the follow up I promised to write, when I omitted the longer complaint I

had about [Anubis](https://anubis.techaro.lol/) from my last post [about

consent]({{< ref consent-is-required.md >}}). Seeing people use Anubis makes me

angry! But you should in no way take as a critique against Anubis itself. If I

wanted to be shallow, there's plenty of cheap shots I could take against it's

[source code](https://github.com/TecharoHQ/anubis) I could only do so because

I'm still able to find humor in being a troll, the pedant in my dictates that I

claim it has some flaws, but there's nothing egregiously wrong with it. It's

good (enough) code! I'm even happen to double down, and I'm happy to link to

their blog which you can find....

![ https\://xeiaso.net/blog ](/assets/xeiaso-blog-down.png)

... oh, sorry, nevermind I can't reach their blog anymore. That's a shame I have

to admit that I didn't always agree with all of their takes[^never] but I've yet

to come across anything that's been flat out wrong. I've happily linked their

blog to others, and if it was still up, I'd still link to it.

[^never]: I never do, for anyone, but such is the life of a pedant!

Oh, I know, surely the internet archive has it!

![ https\://xeiaso.net/blog ](/assets/xeiaso-blog-down-archive.png)

Fuck!

Well, I guess the internet just got a lot worse. The worst part; it's made

this way by people who purport to care about an open internet. I almost decided

not to include the source for the original title. But I eventually broke down,

because... well I haven't completely figured that out yet, but I feel I need to

explain exactly why this proof of work captcha feels like the right solution to

so many people, and then hopefully why you should still shun it.

I don't object to the idea, nor the concept of proof of work captcha. I've

advocated for trying it a number of times at `$OLD_JOB`, but was never invested

enough to charge up that hill. It's a good solution to consider when you're

unable to isolate scripted, from organic traffic. It's infinitely better than

visual human captchas. The part I object to, is serving it to **me**, or your

users who may or may not know better, or to the internet archive. Yes yes, it's

very cute you have an exclusion for curl, but that's still only in the security

through obscurity category, so you're welcome to claim the negative credit for

that!

I know exactly why, Xi is doing it, it's their new toy[^cool]. But I also know

why SourceHut is doing it, and it's not because it's the best solution. It's

because the owner of sourcehut is angry, and he's taking it out on the users.

[^cool]: and it **is** a cool toy!

> If you personally work on developing LLMs et al, know this: I will never work

> with you again, and I will remember which side you picked when the bubble

> bursts.

*sigh*

> I miss the old Drew.

That's a quote from a friend, I remember it, because a few weeks before that I

said literally, the exact same thing to a completely different friend. I said

about a year ago, and I said it again this past month. The new drew is more

interested in punishing people than build cool stuff that improves the world,

and watching that change has been very disheartening. I miss the sircmpwn that

wasn't exclusively angry.

The old ddvault understood that you don't punish users, you punish the people

directly responsible for the bad behavior[^go]. The irony here being the article

about blocking only the abusive behavior, is linked within the article

explaining how they're "forced" to serve this captcha to users. I run

[srctree.gr.ht](https://srctree.gr.ht), and I can attest, the new breed of AI

crawlers love git hosts. I've spent plenty of time watching logs scroll by that

I would classify as abusive. But **none** of them are subtle. The argument that

these are advanced persistent threats, that are impossible to identify, and are

constantly rotating identifiers, but have also chosen to spend this obfuscation

time on crawling webpages, instead of a simple git clone, doesn't pass the sniff

test. I don't buy it. I'm sure some of you have already picked your side, so

I'll gladly fill in your objection for you. srctree is infinitesimally smaller

than sr.ht, so obviously I've just never seen the real abuse! Perhaps, anyone

with access to a host logs that's willing to share multiple KB of "raw" logs

showing mostly residential ASNs, maybe I can help you out. Until then it seems

unlikely, and doesn't match the traffic I've seen.

[^go]: [ SourceHut will (not) blacklist the Go module

mirror](https://sourcehut.org/blog/2023-01-09-gomodulemirror/)

### The solution I'm using on srctree[^joke]

[^joke]: There was a joke there, but I had to change it because I'm afraid of the C++

committee.

Ban them. No, seriously. If you notice abuse from a given ASN, ban the subnet.

[I've published my rules](https://github.com/GrayHatter/rules/)[^update] Some

note worthy entries, just tonight I banned the FB ASN, I'm sure that'll cause

some problems in the future, but that's future me's responsibility. As I

mentioned previously, I'm sure I'm gonna ban Alibaba's subnet next. Hetnzer is

persona non grata, GCP almost got the same treatment, but I filed an abuse

report, and it stopped[^shocked].

[^update]: I totally promise to update it soon... promise!

[^shocked]: I know, I didn't expect it to work either. But the timing seemed to

match well.

That's it, it takes banning a new ASN every other week. But the problem is

manageable. Without forcing readers to complete a POW.

TODO:

Finish describing the firewall rules.

Describe bot detection in verse.

Detect if you're banned from srctree

Describe the WHY.