And this has been fixed.
Again, I was tired of waiting for a fix, so I used Claude to help with this.
It ended up making a plugin for this site which is a lot cleaner. It’s not perfect, but at least it is something.
This applies to posts coming from the Fediverse and posts created on Caint.ie
Whenever a user on Caint.ie posts in a language that is not English nor Irish, they will see this message:
“Your post appears to be in a language that is not currently accepted on this forum. Caint only accepts posts in English or Irish (Gaeilge). If you believe this is a mistake, please contact the site administrator.”
Fediverse posts that are not in English nor Irish will simply not come through.
There are some rules:
A post will be ALLOWED if:
The text is shorter than 10 characters (too short to detect reliably)
The language cannot be determined (returns und)
The detected language is English (eng)
The detected language is Irish (gle)
A post will be BLOCKED if:
The text is 10 characters or longer, AND
The language can be determined, AND
The detected language is anything other than English or Irish
Additional things worth knowing:
HTML tags are stripped before detection, so formatting does not affect the result
Detection is based on the post content for replies, and the content or title for new topics
The language detection is statistical – very short posts that scrape over the 10 character minimum may occasionally be misidentified
Mixed language posts will be judged on whichever language dominates the text
Posts where the language genuinely cannot be determined are always let through rather than risk blocking legitimate English content