I see spam even in YouTube and Twitter comments now a days so I am wondering if it's too hard to handle?
My backend will be either NodeJS or Rust and Postgres database and Redis if needed.
2. Identify TLDs and keywords in spam comments and automatically delete those comments to reduce workload for step 1.
3. If it is an English site, delete all comments not in English alphabet or language of site.
I was able to delete ~98% of comment spam through these three steps. I had a custom page that showed all comments that passed through filters. I could bulk delete spam comments from this page and bulk approve real comments. If I notice spam comments getting through, I will add keywords and domains from those comments to my filter subroutine.
It was a cat and mouse game, but worked reasonably well. I made few good connections and friends through comments so I will not suggest to disable commenting unless there is no reason for someone to comment or can make contact through other means.
Like “what is the answer to 3 + 2?” or “Name one ingredient in a PBJ sandwich” or something like that.
For small sites, that can be enough to weed out spam but enable the “ham” to make it through.
On other sites that I am or have moderating/ed, very tight filtering on who gets in as a user in the first place, with final manual approval, seems to work.