I’ve written a little bot that listens out for mentions of SE23 on Twitter and retweets them.
I’m using this as a coding exercise to learn new technologies. If you’re on Twitter, give the bot a follow and let me know what you think.
Also, if anyone has ideas for this bot, I’d be interested to hear them.
Filter out low-quality tweets
Human curation (retracting RTs) - fiddly, and too late
Throttle tweets from over-tweeters
Follow human-curated list of “tastemaker” tweeters to analyse which accounts / content they like/retweet
Identify high quality tweeters via like_rate/follower_count and RT_rate/follower_count
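As a rough sketch of the like_rate/follower_count and RT_rate/follower_count idea above: the `AccountStats` class, its field names and the threshold values are all made up for illustration, and the counts are assumed to have been collected already for some recent window. This is not what the bot currently does.

```python
from dataclasses import dataclass


@dataclass
class AccountStats:
    """Hypothetical per-account totals over some recent window."""
    tweet_count: int
    like_count: int
    retweet_count: int
    follower_count: int


def engagement_scores(stats: AccountStats) -> tuple[float, float]:
    """Return (like_rate / follower_count, RT_rate / follower_count).

    like_rate and RT_rate are taken to mean average likes and
    retweets per tweet, which is an assumption about the metric.
    """
    if stats.tweet_count == 0 or stats.follower_count == 0:
        # New or empty accounts get the lowest possible score.
        return 0.0, 0.0
    like_rate = stats.like_count / stats.tweet_count
    rt_rate = stats.retweet_count / stats.tweet_count
    return like_rate / stats.follower_count, rt_rate / stats.follower_count


def is_high_quality(
    stats: AccountStats,
    min_like_score: float = 0.001,   # placeholder threshold
    min_rt_score: float = 0.0005,    # placeholder threshold
) -> bool:
    """True when both per-follower scores clear their thresholds."""
    like_score, rt_score = engagement_scores(stats)
    return like_score >= min_like_score and rt_score >= min_rt_score
```

Dividing by follower count means a small local account with a handful of engaged followers can outscore a large account that nobody interacts with. The thresholds would need tuning against real data.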
Filter out mentions of SE23 that aren’t about the postcode
Use following/followers of user to see if they’re in the clique of relevant local users
Use DBpedia (a machine-readable interface to Wikipedia data) to identify keywords related to Forest Hill / Honor Oak
Filter repetitive social media spamming
Manually curated - mute and retract RT - fiddly, and too late
Identify spam tweets by proportion of hashtags
Filter tweets with text matching earlier tweets (from any account)
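The last two spam checks above could look something like the sketch below. The function and class names are hypothetical, and the normalisation rules (lower-casing, dropping links and punctuation) are one guess at what "matching" text should mean. Earlier tweets are kept in memory only.

```python
import re


def hashtag_proportion(text: str) -> float:
    """Fraction of whitespace-separated tokens that are hashtags."""
    tokens = text.split()
    if not tokens:
        return 0.0
    hashtags = [t for t in tokens if t.startswith("#") and len(t) > 1]
    return len(hashtags) / len(tokens)


def normalise(text: str) -> str:
    """Lower-case, strip links and punctuation, collapse whitespace."""
    text = text.lower()
    text = re.sub(r"https?://\S+", "", text)   # shortened links differ per tweet
    text = re.sub(r"[^a-z0-9\s]", "", text)    # drops '#', '@', '!' and so on
    return " ".join(text.split())


class DuplicateFilter:
    """Remembers normalised tweet text, regardless of which account sent it."""

    def __init__(self) -> None:
        self._seen: set[str] = set()

    def is_duplicate(self, text: str) -> bool:
        key = normalise(text)
        if key in self._seen:
            return True
        self._seen.add(key)
        return False
```

A tweet would then be skipped if `hashtag_proportion(text)` exceeds some cut-off (the right value is a judgment call) or if `is_duplicate(text)` returns true. A long-running bot would also need to expire old entries from the seen set.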
Reduce repetition of the same news
Natural language processing to extract semantics and keywords, then compare to previous tweets. See https://code.google.com/archive/p/ws4j/
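A much cruder stand-in for the semantic comparison above is plain keyword overlap (Jaccard similarity). The sketch below does not use ws4j or any WordNet-style similarity; the stopword list, the function names and the 0.6 threshold are all placeholders.

```python
import re

# Tiny illustrative stopword list; a real one would be much longer.
STOPWORDS = frozenset({"a", "an", "and", "at", "in", "is", "of", "on", "the", "to"})


def keywords(text: str) -> set[str]:
    """Lower-cased words from the text, minus stopwords."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {w for w in words if w not in STOPWORDS}


def jaccard(a: str, b: str) -> float:
    """Size of the shared keyword set divided by the size of the union."""
    ka, kb = keywords(a), keywords(b)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / len(ka | kb)


def is_repeat(text: str, previous: list[str], threshold: float = 0.6) -> bool:
    """True if the text overlaps heavily with any earlier tweet."""
    return any(jaccard(text, old) >= threshold for old in previous)
```

This catches the same story reworded with the same vocabulary, but not paraphrases that use different words, which is where proper semantic similarity would earn its keep.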
Filter non-English / non-local tweets
Human curation (blocking bad actors) - fiddly, and too late
Whitelist accounts by analysing quality metrics (e.g. average like count per tweet per follower count)? New / spam accounts will have low scores for these metrics.