Hey Google's Association Service bot, thank you for the 400,000+ requests for assetlinks.json file over the last 9 hours, but we truly meant it when we said 404 - File Not Found. KThxBye. #abuse
GoogleAssociationService bot was kind enough to ask 1,000,000+ times yesterday for the same file from 4,000+ Google IP addresses. The answer was the same: 404 - File Not Found. Unlike their other bots, its User-Agent does not provide a support link.
@osm_tech The only solution I can see for all this shit is the IDP.
(And, because search-engines are so clueless about the history of the 'net: https://www.catb.org/jargon/html/I/Internet-Death-Penalty.html)
@mikro2nd
As much as I dislike Google, a lot of people & browsers still seem to be using them as a search engine...
Applying IDP to Google IP ranges would mean that nobody would be able to find #OpenStreetMap on Google Search, and would instead probably get some scammers as the first result. I don't think that is an ideal outcome.
@osm_tech
@osm_tech personally, I'd block all the #GAFAMs by their entire #ASN|s!
Fuck the crawlers; #Blackholing of their #DDoS attacks is the only feasible option!
Also send an #AbuseReport to them, and to every provider between you and them, every time they try that shite...
@osm_tech what if you explicitly ban that path in your robots.txt file?
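For anyone wanting to try the robots.txt idea, here is a rough sketch of what such a rule might look like, checked locally with Python's standard `urllib.robotparser`. Several things here are assumptions, not facts from the thread: the path `/.well-known/assetlinks.json` (the usual Digital Asset Links location; the thread only names the file), the user-agent token `GoogleAssociationService` (taken from the bot name mentioned above), and above all whether this bot reads or honors robots.txt at all.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the path and the user-agent token are assumptions.
ROBOTS_TXT = """\
User-agent: GoogleAssociationService
Disallow: /.well-known/assetlinks.json

User-agent: *
Allow: /
"""


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if the rules above permit user_agent to fetch path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    # The named bot is denied the assetlinks path but nothing else.
    print(is_allowed("GoogleAssociationService", "/.well-known/assetlinks.json"))
    print(is_allowed("GoogleAssociationService", "/"))
    # Other crawlers fall through to the catch-all entry.
    print(is_allowed("Googlebot", "/.well-known/assetlinks.json"))
```

This only verifies that the rules parse the way you expect. A crawler that ignores robots.txt will keep asking regardless.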
@osm_tech it’s becoming clear that we need to be able to block all crawlers somehow