HAPI robots identity and behavior #148

Closed
@berniegsfc

HAPI robots should be easily identifiable and well behaved. Specifically, the HTTP User-Agent header value should include a name containing "bot" and a URL to a page with more information about the bot, including a contact. For example,

Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/99.0.4844.84 Mobile Safari/537.36 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_5) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/13.1.1 Safari/605.1.15 (Applebot/0.1; +http://www.apple.com/go/applebot)

For this issue, the significant parts of the above examples are

(compatible; Googlebot/2.1; +http://www.google.com/bot.html)
(Applebot/0.1; +http://www.apple.com/go/applebot)
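
As a rough illustration of the convention above, here is a minimal Python sketch that builds such a User-Agent value and, on the server side, picks the bot token out of one. The function names, the regular expression, and the `examplebot` name and `example.org` URL are all assumptions made for this sketch; none of them come from the HAPI specification or from any existing HAPI client.

```python
import re

# Assumed pattern: "<name>/<version>; +<url>", where <name> contains "bot".
# This mirrors the two examples above; it is not a standard grammar.
BOT_TOKEN = re.compile(
    r"([\w.-]*bot[\w.-]*)/([\w.]+);\s*\+(https?://[^\s;)]+)",
    re.IGNORECASE,
)


def build_user_agent(name, version, info_url):
    """Return a User-Agent value that carries the bot name and an info URL."""
    if "bot" not in name.lower():
        raise ValueError('bot name should contain "bot"')
    # Hypothetical layout, modeled on the "(compatible; ...)" form shown above.
    return f"Mozilla/5.0 (compatible; {name}/{version}; +{info_url})"


def identify_bot(user_agent):
    """Return (name, version, info_url) if a bot token is found, else None."""
    match = BOT_TOKEN.search(user_agent)
    return match.groups() if match else None
```

A server host could call `identify_bot` on each logged User-Agent value and leave any request that returns a tuple out of its usage report. A client would pass something like `build_user_agent("examplebot", "1.0", "https://example.org/bot")` as the header value.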

This identity helps HAPI server hosts exclude bot requests from usage reports.
The bot should also respect robots.txt and all standard HTTP rate-limiting mechanisms (e.g., a 429 response, the Retry-After header, etc.).
This may be related to issue #135 if a bot pings servers repeatedly.
Currently, @rweigel, @jbfaden, and @sandyfreelance are known to be operating HAPI bots.
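
The behavior described above could be sketched in Python roughly as follows, using only the standard library. This is one possible shape, not a reference implementation: the helper names, the 60-second fallback delay, the retry count, and the choice to pass only the short bot name to the robots.txt check are all assumptions made for illustration.

```python
import time
import urllib.error
import urllib.request
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime
from urllib.robotparser import RobotFileParser


def allowed_by_robots(robots_lines, bot_name, url):
    """Check a URL against robots.txt content already split into lines."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    # Assumption: match on the short bot name rather than the full header value.
    return parser.can_fetch(bot_name, url)


def retry_after_seconds(value, now=None, default=60.0):
    """Convert a Retry-After value (seconds or HTTP date) to a delay in seconds."""
    if value is None:
        return default  # assumed fallback when the server sends no hint
    value = value.strip()
    if value.isdigit():
        return float(value)
    try:
        when = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        return default
    if when.tzinfo is None:
        when = when.replace(tzinfo=timezone.utc)
    now = now or datetime.now(timezone.utc)
    return max(0.0, (when - now).total_seconds())


def polite_get(url, user_agent, max_attempts=3):
    """Fetch a URL, waiting as instructed whenever the server answers 429."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    for attempt in range(max_attempts):
        try:
            with urllib.request.urlopen(request, timeout=30) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            if err.code != 429 or attempt == max_attempts - 1:
                raise
            time.sleep(retry_after_seconds(err.headers.get("Retry-After")))
```

In this sketch a bot would fetch and cache the server's robots.txt once, call `allowed_by_robots` before each request, and route every request through `polite_get` so that a 429 response pauses the bot instead of triggering an immediate retry.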