Guide: How to use open safety models for Trust & Safety

The RMC's first community project sprint is to draft a short guide on how to use open safety models in a T&S context. This guide should include the following:

**Context Setting**
- Brief overview of the DIRE framework for T&S capabilities (for reference, see [ROOST's roadmap](https://github.com/roostorg/community/blob/main/roadmap.md?source=blog))
- Where AI models fit in that framework

**Deep Dive**
- How AI models fit into specific T&S architectures, with examples / diagrams
- Why and when T&S teams might use open safety models instead of closed / generalized models

**Specific Tips & Tricks**
- How to not break the bank using the wrong models / how to use with other model types (e.g., how to use a general classifier that's really cheap to run, then send off to specialized models or humans for more review)
- How to run and/or integrate different models (e.g., step by step guide to converting a HF link to something you can run locally / call)

My plan is to finish a draft guide by the end of the month! Ideally, I will draft one section per week, wrapping by 3/27. I'll share a Google Doc for any contributors who want to weigh in!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Guide: How to use open safety models for Trust & Safety #52

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Guide: How to use open safety models for Trust & Safety #52

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions