@CobaltVelvet @orielle From what I know of neural nets, you're going to need to have a pretty big dataset, the dataset better not be polluted with implicit bias from the data collectors and people providing the rating of offensiveness, and, from my understanding of a couple of recent reports, might actually be fooled by some trivial changes by the adversaries? But it's worth looking into to see if it's feasible.