I’ve got a small Box AI application, and we’re considering exposing its output to the public. Does anyone have best practices for profanity and inappropriate-content filtering with Box AI? It would be enough for Box AI to say “I’m sorry. I can’t answer that question.” or something similar when the user asks an inappropriate question. Do I just handle that through the prompt in the API call, or is there other functionality I should be using that provides more structured content filtering?
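To make the question concrete, here is a minimal sketch of an application-side check that could sit alongside whatever is done in the prompt. It is not a Box feature. Everything named here is an assumption for illustration: `ask_model` stands in for whatever function wraps the Box AI call, and the blocklist terms are placeholders. It screens both the user's question and the model's answer against a word list and returns the canned refusal on a match.

```python
import re
from typing import Callable, Iterable

# Canned refusal text, taken from the wording in the question above.
REFUSAL = "I'm sorry. I can't answer that question."


def build_blocklist_pattern(terms: Iterable[str]) -> re.Pattern:
    """Compile a case-insensitive, whole-word pattern from blocked terms."""
    escaped = [re.escape(t) for t in terms if t]
    if not escaped:
        raise ValueError("blocklist must contain at least one term")
    return re.compile(r"\b(?:" + "|".join(escaped) + r")\b", re.IGNORECASE)


def guarded_ask(
    question: str,
    ask_model: Callable[[str], str],  # hypothetical wrapper around the AI call
    pattern: re.Pattern,
) -> str:
    """Refuse if the question or the model's answer contains a blocked term."""
    if pattern.search(question):
        return REFUSAL  # never forward a flagged question to the model
    answer = ask_model(question)
    if pattern.search(answer):
        return REFUSAL  # also screen the output before showing it publicly
    return answer
```

A plain word list like this only catches exact terms, so it is a baseline rather than structured content filtering; misspellings and inappropriate questions phrased in clean language pass straight through.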
Question
Box AI - profanity filter?
