Using Role-Playing Scenarios to Identify Bias in LLMs
Software Engineering Institute (SEI) Podcast Series - En podcast af Members of Technical Staff at the Software Engineering Institute
Kategorier:
Harmful biases in large language models (LLMs) make AI less trustworthy and secure. Auditing for biases can help identify potential solutions and develop better guardrails to make AI safer. In this podcast from the Carnegie Mellon University Software Engineering Institute (SEI), Katie Robinson and Violet Turri, researchers in the SEI’s AI Division, discuss their recent work using role-playing game scenarios to identify biases in LLMs.