Devin Coldewey / TechCrunch:
Anthropic researchers find adding pleas to a prompt that tell its Claude 2 model not to be biased could reduce discrimination based on race, gender, and more — The problem of alignment is an important one when you’re setting AI models up to make decisions in matters of finance and health.
Source link