Anthropic says AI could one day ‘sabotage’ humanity but it’s fine for now

The firm investigated four distinct “sabotage” threat vectors for AI and determined that “minimal mitigations” were sufficient for current models. 
