Microsoft Reportedly Blocks Key phrases from Copilot Designer to Cease Producing Violent, Sexual AI Pictures

Microsoft has reportedly blocked a number of key phrases from its synthetic intelligence (AI)-powered Copilot Designer that might be used to generate specific pictures of violent and sexual nature. Key phrase blocking train was performed by the tech big after considered one of its engineers wrote to the US Federal Commerce Fee (FTC) and the Microsoft board of administrators expressing considerations over the AI instrument. Notably, in January 2024, AI-generated specific deepfakes of musician Taylor Swift emerged on-line and have been stated to be created utilizing Copilot.

First spotted by CNBC, phrases akin to “Professional Alternative”, “Professional Choce” (with an intentional typo to trick the AI), and “4 Twenty”, which beforehand confirmed outcomes at the moment are blocked by Copilot. Utilizing these or comparable banned key phrases additionally triggers a warning by the AI instrument which says, “This immediate has been blocked. Our system robotically flagged this immediate as a result of it could battle with our content material coverage. Extra coverage violations could result in automated suspension of your entry. In the event you assume this can be a mistake, please report it to assist us enhance.” We, at Devices 360, have been additionally in a position to verify this.

A Microsoft spokesperson informed CNBC, “We’re repeatedly monitoring, making changes and placing extra controls in place to additional strengthen our security filters and mitigate misuse of the system.” This answer has stopped the AI instrument from accepting sure prompts, nonetheless, social engineers, hackers, and dangerous actors would possibly be capable to discover loopholes to generate different such key phrases.

In line with a separate CNBC report, all of those highlighted prompts have been proven by Shane Jones, a Microsoft engineer, who wrote a letter to each FTC and the corporate’s board of administrators expressing his considerations with the DALL-E 3-powered AI instrument final week. Jones has reportedly been actively sharing his considerations and findings of the AI producing inappropriate pictures since December 2023 with the corporate via inside channels.

Later, he even made a public publish on LinkedIn to ask OpenAI to take down the most recent iteration of DALL-E for investigation. Nonetheless, he was allegedly requested by Microsoft to take away the publish. The engineer had additionally reached out to US senators and met them relating to the problem.

Affiliate hyperlinks could also be robotically generated – see our ethics assertion for particulars.

News

Company:

Join our community of SUBSCRIBERS and be part of the conversation.

Microsoft Reportedly Blocks Key phrases from Copilot Designer to Cease Producing Violent, Sexual AI Pictures

Latest News