Results for "safety"
Safety Filter
Intermediate
Automated detection and prevention of disallowed outputs (toxicity, self-harm, illegal instructions, etc.).
A safety filter acts like a bouncer at a club, checking whether people can enter based on certain rules. In AI, it checks the outputs generated by a model to make sure they are safe and appropriate. For example, if an AI is asked to write a story, the safety filter will ensure the story does not contain harmful or inappropriate content before it is shown to the user.
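The idea above can be sketched in a few lines. This is a minimal, hypothetical keyword-based filter for illustration only; the blocklist terms and function name are assumptions, and real safety filters use trained classifiers rather than keyword matching.

```python
import re

# Hypothetical blocklist for illustration; real systems use ML classifiers.
BLOCKLIST = {"toxic-phrase", "disallowed-instruction"}

def safety_filter(output: str) -> bool:
    """Return True if the model output passes the filter (is allowed)."""
    tokens = set(re.findall(r"[\w-]+", output.lower()))
    # Block the output if any token appears on the blocklist.
    return not (tokens & BLOCKLIST)

print(safety_filter("A friendly bedtime story."))        # allowed
print(safety_filter("This contains a toxic-phrase."))    # blocked
```

The filter sits between the model and the user: outputs that fail the check are withheld or replaced, much as the bouncer analogy suggests.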
- Software regulated as a medical device.
- AI capable of performing most intellectual tasks humans can.
- Existential risk from AI systems.
- Incremental capability growth.
- Isolating AI systems.
- Signals indicating dangerous behavior.
- Ensuring AI allows shutdown.
- Tendency to gain control/resources.
- International agreements on AI.
- Restricting distribution of powerful models.
- Research ensuring AI remains safe.