AI Glossary: Safety, Alignment & Evaluation
These concepts are critical for responsible AI product development—understanding failure modes, safety measures, and evaluation approaches directly informs design decisions. Safety Concepts AI Safety The interdisciplinary field preventing accidents, misuse, or harmful AI consequences. Encompasses alignment research, risk monitoring, robustness, and developing norms for beneficial AI operation. Reference: Amodei,