Game-theoretic safety guarantees for advanced AI systems
Using game theory to provide provable guarantees to mitigate misaligned behaviours and maintain control in multi-agent scenarios.
As information systems become increasingly AI-centric and autonomous, traditional security frameworks no longer adequately address questions of safety, control and privacy, especially in situations where multiple AI agents collaborate autonomously. Canada CIFAR AI Chair Zhijing Jin proposes using game theory, a robust theoretical framework, to provide provable guarantees to mitigate misaligned behaviours and offer concrete tools for policymakers and AI developers to maintain control in multi-agent scenarios.
Collaborators
Zhijing Jin
Canada CIFAR AI Chair, Vector Institute; University of Toronto
David Lie
University of Toronto
Related Research
AI Alignment Project

