Democratic Alignment of LLMs Through Economic Theory: Relative Preferences and Strategic Coordination
Developing a framework for democratic alignment using economic theory to address the limitations of current AI alignment methods.
This project addresses the limitations of current AI alignment methods, which often struggle to represent diverse societal values and resist strategic manipulation. By applying economic theory and mechanism design, the team will develop a framework for democratic alignment that moves beyond simple rankings to capture the intensity of human preferences. Using techniques like “quadratic voting,” the researchers aim to create protocols that empower communities to steer model behaviour while protecting against harmful exploitation. Ultimately, this work provides a blueprint for safer, more accountable AI systems that respect pluralistic perspectives and foster democratic resilience.
Collaborators
Elliot Creager
Rohit Lamba
Clemens Possnig


