Perfect alignment between AI and human values is mathematically impossible, study says

Perfect AI alignment with human values and interests is mathematically impossible, according to a study, but behavioral diversity among AI agents offers the promise of some control. Published in PNAS Nexus, Hector Zenil and colleagues used Gödel's incompleteness theorem and Turing's undecidability result for the Halting Problem to show that any LLM complex enough to exhibit general intelligence or superintelligence will also be computationally irreducible and produce unpredictable behavior, making forced alignment impossible.


1 w.
Science
ID: -2151556155054947351


Similar News expand_more


Science
Education
Technology
Science
Science
Technology
Technology
Technology
Science
Space
Technology
Space
Science
Sport
Science
Science
Science
Technology
Sport
Science
Technology
Science
Science
Politics
Technology
Technology
Military
Science
Science
Economics
Science
Economics
Technology
Politics
Science
Science
Science
Education
Education
Science
Space
Science
Technology
Science
Science
Education
Technology
Technology
Education
Popular countries based on strong economic and political relations

Add Watch Country

arrow_drop_down