Coherent Care
Alignment Forum
I've been trying to gather my thoughts for my next tiling theorem (agenda write-up here; first paper; second paper; recent project update). I have a lot of ideas for how to improve upon my work so far, and narrowing them down to an achievable next step has been difficult. However, my mind keeps returning to specific friends who are not yet convinced of Updateless Decision Theory (UDT). I am not out to argue that UDT is the perfect decision theory; see, e.g., here and here. Still, I strongly believe that those who don't see the appeal of UDT are missing something.