From r to Q∗: Your Language Model is Sec 04月日, 2024 Maomei Showing that the two methods of alignment are in fact identical in some sense.