WeiYa's Work Yard

A dog, who fell into the ocean of statistics, tries to write down his ideas and notes to save himself.

Perference Matching in RLHF

Posted on (Update: )
Tags: Large Language Model

This is the note for the talk Statistical Inference in Large Language Models: Alignment and Copyright given by Weijie Su at JSM 2024

image

image

image


Published in categories JSM-2024